CRICOS No. 000213J a university for the world real R Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure Adriano Augusto, Raffaele Conforti, Marlon Dumas, Marcello La Rosa, and Giorgio Bruno

Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Page 1: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Adriano Augusto, Raffaele Conforti, Marlon Dumas, Marcello La Rosa, and Giorgio Bruno

Page 2: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Automated Process Discovery

CID   | Task                       | Time Stamp          | …
13219 | Enter Loan Application     | 2007-11-09T11:20:10 | -
13219 | Retrieve Applicant Data    | 2007-11-09T11:22:15 | -
13220 | Enter Loan Application     | 2007-11-09T11:22:40 | -
13219 | Compute Installments       | 2007-11-09T11:22:45 | -
13219 | Notify Eligibility         | 2007-11-09T11:23:00 | -
13219 | Approve Simple Application | 2007-11-09T11:24:30 | -
13220 | Compute Installments       | 2007-11-09T11:24:35 | -
…     | …                          | …                   | …
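As a minimal sketch of how a discovery algorithm might read such a log, the rows above can be grouped by case ID into traces, from which directly-follows pairs are counted. The names `EVENTS`, `build_traces` and `directly_follows` are illustrative assumptions, not part of any tool named in the slides, and only the seven visible rows are used.

```python
from collections import defaultdict
from datetime import datetime

# Visible rows of the log excerpt: (case ID, task, timestamp).
# Task spelling is normalised to "Compute Installments".
EVENTS = [
    ("13219", "Enter Loan Application", "2007-11-09T11:20:10"),
    ("13219", "Retrieve Applicant Data", "2007-11-09T11:22:15"),
    ("13220", "Enter Loan Application", "2007-11-09T11:22:40"),
    ("13219", "Compute Installments", "2007-11-09T11:22:45"),
    ("13219", "Notify Eligibility", "2007-11-09T11:23:00"),
    ("13219", "Approve Simple Application", "2007-11-09T11:24:30"),
    ("13220", "Compute Installments", "2007-11-09T11:24:35"),
]


def build_traces(events):
    """Group events by case ID and order each case by timestamp."""
    by_case = defaultdict(list)
    for case_id, task, stamp in events:
        by_case[case_id].append((datetime.fromisoformat(stamp), task))
    return {case: [task for _, task in sorted(rows)]
            for case, rows in by_case.items()}


def directly_follows(traces):
    """Count how often task b immediately follows task a within a case."""
    counts = defaultdict(int)
    for tasks in traces.values():
        for a, b in zip(tasks, tasks[1:]):
            counts[(a, b)] += 1
    return dict(counts)


traces = build_traces(EVENTS)
df = directly_follows(traces)
for case, tasks in traces.items():
    print(case, " -> ".join(tasks))
```

Each trace is the ordered task sequence of one case; the interleaving of cases 13219 and 13220 in the raw log disappears once events are grouped.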

Page 3: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Quality Dimensions

Process Discovery

Fitness

Precision

Generalization

Complexity

Page 4: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Quality Dimensions

Process Discovery

Fitness

Page 5: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Quality Dimensions

Process Discovery

Fitness

Precision

Page 6: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Quality Dimensions

Process Discovery

Fitness

Precision

Generalization

Page 7: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Quality Dimensions

Process Discovery

Fitness

Precision

Generalization

Complexity
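The slides only name the four dimensions. As a rough illustration, and assuming a heavily simplified setting in which the model's behaviour is given as a finite set of traces, fitness and precision can be sketched as set comparisons. The function names and the example data are hypothetical; real tools rely on replay-based or alignment-based measures, and generalization and complexity are not covered here.

```python
def trace_fitness(log, model_traces):
    """Share of log traces that the model can reproduce exactly."""
    return sum(1 for trace in log if trace in model_traces) / len(log)


def trace_precision(log, model_traces):
    """Share of the model's traces that were actually observed in the log."""
    observed = set(log)
    return len(model_traces & observed) / len(model_traces)


# Hypothetical log (one tuple per case) and hypothetical model behaviour.
log = [("a", "b", "c"), ("a", "c", "b"), ("a", "b", "c"), ("a", "d")]
model_traces = {("a", "b", "c"), ("a", "c", "b"), ("a", "b", "b", "c")}

fitness = trace_fitness(log, model_traces)      # 3 of 4 log traces fit
precision = trace_precision(log, model_traces)  # 2 of 3 model traces observed
print(fitness, precision)
```

In this toy setting a model that allows everything would reach perfect fitness but poor precision, which is the tension the slides return to when comparing discovery algorithms.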

Page 8: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms: The Two Worlds

High-Fitness, High-Precision

High-Fitness, Low-Complexity

Page 9: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms: The Two Worlds

High-Fitness, High-Precision

Heuristics Miner

Fodina Miner

High-Fitness, Low-Complexity

Page 10: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Model discovered with Heuristics Miner

Page 11: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms: The Two Worlds

High-Fitness, High-Precision

Heuristics Miner

Fodina Miner

High-Fitness, Low-Complexity

Page 12: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms: The Two Worlds

High-Fitness, High-Precision

Heuristics Miner

Fodina Miner

High-Fitness, Low-Complexity

Inductive Miner

Evolutionary Tree Miner

Page 13: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Model discovered with Inductive Miner

• Structured by construction
• Based on process tree
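To illustrate why a model based on a process tree is structured by construction, here is a minimal sketch of a loop-free process tree and the traces it allows. The tuple encoding, the operator names `seq` and `xor`, and the function `language` are assumptions made for this example; parallel and loop operators are left out, and this is not the Inductive Miner itself.

```python
from itertools import product


def language(tree):
    """Return the set of traces of a process tree.

    A tree is either an activity name (str) or a tuple
    (operator, child, child, ...) with operator "seq" or "xor".
    """
    if isinstance(tree, str):
        return {(tree,)}
    operator, *children = tree
    child_languages = [language(child) for child in children]
    if operator == "xor":
        # Exclusive choice: any trace of any one child.
        return set().union(*child_languages)
    if operator == "seq":
        # Sequence: one trace per child, concatenated in order.
        return {sum(combo, ()) for combo in product(*child_languages)}
    raise ValueError(f"unsupported operator: {operator}")


# Hypothetical tree: a, then either b or c, then d.
tree = ("seq", "a", ("xor", "b", "c"), "d")
print(sorted(language(tree)))
```

Because every operator node opens and closes its own block, each split in the resulting model has a matching join, so no unstructured fragment can arise.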

Page 14: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms

High-Fitness, High-Precision

Low-Complexity

Page 15: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Discovery Algorithms

High-Fitness, High-Precision

Low-Complexity

Structured Miner

Page 16: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Process Model discovered with Structured Miner

Page 17: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Discover and Structure: A Two-Phase Approach

• Phase One: discover a process model focussing on fitness and precision, without constraints on its structure, for example using Heuristics Miner or Fodina Miner.

• Phase Two: simplify the discovered process model by structuring it a posteriori.

Page 18: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Phase Two: Structuring

Discover the RPST of the model

Process Fragment:
• Trivial (T) – single edge
• Polygon (P) – sequence of fragments
• Bond (B) – set of fragments sharing two nodes
• Rigid (R) – none of the above cases
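The fragment types can be sketched as a small tree data structure. The check below follows the usual reading that a model is structured when its RPST contains no rigid fragment, which is why rigids are what a structuring phase has to rewrite. The class names and the example tree are hypothetical, trivial fragments are omitted for brevity, and computing the RPST from a model is not shown.

```python
from dataclasses import dataclass, field
from enum import Enum


class Kind(Enum):
    TRIVIAL = "T"
    POLYGON = "P"
    BOND = "B"
    RIGID = "R"


@dataclass
class Fragment:
    name: str
    kind: Kind
    children: list = field(default_factory=list)


def rigids(fragment):
    """Collect the names of all rigid fragments in the tree, top-down."""
    found = [fragment.name] if fragment.kind is Kind.RIGID else []
    for child in fragment.children:
        found.extend(rigids(child))
    return found


def is_structured(root):
    """A model is treated as structured when its RPST has no rigid."""
    return not rigids(root)


# Hypothetical RPST: a polygon containing a bond whose branches are two
# polygons, one of which contains a rigid.
rpst = Fragment("P1", Kind.POLYGON, [
    Fragment("B1", Kind.BOND, [
        Fragment("P2", Kind.POLYGON),
        Fragment("P3", Kind.POLYGON, [Fragment("R1", Kind.RIGID)]),
    ]),
])
print(rigids(rpst), is_structured(rpst))
```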

Page 19: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Phase Two: Structuring

Discover the RPST of the model

[Figure: example process model and its RPST, with fragments P1, B1, P2, P3 and R1]

Page 20: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Phase Two: Structuring

Discover the RPST of the model

Structure sound AND-homogeneous or heterogeneous rigids using BPStruct (Polyvyanyy 2014)

Page 21: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Phase Two: Structuring

Discover the RPST of the model

Structure sound AND-homogeneous or heterogeneous rigids using BPStruct (Polyvyanyy 2014)

Structure XOR-homogeneous and unsound rigids using the extended Oulsnam algorithm

Page 22: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Oulsnam’s Algorithm Extended for BPMN Process Models

• Injection

Page 23: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Oulsnam’s Algorithm Extended for BPMN Process Models

• Push-Down
  – Push the gateway causing the injection downstream
  – Duplicate everything between the gateway causing the injection and the downstream gateway

Page 24: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Oulsnam’s Algorithm Extended for BPMN Process Models

• Ejection

Page 25: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Oulsnam’s Algorithm Extended for BPMN Process Models

• Pull-Up
  – Pull the gateway causing the injection upstream
  – Duplicate everything between the gateway causing the injection and the upstream gateway

Page 26: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Evaluation Setup

• Real-life dataset: IBM (54 models) and SAP (545 models) collections

• Synthetic dataset: 20 models

• Generated three sets of logs, for a total of 619 logs

• We retained all logs for which Heuristics Miner produced an unstructured model: 129 logs

Page 27: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Evaluation Setup

• Four process discovery algorithms:
  – Inductive Miner
  – Evolutionary Tree Miner
  – Heuristics Miner
  – Structured Miner (on top of Heuristics Miner)

• Four quality dimensions:
  – Fitness
  – Precision
  – Generalization
  – Complexity

Page 28: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Evaluation Results

• Real-life datasets:

Page 29: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Evaluation Results

• Real-life datasets:

Page 30: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Heuristics Miner - Real-life Dataset

Page 31: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Inductive Miner - Real-life Dataset

Page 32: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Structured Miner - Real-life Dataset

Page 33: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Future Work

• Experiment with alternative discovery algorithms to explore alternative tradeoffs between model quality metrics

• Explore the option of sacrificing weak bisimilarity to obtain models with higher structuredness

• Use process model clone detection techniques to refactor duplicates introduced by the structuring phase

Page 34: Automated Discovery of Structured Process Models: Discover Structured vs Discover and Structure

Questions

?