4.2 Article

Process mining: a two-step approach to balance between underfitting and overfitting

Journal

SOFTWARE AND SYSTEMS MODELING
Volume 9, Issue 1, Pages 87-111

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s10270-008-0106-z

Keywords

-

Ask authors/readers for more resources

Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One of the essential problems in process mining is that one cannot assume to have seen all possible behavior. At best, one has seen a representative subset. Therefore, classical synthesis techniques are not suitable as they aim at finding a model that is able to exactly reproduce the log. Existing process mining techniques try to avoid such overfitting by generalizing the model to allow for more behavior. This generalization is often driven by the representation language and very crude assumptions about completeness. As a result, parts of the model are overfitting (allow only for what has actually been observed) while other parts may be underfitting (allow for much more behavior without strong support for it). None of the existing techniques enables the user to control the balance between overfitting and underfitting. To address this, we propose a two-step approach. First, using a configurable approach, a transition system is constructed. Then, using the theory of regions, the model is synthesized. The approach has been implemented in the context of ProM and overcomes many of the limitations of traditional approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Software Engineering

OC-PM: analyzing object-centric event logs and process models

Alessandro Berti, Wil M. P. van der Aalst

Summary: Object-centric process mining is a branch of process mining that aims to analyze event data without forming exclusive event groups. This paper provides an overview of exploring and processing object-centric event logs to discover information related to the lifecycle of different objects. The paper also introduces a comprehensive tool support for implementing the proposed techniques.

INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER (2023)

Editorial Material Computer Science, Information Systems

Sustainable Systems Engineering Opportunities and Challenges

Wil M. P. van der Aalst, Oliver Hinz, Christof Weinhardt

BUSINESS & INFORMATION SYSTEMS ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

A natural language querying interface for process mining

Luciana Barbieri, Edmundo Madeira, Kleber Stroeh, Wil van der Aalst

Summary: Despite recent advances in process mining, making it accessible to non-technical users remains a challenge. To democratize this technology, a natural language querying interface is proposed to allow users to retrieve relevant information and insights about their processes by asking questions in plain English. A reference architecture is proposed to support natural language questions and provide the right answers by integrating with existing process mining tools.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2023)

Editorial Material Computer Science, Information Systems

Welcome to the Era of ChatGPT et al. The Prospects of Large Language Models

Timm Teubner, Christoph M. Flath, Christof Weinhardt, Wil van der Aalst, Oliver Hinz

BUSINESS & INFORMATION SYSTEMS ENGINEERING (2023)

Article Computer Science, Information Systems

Explainable concept drift in process mining

Jan Niklas Adams, Sebastiaan J. van Zelst, Thomas Rose, Wil M. P. van der Aalst

Summary: This paper introduces a framework to extract concept drifts and their potential root causes from event data. It extracts time series describing process measures, detects concept drifts, and tests these drifts for correlation. The framework supports object-centric event data with multiple case notions, non-linear relationships, and an arbitrary number of process measures.

INFORMATION SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Performance-preserving event log sampling for predictive monitoring

Mohammadreza Fani Sani, Mozhgan Vazifehdoostirani, Gyunam Park, Marco Pegoraro, Sebastiaan J. van Zelst, Wil M. P. van der Aalst

Summary: Predictive process monitoring aims to estimate case or event features for running process instances in process mining. However, current methods for predictive monitoring require training complex machine learning models, which is inefficient. This paper proposes an instance selection procedure that improves training speed for prediction models while maintaining prediction accuracy.

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS (2023)

Article Mathematics

Object-Centric Process Mining: Unraveling the Fabric of Real Processes

Wil M. P. van der Aalst

Summary: Traditional process modeling and analysis methods focus on one type of object per event, but in reality, events often involve multiple objects of different types. Object-centric process mining takes a more holistic approach by considering multiple object types and events. This paper introduces object-centric event data and demonstrates their use in discovering, analyzing, and improving complex processes.

MATHEMATICS (2023)

Article Automation & Control Systems

Preserving complex object-centric graph structures to improve machine learning tasks in process mining

Jan Niklas Adams, Gyunam Park, Wil M. P. van der Aalst

Summary: This paper presents a method to preserve the graph structure of object-centric event data in process mining using machine learning techniques. Two different techniques are provided to preserve these structures, and their performances are evaluated in predictive process monitoring tasks. Different graph embedding techniques for object-centric event logs are compared for future research and deployment.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Detecting Complex Anomalous Behaviors in Business Processes: A Multi-perspective Conformance Checking Approach

Azadeh Sadat Mozafari Mehr, Renata M. de Carvalho, Boudewijn van Dongen

Summary: In recent years, organizations have been paying more attention to anomaly detection as anomalies in business processes can indicate system faults, inefficiencies, or even fraudulent activities. This paper introduces an approach that considers different perspectives of a business process simultaneously to detect complex anomalies such as spurious data processing and misusage of authorizations. The approach has been implemented in the ProM framework and evaluated using real-life data from a financial organization, showing its ability to detect anomalies related to multiple aspects of a business process.

PROCESS MINING WORKSHOPS, ICPM 2022 (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Monitoring Constraints in Business Processes Using Object-Centric Constraint Graphs

Gyunam Park, Wil M. P. van der Aalst

Summary: Constraint monitoring aims to detect constraint violations in business processes by analyzing event data. However, existing techniques assume a single case notion and fail to consider the interaction of multiple objects in the processes. In this work, we propose an approach using Object-Centric Constraint Graphs (OCCGs) to represent and evaluate constraints in object-centric business processes. We conducted case studies with a real-life SAP ERP system to validate the proposed approach.

PROCESS MINING WORKSHOPS, ICPM 2022 (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Creating Translucent Event Logs to Improve Process Discovery

Harry H. Beyel, Wil M. P. van der Aalst

Summary: Event logs capture information about executed activities, but not about enabled but not executed activities. Translucent event logs contain information on enabled activities. We propose two techniques to increase the availability of translucent event logs. The first technique records system states as snapshots and links them to events, adding information about enabled activities. The second technique uses a process model to add information about enabled activities to existing event logs, improving process discovery.

PROCESS MINING WORKSHOPS, ICPM 2022 (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Enhancing Data-Awareness of Object-Centric Event Logs

Alexandre Goossens, Johannes De Smedt, Jan Vanthienen, Wil M. P. van der Aalst

Summary: When multiple objects are involved in a process, object-centric event logs are an interesting development in process mining as they support the presence of multiple types of objects. However, the current object-centric event log formats do not fully support dynamic object attributes. This paper introduces the Data-aware OCEL format, along with an algorithm that automatically translates XES logs to this format, addressing the issues present in the existing formats.

PROCESS MINING WORKSHOPS, ICPM 2022 (2023)

Proceedings Paper Computer Science, Interdisciplinary Applications

Comparing Ordering Strategies for Process Discovery Using Synthesis Rules

Tsung-Hao Huang, Wil M. P. van der Aalst

Summary: Process discovery aims to learn process models from observed behaviors in information systems. The discovered models serve as the starting point for process mining techniques to address performance and compliance problems. The algorithm applying synthesis rules from the free-choice net theory can discover process models with flexible structures while ensuring desirable properties. The proposed ordering strategy improves the quality of resulting process models and reduces computation time compared to frequency-based ordering strategy.

SERVICE-ORIENTED COMPUTING - ICSOC 2022 WORKSHOPS (2023)

Review Computer Science, Information Systems

An Abstraction-Based Approach for Privacy-Aware Federated Process Mining

Majid Rafiei, Wil M. P. van der Aalst

Summary: Process awareness is crucial for business success, and process mining is a powerful tool to discover and analyze actual business processes using event data. However, privacy concerns in inter-organizational settings hinder the sharing of event data and processes between partner organizations. This paper proposes an abstraction-based approach to enable privacy-aware process mining in such settings, addressing the challenges of collaboration and confidentiality.

IEEE ACCESS (2023)

Article Computer Science, Interdisciplinary Applications

Pattern-based action engine: Generating process management actions using temporal patterns of process-centric problems

Gyunam Park, Daniel Schuster, Wil M. P. van der Aalst

Summary: As business environments become more competitive, organizations strive to improve their business processes to reduce costs and increase quality and productivity. In this study, we propose an action engine that automatically generates action plans by analyzing monitoring results to address operational problems in business processes.

COMPUTERS IN INDUSTRY (2023)

No Data Available