Master’s Project: More on process mining.

More on Discovery Plug-ins for ProM.

Last week I talked about the discovery plug-ins for the tool ProM, which only take the event log as the input. I showed how to mine the control-flow perspective of a process only by mining the event log, using as example the alpha-algorithm. By doing this, you answer the question “How are the cases actually being executed?”.

These plug-ins are basically process mining algorithms. You give them the event-log as input, and then you ‘mine’ the event log for something. Which means you run the algorithm and it goes through the event log and returns something (for example: petri nets or control-flow perspective), depending on what kind of algorithm it is.

Now there’s other things you can mine with discovery plug-ins. You can mine case-related information about a process. By doing this, you answer the questions “What are the most frequent paths in the process?”, “Are there any loops?”, “Can I simplify the log by abstracting the most frequent paths?”.

You can mine organisational related information about a process. This answers questions like “How many people are involved in a specific case?”, “What is the communication structure and dependencies among people?”, etc.  You mine this with the social-network miner plug-in.

If you just want to know how many people are involved at all the cases in the log, you can simply use the log summary:

I’m still using the same example as the last post, which is the event log of the (one) process of a telephone repair company. You can see the different resources above. We have testers, solvers, and the system. So to answer the question, there’s 12 people involved in all the cases together. You can also use the inspector to inspect specific cases (process instances) at a time and check who was involved.

For the rest of the questions, you can use the social network plug-in. For example, let’s check if there are employees who outperform others. We can find out who is better at fixing defects. We first filter the event log to only show us the relevant tasks on solvers. Then we run the plug-in to mine for a handover-of-work social network. We get this:

The graph shows which employees handed over work to other employees in the cases(process instances). SolverS3 and solverC3 have the best performance. Because the telephones that they fix, always pass the tests, and therefore, are not resent to the repair department, and therefore are not resent to other solvers. That’s why you don’t see arrows going out of these two employees, only arrows coming to them, which means they get work handed over from other solvers. As far as the oval shapes go, taken from the tutorial:

The oval shape of the nodes in the graph visually expresses the relation between the in and out degree of the connections (arrows) between these nodes. A higher proportion of in-going arcs lead to more vertical oval shapes while higher proportions of outgoing arcs produce more horizontal oval shapes. From this remark, can you tell
which employee has more problems to fi x the defects?

To answer the question from the tutorial, It seems that solverS2 and solverS1 have the most difficulty to fix the defects. Because they have the highest outgoing/ingoing degree of connection (both 5/1).

You can also use discovery plug-ins to use temporal logic to verify if the cases in a log  satisfy certain properties. An example to this from the tutorial:

We know that after a try to fix the defect, the telephone should be tested to check if it is indeed repaired.

Thus, we could use the “LTL Checker” plug-in to verify the property: Does the task “Test Repair” always happen after the tasks “Repair (Simple)” and before the task “Archive Repair”?

And so ends the tutorial, which gave us a brief overview on how some discovery plug-ins can be used for mining knowledge related to processes.

Some Literature

Apart from reading the Process Mining Manifesto, which gives you a general introduction to process mining. I’ve also read “Process Mining for the multi-faceted analysis of business processes—A case study in a financial services organization“. As the title says its a case study, so I got to read on the impact of process mining on real organizations.

Aside from the case specific content, the journal/article also talks about broad subjects such as the link between process mining and business intelligence. Process mining can be seen as the link between Business Intelligence and Business Process Management (BPM). Process mining occurs during the diagnosis phase of BPM. Normally, Business Process Analysis (BPA) and Business Activity Monitoring (BAM) are the techniques used on getting statistics on the business process, but process mining provides deeper insight, by going into the exact paths of execution. Process mining discovers, monitors and enhances processes by extracting knowledge from event logs.

Leave a comment