@@ -8,8 +8,9 @@ executed activities in the index.
88For example, for an ongoing case ` A-B-F-T-W-S-G-T-D ` , after building the 5-gram index, the state would be computed
99by searching in the index with the sequence ` [W, S, G, T, D] ` .
1010
11- This approach has been submitted as a publication to VLDB 2025 under the title "Efficient Online Computation of Business
12- Process State From Trace Prefixes via N-Gram Indexing", by David Chapela-Campa and Marlon Dumas.
11+ This approach has been submitted as a publication to IEEE Transactions on Services Computing under the title "Efficient
12+ Online Computation of Business Process State From Trace Prefixes via N-Gram Indexing", by David Chapela-Campa and
13+ Marlon Dumas.
1314
1415## Requirements
1516
@@ -47,7 +48,7 @@ ongoing_state = n_gram_index.get_best_marking_state_for(n_gram)
4748
4849## Evaluation Reproducibility
4950
50- The scripts with a name starting with ` vldb25_ ` under folder ` tests/ ` contain the necessary code to reproduce the
51+ The scripts with a name starting with ` ieeetsc_ ` under folder ` tests/ ` contain the necessary code to reproduce the
5152evaluation presented in the publication.
5253Most of them are only necessary to preprocess the original datasets.
5354This data is already available in this [ Zenodo repository] ( doi.org/10.5281/zenodo.11409897 ) .
5859
5960The evaluation scripts depend on two versions of PM4PY:
6061
61- 1 . To run the script ` vldb25_compute_states .py` where the prefix-alignment technique is used, the requirement is a
62+ 1 . To run the script ` ieeetsc_compute_states .py` where the prefix-alignment technique is used, the requirement is a
6263 package with a PM4PY fork implemented by Daniel Schuster
6364 ([ repo] ( https://github.com/fit-daniel-schuster/online_process_monitoring_using_incremental_state-space_expansion_an_exact_algorithm/ ) ).
6465 Download the project from the corresponding repository and specify its path in the ` pyproject.toml ` file in the
@@ -69,22 +70,22 @@ The evaluation scripts depend on two versions of PM4PY:
6970### Synthetic Evaluation
7071
71721 . Install the project with the PM4PY version specified in point 1 (see above).
72- 2 . Comment the lines in the ` main() ` function in ` vldb25_compute_states .py` that run the state computation for real-life
73- logs, leaving only the calls to function ` compute_current_states() ` for the synthetic datasets.
73+ 2 . Comment the lines in the ` main() ` function in ` ieeetsc_compute_states .py` that run the state computation for
74+ real-life logs, leaving only the calls to function ` compute_current_states() ` for the synthetic datasets.
74753 . Run the script, obtaining the results with the computed states and runtimes (also the reachability graphs) for each
7576 proposal in the folder ` outputs ` .
76774 . Reinstall the project with the PM4PY version specified in point 2 (see above).
77- 5 . Run the script ` vldb25_compute_states_token_replay .py` , adding the token-based replay results to the previous result
78+ 5 . Run the script ` ieeetsc_compute_states_token_replay .py` , adding the token-based replay results to the previous result
7879 files.
79806 . Move these files to the folder ` results ` .
80- 7 . Run the script ` vldb25_exact_state_accuracy .py` , obtaining the accuracy results in the folder ` outputs ` .
81+ 7 . Run the script ` ieeetsc_exact_state_accuracy .py` , obtaining the accuracy results in the folder ` outputs ` .
8182
8283### Real-life Evaluation
8384
84851 . Install the project with the PM4PY version specified in point 1 (see above).
85- 2 . Comment the lines in the ` main() ` function in ` vldb25_compute_states .py` that run the state computation for synthetic
86- logs, leaving only the calls to function ` compute_current_states() ` for the real-life datasets.
86+ 2 . Comment the lines in the ` main() ` function in ` ieeetsc_compute_states .py` that run the state computation for
87+ synthetic logs, leaving only the calls to function ` compute_current_states() ` for the real-life datasets.
87883 . Run the script, obtaining the results with the computed states and runtimes for each proposal in the folder
8889 ` outputs ` .
89904 . Move this files to the folder ` results ` .
90- 5 . Run the script ` vldb25_next_activity_accuracy .py` , obtaining the accuracy results in the folder ` outputs ` .
91+ 5 . Run the script ` ieeetsc_next_activity_accuracy .py` , obtaining the accuracy results in the folder ` outputs ` .
0 commit comments