The performance datasets and the analysis notebooks for the publication "Learning from models: high-dimensional analyses on the performance of machine learning interatomic potentials"
The performance datasets data/. D124_log.csv corresponds to the data/. Detailed locations of data for these figures can be found in the analysis notebooks.
Note: the N-optimal_random_variables.png in data/ is a figure mentioned in Notebook 04, which corresponds to an analysis not included in the publication.
Each notebook corresponds to an analysis in the paper:
- Notebook 01: Figure 4, section 2.3
- Notebook 02: Figure 5, Supplementary Figure S5 and S6, section 2.4
- Notebook 03: Figure 7, Supplementary Figure S8, S9, S10, and S11, section 2.6
- Notebook 04: Related to Supplementary Figure S7
Several algorithms and methods are from online, including the algorithms to compute Pareto fronts, inverted generational distance, and the Cholesky method. Their websites are in corresponding files and notebooks. References about other packages or algorithms are in the Methods of the paper.
If you use the datasets or the analyses extensively, you may want to cite the following publication:
Y. Liu, Y. Mo, Learning from models: high-dimensional analyses on the performance of machine learning interatomic potentials. (ready to submit)