Dataset

discrepencies_and_error_metrics_NPJ_2023_interstitial_enhanced_training_set



Download Dataset XYZ file

Name discrepencies_and_error_metrics_NPJ_2023_interstitial_enhanced_training_set
Extended ID discrepencies_and_error_metrics_NPJ_2023_interstitial_enhanced_training_set_LiuHeMo__DS_nublbp38wse0_0
Description Structures from discrepencies_and_error_metrics_NPJ_2023 training set, enhanced by inclusion of interstitials. The full discrepencies_and_error_metrics_NPJ_2023 dataset includes the original mlearn_Si_train dataset, modified with the purpose of developing models with better diffusivity scores by replacing ~54% of the data with structures containing migrating interstitials. The enhanced validation set contains 50 total structures, consisting of 20 structures randomly selected from the 120 replaced structures of the original training dataset, 11 snapshots with vacancy rare events (RE) from AIMD simulations, and 19 snapshots with interstitial RE from AIMD simulations. We also construct interstitial-RE and vacancy-RE testing sets, each consisting of 100 snapshots of atomic configurations with a single migrating vacancy or interstitial, respectively, from AIMD simulations at 1230 K.
Authors Yunsheng Liu
Xingfeng He
Yifei Mo
DOI 10.60732/f0a44294
https://commons.datacite.org/doi.org/10.60732/f0a44294
https://doi.datacite.org/dois/10.60732%2Ff0a44294
https://doi.org/10.60732/f0a44294

Cite as: Liu, Y., He, X., and Mo, Y. "discrepencies and error metrics NPJ 2023 interstitial enhanced training set." ColabFit, 2023. https://doi.org/10.60732/f0a44294.
For other citation formats, see the DataCite Fabrica page for this dataset.
Elements Si (100.0%)
Number of Data Objects 218
Number of Configurations 218
Number of Atoms 13,629
Links https://github.com/mogroupumd/Silicon_MLIP_datasets
https://doi.org/10.1038/s41524-023-01123-3
Configuration Sets by Name (None)
Configuration Sets by ID (None)
Data Objects
ColabFit ID DS_nublbp38wse0_0
Files colabfitspec.json

No uploaded content is transferred in ownership from the original creators to ColabFit. All content is distributed under the license specified by its contributor who has stated that he or she has the authority to share it under the specified license.