The ColabFit Exchange: Data for Advanced Materials Science

Welcome to the ColabFit Exchange! This is the world’s largest hub for the discovery, exploration and submission of datasets for the development of machine learning interatomic potentials (MLIPs) for materials science and chemistry. ColabFit datasets are carefully vetted and cleaned, and made available in a variety of standard formats including LMDB, Parquet and xyz. Content on the ColabFit Exchange is open source and freely available.

Datasets
 
Configuration Sets
 
Property Objects
 
Configurations
 
 

Advanced Search



Min
Max

Min
Max
Min
Max


Reset

Results: 419

23-Single-Element-DNPs_RSCDD_2023-Ag
Description : Configurations of Ag from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Ag
Num. Configurations : 3795
Num. Atoms : 104827
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Al
Description : Configurations of Al from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Al
Num. Configurations : 2572
Num. Atoms : 88139
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Au
Description : Configurations of Au from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Au
Num. Configurations : 3601
Num. Atoms : 89366
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Co
Description : Configurations of Co from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Co
Num. Configurations : 3356
Num. Atoms : 67320
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Cu
Description : Configurations of Cu from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Cu
Num. Configurations : 3366
Num. Atoms : 96568
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Ge
Description : Configurations of Ge from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Ge
Num. Configurations : 2895
Num. Atoms : 195270
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-I
Description : Configurations of I from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purpo...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : I
Num. Configurations : 4532
Num. Atoms : 115902
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Kr
Description : Configurations of Kr from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Kr
Num. Configurations : 2975
Num. Atoms : 97920
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Li
Description : Configurations of Li from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Li
Num. Configurations : 2536
Num. Atoms : 93724
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Mg
Description : Configurations of Mg from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Mg
Num. Configurations : 3004
Num. Atoms : 58567
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Mo
Description : Configurations of Mo from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Mo
Num. Configurations : 3718
Num. Atoms : 66612
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Nb
Description : Configurations of Nb from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Nb
Num. Configurations : 3246
Num. Atoms : 56191
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Ni
Description : Configurations of Ni from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Ni
Num. Configurations : 3817
Num. Atoms : 75534
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Os
Description : Configurations of Os from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Os
Num. Configurations : 4779
Num. Atoms : 117968
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Pb
Description : Configurations of Pb from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Pb
Num. Configurations : 5350
Num. Atoms : 119252
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Pd
Description : Configurations of Pd from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Pd
Num. Configurations : 3478
Num. Atoms : 140196
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Pt
Description : Configurations of Pt from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Pt
Num. Configurations : 2609
Num. Atoms : 62152
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Re
Description : Configurations of Re from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Re
Num. Configurations : 5029
Num. Atoms : 101248
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Sb
Description : Configurations of Sb from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Sb
Num. Configurations : 5529
Num. Atoms : 122289
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Sr
Description : Configurations of Sr from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Sr
Num. Configurations : 3155
Num. Atoms : 49426
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Ti
Description : Configurations of Ti from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Ti
Num. Configurations : 5665
Num. Atoms : 153659
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Zn
Description : Configurations of Zn from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Zn
Num. Configurations : 4052
Num. Atoms : 107039
Num. Elements : 1
23-Single-Element-DNPs_RSCDD_2023-Zr
Description : Configurations of Zr from Andolina & Saidi, 2023. One of 23 minimalist, curated sets of DFT-calculated properties for individual elements for the purp...
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Zr
Num. Configurations : 4730
Num. Atoms : 81165
Num. Elements : 1
23-Single-Element-DNPs_all_trajectories
Description : The full trajectories from the VASP runs used to generate the 23-Single-Element-DNPs training sets. Configuration sets are available for each element.
Authors : Christopher M. Andolina, Wissam A. Saidi
Elements : Ag, Al, Au, Co, Cu, Ge, I, Kr, Li, Mg, Mo, Nb, Ni, Os, P...
Num. Configurations : 108644
Num. Atoms : 2352424
Num. Elements : 23
3BPA_isolated_atoms
Description : Reference C, H, O, and N atoms from 3BPA, used to showcase the performance of linear atomic cluster expansion (ACE) force fields in a machine learning...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 4
Num. Atoms : 4
Num. Elements : 4
3BPA_test_1200K
Description : Test configurations with MD simulations performed at 1200K from 3BPA, used to showcase the performance of linear atomic cluster expansion (ACE) force ...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 2139
Num. Atoms : 57753
Num. Elements : 4
3BPA_test_300K
Description : Test configurations with MD simulations performed at 300K from 3BPA, used to showcase the performance of linear atomic cluster expansion (ACE) force f...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 1669
Num. Atoms : 45063
Num. Elements : 4
3BPA_test_600K
Description : Test configurations with MD simulations performed at 600K from 3BPA, used to showcase the performance of linear atomic cluster expansion (ACE) force f...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 2138
Num. Atoms : 57726
Num. Elements : 4
3BPA_test_dih_beta120
Description : Test configurations with fixed value for dihedral beta in alpha-gamma plane of 120 degreesfrom 3BPA dataset. Used to showcase the performance of line...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 2347
Num. Atoms : 63369
Num. Elements : 4
3BPA_test_dih_beta150
Description : Test configurations with fixed value for dihedral beta in alpha-gamma plane of 150 degreesfrom 3BPA dataset. Used to showcase the performance of line...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 2350
Num. Atoms : 63450
Num. Elements : 4
3BPA_test_dih_beta180
Description : Test configurations with fixed value for dihedral beta in alpha-gamma plane of 180 degreesfrom 3BPA dataset. Used to showcase the performance of line...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 2350
Num. Atoms : 63450
Num. Elements : 4
3BPA_train_300K
Description : Training configurations with MD simulations performed at 300K from 3BPA, used to showcase the performance of linear atomic cluster expansion (ACE) for...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 500
Num. Atoms : 13500
Num. Elements : 4
3BPA_train_mixed
Description : Training configurations with MD simulation performed at 300K, 600K and 1200K from 3BPA dataset, used to showcase the performance of linear atomic clus...
Authors : Dávid Péter Kovács, Cas van der Oord, Jiri Kucera,...
Elements : C, H, N, O
Num. Configurations : 500
Num. Atoms : 13500
Num. Elements : 4
ABC2D6-16_PRL_2018
Description : Dataset used to train a machine learning model to calculate density functional theory-quality formation energies of all ~2 x 106 pristine ABC2D6 elpas...
Authors : Felix Faber, Alexander Lindmaa, O. Anatole von Lil...
Elements : Al, Ar, As, B, Ba, Be, Bi, Br, C, Ca, Cl, Cs, F, Ga, Ge,...
Num. Configurations : 21882
Num. Atoms : 218820
Num. Elements : 39
AENET_amorphous_LiSi_JCP2021
Description : The amorphous LiSi data set comprises 45,169 atomic structures with compositions Li(x)Si (0.0≤x≤4.75) and the corresponding energies and interatomic f...
Authors : Michael S. Chen, Tobias Morawietz, Thomas E. Markl...
Elements : Li, Si
Num. Configurations : 44652
Num. Atoms : 5741142
Num. Elements : 2
AENET_liquid_water_dataset_JCP2021
Description : The water data set comprises energies and forces of 9,189 condensed-phase structures. The data was obtained in an iterative procedure described in det...
Authors : Michael S. Chen, Tobias Morawietz, Thomas E. Markl...
Elements : H, O
Num. Configurations : 9189
Num. Atoms : 1788288
Num. Elements : 2
AFF_JCP_2022
Description : Approximately 145,000 configurations of alkane, aspirin, alpha-glucose and uracil, partly taken from the MD-17 dataset, used in training an 'Atomic Ne...
Authors : Hao Li, Musen Zhou, Jessalyn Sebastian, Jianzhong ...
Elements : C, H, N, O
Num. Configurations : 143770
Num. Atoms : 1911240
Num. Elements : 4
ANI-1
Description : ANI-1 is a dataset of 20 million conformations with calculated non-equilibrium energy values. The conformations are based on a subset of the GDB-11 da...
Authors : Justin S. Smith, Olexandr Isayev, Adrian E. Roitbe...
Elements : C, H, N, O
Num. Configurations : 24416306
Num. Atoms : 392606016
Num. Elements : 4
ANI-1x
Description : ANI-1x contains DFT calculations for approximately 5 million molecular conformations. From an initial training set, an active learning method was used...
Authors : Justin S. Smith, Roman Zubatyuk, Benjamin Nebgen, ...
Elements : C, H, N, O
Num. Configurations : 4956005
Num. Atoms : 75700481
Num. Elements : 4
ANI-2x-B973c-def2mTZVP
Description : ANI-2x-B973c-def2mTZVP is a portion of the ANI-2x dataset, which includes DFT-calculated energies for structures from 2 to 63 atoms in size containing...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 9643594
Num. Atoms : 146656635
Num. Elements : 7
ANI-2x-wB97MD3BJ-def2TZVPP
Description : ANI-2x-wB97MD3BJ-def2TZVPP is a portion of the ANI-2x dataset, which includes DFT-calculated energies for structures from 2 to 63 atoms in size contai...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 9650572
Num. Atoms : 146715621
Num. Elements : 7
ANI-2x-wB97MV-def2TZVPP
Description : ANI-2x-wB97MV-def2TZVPP is a portion of the ANI-2x dataset, which includes DFT-calculated energies for structures from 2 to 63 atoms in size containin...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 9650572
Num. Atoms : 146715621
Num. Elements : 7
ANI-2x-wB97X-631Gd
Description : ANI-2x-wB97X-631Gd is a portion of the ANI-2x dataset, which includes DFT-calculated energies for structures from 2 to 63 atoms in size containing H, ...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 9651712
Num. Atoms : 146736809
Num. Elements : 7
ANI-Al_NC2021-test
Description : Approximately 2800 configurations from a test dataset–one of a pair of train/test datasets of aluminum in crystal and melt phases, used for training a...
Authors : Justin S. Smith, Benjamin Nebgen, Nithin Mathew, J...
Elements : Al
Num. Configurations : 2872
Num. Atoms : 371645
Num. Elements : 1
ANI-Al_NC2021-train
Description : Approximately 2800 configurations from a train dataset–one of a pair of train/test datasets of aluminum in crystal and melt phases, used for training ...
Authors : Justin S. Smith, Benjamin Nebgen, Nithin Mathew, J...
Elements : Al
Num. Configurations : 2864
Num. Atoms : 375121
Num. Elements : 1
Ag-PBE_MSMSE_2021
Description : Approximately 7,600 configurations of Ag used as part of a training dataset for a DP-GEN-based ML model for a Ag-Au nanoalloy potential.
Authors : Yinan Wang, Xiaoyang Wang, Linfeng Zhang, Ben Xu, ...
Elements : Ag
Num. Configurations : 7608
Num. Atoms : 152318
Num. Elements : 1
AgAu-nanoalloy_MSMSE_2021
Description : Approximately 50,000 configurations of Au, Ag and AuAg used as part of a training dataset for a DP-GEN-based ML model for a Ag-Au nanoalloy potential.
Authors : Yinan Wang, Xiaoyang Wang, Linfeng Zhang, Ben Xu, ...
Elements : Ag, Au
Num. Configurations : 51771
Num. Atoms : 1188220
Num. Elements : 2
AgPd_NPJ_2021
Description : The dataset consists of energies, forces and virials for DFT-VASP-generated Ag-Pd systems. The data was used to fit an active learned dataset which wa...
Authors : Conrad W. Rosenbrock, Konstantin Gubaev, Alexander...
Elements : Ag, Pd
Num. Configurations : 1691
Num. Atoms : 14180
Num. Elements : 2
AlNiCu_AIP_2020
Description : This dataset is formed from two parts: single-species datasets for Al, Ni, and Cu from the NOMAD Encyclopedia and multi-species datasets that include ...
Authors : Berk Onat, Christoph Ortner, James R. Kermode
Elements : Al, Cu, Ni
Num. Configurations : 1017
Num. Atoms : 4650
Num. Elements : 3
AlNiTi_CMS_2019
Description : This dataset was generated using the following active learning scheme: 1) candidate structures were relaxed by a partially-trained MTP model, 2) struc...
Authors : Konstantin Gubaev, Evgeny V. Podryabinkin, Gus L.W...
Elements : Al, Ni, Ti
Num. Configurations : 2684
Num. Atoms : 25067
Num. Elements : 3
Al_Cu_Mg_GSFE_JMPS2019
Description : Dataset from "Stress-dependence of generalized stacking fault energies":DFT calculations of generalized stacking fault energies (GSFE) for Al, Cu, and...
Authors : Binglun Yin, Predrag Andric, W. A. Curtin
Elements : Al, Cu, Mg
Num. Configurations : 273
Num. Atoms : 3264
Num. Elements : 3
Alexandria_geometry_optimization_paths_PBE_1D
Description : The Alexandria Materials Database contains theoretical crystal structures in 1D, 2D and 3D discovered by machine learning approaches using DFT with PB...
Authors : Jonathan Schmidt, Noah Hoffmann, Hai-Chen Wang, Pe...
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl, Co...
Num. Configurations : 614833
Num. Atoms : 6062475
Num. Elements : 74
Alexandria_geometry_optimization_paths_PBE_2D
Description : The Alexandria Materials Database contains theoretical crystal structures in 1D, 2D and 3D discovered by machine learning approaches using DFT with PB...
Authors : Jonathan Schmidt, Noah Hoffmann, Hai-Chen Wang, Pe...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 11742482
Num. Atoms : 118265549
Num. Elements : 84
Alexandria_geometry_optimization_paths_PBE_3D
Description : The Alexandria Materials Database contains theoretical crystal structures in 1D, 2D and 3D discovered by machine learning approaches using DFT with PB...
Authors : Jonathan Schmidt, Noah Hoffmann, Hai-Chen Wang, Pe...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 106825218
Num. Atoms : 1313552132
Num. Elements : 89
Au-PBE_MSMSE_2021
Description : Approximately 20,000 configurations of Au used as part of a training dataset for a DP-GEN-based ML model for a Ag-Au nanoalloy potential.
Authors : Yinan Wang, Xiaoyang Wang, Linfeng Zhang, Ben Xu, ...
Elements : Au
Num. Configurations : 19434
Num. Atoms : 310792
Num. Elements : 1
BA10-18
Description : Dataset (DFT-10B) contains structures of the 10 binary alloys AgCu, AlFe, AlMg, AlNi, AlTi, CoNi, CuFe, CuNi, FeV, and NbNi. Each alloy system include...
Authors : Chandramouli Nyshadham, Matthias Rupp, Brayden Bek...
Elements : Ag, Al, Co, Cu, Fe, Mg, Nb, Ni, Ti, V
Num. Configurations : 15920
Num. Atoms : 116380
Num. Elements : 10
BOTnet_ACAC_2022_Dihedral_scan
Description : Dihedral scan about one of the C-C bonds of the conjugated system. Acetylacetone dataset generated from a long molecular dynamics simulation at 300 K ...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 45
Num. Atoms : 675
Num. Elements : 3
BOTnet_ACAC_2022_H_transfer
Description : NEB path of proton transfer reaction between the two forms of acetylacetone. Acetylacetone dataset generated from a long molecular dynamics simulation...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 15
Num. Atoms : 225
Num. Elements : 3
BOTnet_ACAC_2022_isolated
Description : Energies of the isolated atoms evalauted at the reference DFT settings. Acetylacetone dataset generated from a long molecular dynamics simulation at 3...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 3
Num. Atoms : 3
Num. Elements : 3
BOTnet_ACAC_2022_test_300K_MD
Description : Test set of decorrelated geometries sampled from 300 K xTB MD. Acetylacetone dataset generated from a long molecular dynamics simulation at 300 K usin...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 650
Num. Atoms : 9750
Num. Elements : 3
BOTnet_ACAC_2022_test_600K_MD
Description : Test set of decorrelated geometries sampled from 600 K xTB MD. Acetylacetone dataset generated from a long molecular dynamics simulation at 300 K usin...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 650
Num. Atoms : 9750
Num. Elements : 3
BOTnet_ACAC_2022_train_300K_MD
Description : 500 decorrelated geometries sampled from 300 K xTB MD run. Acetylacetone dataset generated from a long molecular dynamics simulation at 300 K using a ...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 500
Num. Atoms : 7500
Num. Elements : 3
BOTnet_ACAC_2022_train_600K_MD
Description : 500 decorrelated geometries sampled from 600 K xTB MD run. Acetylacetone dataset generated from a long molecular dynamics simulation at 300 K using a ...
Authors : Ilyes Batatia, Simon Batzner, Dávid Péter Kovács, ...
Elements : C, H, O
Num. Configurations : 500
Num. Atoms : 7500
Num. Elements : 3
C7H10O2
Description : 6095 isomers of C7O2H10. Energetics were calculated at the G4MP2 level of theory.
Authors : Raghunathan Ramakrishnan, Pavlo Dral, Matthias Rup...
Elements : C, H, O
Num. Configurations : 6095
Num. Atoms : 115805
Num. Elements : 3
CA-9_BB_training
Description : Binning-binning configurations from CA-9 dataset used for training NNP_BB potential. CA-9 consists of configurations of carbon with curated subsets ch...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 20012
Num. Atoms : 1054055
Num. Elements : 1
CA-9_BB_validation
Description : Binning-binning configurations from CA-9 dataset used during validation step for NNP_BB potential. CA-9 consists of configurations of carbon with cura...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 4003
Num. Atoms : 233034
Num. Elements : 1
CA-9_BR_training
Description : Binning-random configurations from CA-9 dataset used for training NNP_BR potential. CA-9 consists of configurations of carbon with curated subsets cho...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 20013
Num. Atoms : 1072779
Num. Elements : 1
CA-9_BR_validation
Description : Binning-random configurations from CA-9 dataset used during validation step for NNP_BR potential. CA-9 consists of configurations of carbon with curat...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 4002
Num. Atoms : 214310
Num. Elements : 1
CA-9_RR_training
Description : Random-random configurations from CA-9 dataset used for training NNP_RR potential. CA-9 consists of configurations of carbon with curated subsets chos...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 20013
Num. Atoms : 1100042
Num. Elements : 1
CA-9_RR_validation
Description : Random-random configurations from CA-9 dataset used during validation step for NNP_RR potential. CA-9 consists of configurations of carbon with curate...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 4002
Num. Atoms : 218184
Num. Elements : 1
CA-9_test
Description : Test configurations from CA-9 dataset used to evaluate trained NNPs.CA-9 consists of configurations of carbon with curated subsets chosen to test the ...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 2727
Num. Atoms : 206302
Num. Elements : 1
CA-9_training
Description : Configurations from CA-9 dataset used for training NNP_CA-9 potential. CA-9 consists of configurations of carbon with curated subsets chosen to test t...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 40000
Num. Atoms : 2195399
Num. Elements : 1
CA-9_validation
Description : Configurations from CA-9 dataset used during validation step for NNP_CA-9 potential. CA-9 consists of configurations of carbon with curated subsets ch...
Authors : Daniel Hedman, Tom Rothe, Gustav Johansson, Fredri...
Elements : C
Num. Configurations : 8000
Num. Atoms : 436601
Num. Elements : 1
CGM-MLP_natcomm2023_Cr-C_deposition
Description : Training simulations from CGM-MLP_natcomm2023 of carbon deposition on a Cr surface. This dataset was one of the datasets used in training during the p...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cr
Num. Configurations : 1192
Num. Atoms : 298114
Num. Elements : 2
CGM-MLP_natcomm2023_Cu-C-O
Description : Training simulations from CGM-MLP_natcomm2023 of carbon on an oxygen-contaminated Cu surface. This dataset was one of the datasets used in training du...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu, O
Num. Configurations : 1717
Num. Atoms : 387151
Num. Elements : 3
CGM-MLP_natcomm2023_Cu-C-O_deposition
Description : Training simulations from CGM-MLP_natcomm2023 of carbon deposition on a Cu surface. This appears similar to CGM-MLP_natcomm2023_CU-C_deposition, as th...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 1694
Num. Atoms : 326328
Num. Elements : 2
CGM-MLP_natcomm2023_Cu-C_deposition
Description : Training simulations from CGM-MLP_natcomm2023 of carbon deposition on a Cu surface. This dataset was one of the datasets used in training during the p...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 1177
Num. Atoms : 204591
Num. Elements : 2
CGM-MLP_natcomm2023_Cu-C_metal_surface
Description : Training simulations from CGM-MLP_natcomm2023 of carbon on a Cu metal surface. This dataset was one of the datasets used in training during the proces...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 520
Num. Atoms : 122294
Num. Elements : 2
CGM-MLP_natcomm2023_GAP_20
Description : Carbon_GAP_20 dataset from CGM-MLP_natcomm2023. This dataset was one of the datasets used in training during the process of producing an active learni...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 6178
Num. Atoms : 400485
Num. Elements : 2
CGM-MLP_natcomm2023_Ti-C_deposition
Description : Training simulations from CGM-MLP_natcomm2023 of carbon deposition on a Ti surface. This dataset was one of the datasets used in training during the p...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Ti
Num. Configurations : 1309
Num. Atoms : 259636
Num. Elements : 2
CGM-MLP_natcomm2023_screening_amorphous_carbon_test
Description : 493 structures available from the GAP-20 database, excluding any structures present in the training set. This dataset was one of the datasets used in ...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C
Num. Configurations : 494
Num. Atoms : 32279
Num. Elements : 1
CGM-MLP_natcomm2023_screening_amorphous_carbon_train
Description : 2558 structures selected from the GAP-20 database. This dataset was one of the datasets used in testing screening parameters during the process of pro...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C
Num. Configurations : 2559
Num. Atoms : 168191
Num. Elements : 1
CGM-MLP_natcomm2023_screening_carbon-cluster@Cu_test
Description : 192 structures were uniformly selected from the AIMD simulation, excluding any structures that are part of the training set. This dataset was one of t...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 193
Num. Atoms : 38004
Num. Elements : 2
CGM-MLP_natcomm2023_screening_carbon-cluster@Cu_train
Description : 588 structures selected from the AIMD simulation of the Cu(111) slab, including both the C1-C18 clusters on the Cu(111) slab. This dataset was one of ...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 588
Num. Atoms : 115460
Num. Elements : 2
CGM-MLP_natcomm2023_screening_deposited-carbon@Cu_test
Description : 468 structures uniformly selected from the MD/tfMC simulation, excluding any structures that are part of the training set. This dataset was one of the...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 469
Num. Atoms : 156312
Num. Elements : 2
CGM-MLP_natcomm2023_screening_deposited-carbon@Cu_train
Description : 1090 structures uniformly selected from the MD/tfMC simulation during the training process of CGM-MLPs. This dataset was one of the datasets used in t...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C, Cu
Num. Configurations : 1091
Num. Atoms : 362898
Num. Elements : 2
CGM-MLP_natcomm2023_screening_graphite_train
Description : 40 graphite structures with different lattice constants ranging from 2.0 to 3.2 Å, with a 0.03 Å increment. This dataset was one of the datasets used ...
Authors : Di Zhang, Peiyun Yi, Xinmin Lai, Linfa Peng, Hao L...
Elements : C
Num. Configurations : 41
Num. Atoms : 1968
Num. Elements : 1
CHON_JCP_2020
Description : This dataset of molecular structures was extracted, using the NOMAD API, from all available structures in the NOMAD Archive that only include C, H, O,...
Authors : Berk Onat, Christoph Ortner, James R. Kermode
Elements : C, H, N, O
Num. Configurations : 5216
Num. Atoms : 96736
Num. Elements : 4
COHInPt_schaaf_2023
Description : Training and simulation data from machine learning force field model applied to steps of the hydrogenation of carbon dioxide to methanol over an indiu...
Authors : Lars Schaaf, Edvin Fako, Sandip De, Ansgar Schafer...
Elements : C, H, In, O, Pt
Num. Configurations : 1994
Num. Atoms : 163746
Num. Elements : 5
COLL_test
Description : Test set from COLL. Consists of configurations taken from molecular collisions of different small organic molecules. Energies and forces for 140,000 r...
Authors : Johannes Gasteiger, Shankari Giri, Johannes T. Mar...
Elements : C, H, O
Num. Configurations : 9480
Num. Atoms : 97886
Num. Elements : 3
COLL_train
Description : Training set from COLL. Consists of configurations taken from molecular collisions of different small organic molecules. Energies and forces for 140,0...
Authors : Johannes Gasteiger, Shankari Giri, Johannes T. Mar...
Elements : C, H, O
Num. Configurations : 120000
Num. Atoms : 1225350
Num. Elements : 3
COLL_validation
Description : Validation set from COLL. Consists of configurations taken from molecular collisions of different small organic molecules. Energies and forces for 140...
Authors : Johannes Gasteiger, Shankari Giri, Johannes T. Mar...
Elements : C, H, O
Num. Configurations : 10000
Num. Atoms : 101847
Num. Elements : 3
COMP6v2-B973c-def2mTZVP
Description : COMP6v2-B973c-def2mTZVP is the portion of COMP6v2 calculated at the B973c/def2mTZVP level of theory. COmprehensive Machine-learning Potential (COMP6) ...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 156330
Num. Atoms : 3786071
Num. Elements : 7
COMP6v2-wB97MD3BJ-def2TZVPP
Description : COMP6v2-wB97MD3BJ-def2TZVPP is the portion of COMP6v2 calculated at the wB97MD3BJ/def2TZVPP level of theory. COmprehensive Machine-learning Potential ...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 156353
Num. Atoms : 3787055
Num. Elements : 7
COMP6v2-wB97MV-def2TZVPP
Description : COMP6v2-wB97MV-def2TZVPP is the portion of COMP6v2 calculated at the wB97MV/def2TZVPP level of theory. COmprehensive Machine-learning Potential (COMP6...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 156369
Num. Atoms : 3787406
Num. Elements : 7
COMP6v2-wB97X-631Gd
Description : COMP6v2-wB97X-631Gd is the portion of COMP6v2 calculated at the wB97X/631Gd level of theory. COmprehensive Machine-learning Potential (COMP6) Benchmar...
Authors : Kate Huddleston, Roman Zubatyuk, Justin Smith, Adr...
Elements : C, Cl, F, H, N, O, S
Num. Configurations : 157728
Num. Atoms : 3897978
Num. Elements : 7
C_Gardner_2022
Description : Approximately 115,000 configurations of carbon with 200 atoms, with simulated melt, quench, reheat, then annealing at the noted temperature. Includes ...
Authors : John L. A. Gardner, Zoé Faure Beaulieu, Volker L. ...
Elements : C
Num. Configurations : 115206
Num. Atoms : 23041200
Num. Elements : 1
C_NPJ2020
Description : The dataset consists of energies and forces for monolayer graphene, bilayer graphene, graphite, and diamond in various states, including strained stat...
Authors : Mingjian Wen, Ellad B. Tadmor
Elements : C
Num. Configurations : 4776
Num. Atoms : 228852
Num. Elements : 1
Carbon_GAP_JCP_2020
Description : GAP-20 describes the properties of the bulk crystalline and amorphous phases, crystal surfaces, and defect structures with an accuracy approaching tha...
Authors : Patrick Rowe, Volker L. Deringer, Piero Gasparotto...
Elements : C
Num. Configurations : 17227
Num. Atoms : 1312457
Num. Elements : 1
Carbon_GAP_JCP_2020_train
Description : Training data generated for GAP-20. GAP-20 describes the properties of the bulk crystalline and amorphous phases, crystal surfaces, and defect structu...
Authors : Patrick Rowe, Volker L. Deringer, Piero Gasparotto...
Elements : C
Num. Configurations : 6088
Num. Atoms : 400275
Num. Elements : 1
Carbon_allotrope_multilayer_graphene_graphite_PRB2019
Description : The dataset consists of energies and forces for pristine and defected monolayer graphene, bilayer graphene, and graphite in various states. The confi...
Authors : Mingjian Wen, Ellad B. Tadmor
Elements : C
Num. Configurations : 14179
Num. Atoms : 656204
Num. Elements : 1
Carolina_Materials
Description : Carolina Materials contains structures used to train several machine learning models for the efficient generation of hypothetical inorganic materials....
Authors : Yong Zhao, Mohammed Al-Fahdi, Ming Hu, Edirisuriya...
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Cl, Co, Cr...
Num. Configurations : 214280
Num. Atoms : 3168502
Num. Elements : 64
Cationic_phenoxyimine_complexes_of_yttrium
Description : This dataset contains DFT calculations that were carried out in conjunction with experimental investigation of a cationic phenoxyimine yttrium complex...
Authors : Alexis D. Oswald, Ludmilla Verrieux, Pierre-Alain ...
Elements : Al, B, C, F, H, N, O, Si, Y
Num. Configurations : 109
Num. Atoms : 9074
Num. Elements : 9
Chig-AIMD_random_test
Description : Test configurations from the 'random' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. We seque...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 199000
Num. Atoms : 33034000
Num. Elements : 4
Chig-AIMD_random_train
Description : Training configurations from the 'random' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. We s...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 1592800
Num. Atoms : 264404800
Num. Elements : 4
Chig-AIMD_random_val
Description : Validation configurations from the 'random' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. We...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 199000
Num. Atoms : 33034000
Num. Elements : 4
Chig-AIMD_scaffold_test
Description : Test configurations from the 'scaffold' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. We seq...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 199000
Num. Atoms : 33034000
Num. Elements : 4
Chig-AIMD_scaffold_train
Description : Training configurations from the 'scaffold' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. We...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 1592800
Num. Atoms : 264404800
Num. Elements : 4
Chig-AIMD_scaffold_val
Description : Validation configurations from the 'scaffold' split of Chig-AIMD. This dataset covers the conformational space of chignolin with DFT-level precision. ...
Authors : Tong Wang, Xinheng He, Mingyu Li, Bin Shao, Tie-Ya...
Elements : C, H, N, O
Num. Configurations : 199000
Num. Atoms : 33034000
Num. Elements : 4
Co-Co_coupling_at_liquid_water-Cu(100)_interfaces_JC2021
Description : This dataset contains data from eight AIMD simulations run in VASP to study electrochemical *CO-*CO coupling -- coupling of two *CO molecules -- at th...
Authors : Henrik H. Kristoffersen, Karen Chan
Elements : C, Cs, Cu, H, Li, O
Num. Configurations : 1671203
Num. Atoms : 226264514
Num. Elements : 6
CoCrFeNiPd_MRL2020
Description : The dataset for "Origin of high strength in the CoCrFeNiPd high-entropy alloy", containing DFT-calculated values of the high-entropy alloy CoCrFeNiPd,...
Authors : Binglun Yin, W. A. Curtin
Elements : Co, Cr, Fe, Ni, Pd
Num. Configurations : 116
Num. Atoms : 8552
Num. Elements : 5
CoNbV_CMS2019
Description : This dataset was generated using the following active learning scheme: 1) candidate structures were relaxed by a partially-trained MTP model, 2) struc...
Authors : Konstantin Gubaev, Evgeny V. Podryabinkin, Gus L.W...
Elements : Co, Nb, V
Num. Configurations : 383
Num. Atoms : 2812
Num. Elements : 3
Co_dimer_JPCA_2022
Description : This dataset contains dimer molecules of Co(II) with potential energy calculations for structures with ferromagnetic and antiferromagnetic spin config...
Authors : Sijin Ren, Eric Fonseca, William Perry, Hai-Ping C...
Elements : C, Cl, Co, H, N, O, P, S
Num. Configurations : 2162
Num. Atoms : 188438
Num. Elements : 8
Co_dimer_JPCA_2022_train
Description : Training data only from the Co_dimer_JPCA_2022 dataset. This dataset contains dimer molecules of Co(II) with potential energy calculations for structu...
Authors : Sijin Ren, Eric Fonseca, William Perry, Hai-Ping C...
Elements : C, Cl, Co, H, N, O, P, S
Num. Configurations : 1798
Num. Atoms : 154882
Num. Elements : 8
ComBat
Description : DFT-optimized geometries and properties for Li-S electrolytes. These make up the Computational Database for Li-S Batteries (ComBat), calculated using ...
Authors : Rasha Atwi, Matthew Bliss, Maxim Makeev, Nav Nidhi...
Elements : C, F, H, Li, N, O, P, S, Si
Num. Configurations : 230
Num. Atoms : 5662
Num. Elements : 9
CrCoNi_Cao_2022
Description : Training dataset that captures chemical short-range order in equiatomic CrCoNi medium-entropy alloy published with our work Quantifying chemical short...
Authors : Yifan Cao, Killian Sheriff, Rodrigo Freitas
Elements : Co, Cr, Ni
Num. Configurations : 1257
Num. Atoms : 108684
Num. Elements : 3
CuPd_CMS2019
Description : This dataset was generated using the following active learning scheme: 1) candidate structures were relaxed by a partially-trained MTP model, 2) struc...
Authors : Konstantin Gubaev, Evgeny V. Podryabinkin, Gus L.W...
Elements : Cu, Pd
Num. Configurations : 522
Num. Atoms : 2450
Num. Elements : 2
Cu_FHI-aims_NPJCM_2021
Description : Approximately 46,000 configurations of copper, including small and bulk structures, surfaces, interfaces, point defects, and randomly modified variant...
Authors : Yury Lysogorskiy, Cas van der Oord, Anton Bochkare...
Elements : Cu
Num. Configurations : 46583
Num. Atoms : 308679
Num. Elements : 1
DAS_MLIP_CoSb_MgSb
Description : Approximately 850 configurations of CoSb3 and Mg3Sb2 generated using a dual adaptive sampling (DAS) method for use with machine learning of interatomi...
Authors : Hongliang Yang, Yifan Zhu, Erting Dong, Yabei Wu, ...
Elements : Mg, Sb
Num. Configurations : 846
Num. Atoms : 247744
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE0_MBD_benzene_test
Description : Benzene test PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and g...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 200
Num. Atoms : 5760
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE0_MBD_benzene_train
Description : Benzene training PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 1800
Num. Atoms : 49536
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE0_MBD_benzene_validation
Description : Benzene validation PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid,...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 200
Num. Atoms : 6072
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE0_MBD_glycine_test
Description : Glycine test PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and g...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 200
Num. Atoms : 6880
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE0_MBD_glycine_train
Description : Glycine training PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 3594
Num. Atoms : 109940
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE0_MBD_glycine_validation
Description : Glycine validation PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid,...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 200
Num. Atoms : 7120
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE0_MBD_succinic_acid_test
Description : Succinic acid test PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid,...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 200
Num. Atoms : 5600
Num. Elements : 3
DFT_polymorphs_PNAS_2022_PBE0_MBD_succinic_acid_train
Description : Succinic acid training PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 1800
Num. Atoms : 50400
Num. Elements : 3
DFT_polymorphs_PNAS_2022_PBE0_MBD_succinic_acid_validation
Description : Succinic acid validation PBE0-MBD dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 200
Num. Atoms : 5600
Num. Elements : 3
DFT_polymorphs_PNAS_2022_PBE_TS_benzene_test
Description : Benzene test PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and gly...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 1000
Num. Atoms : 29736
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE_TS_benzene_train
Description : Benzene training PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 55000
Num. Atoms : 1602048
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE_TS_benzene_validation
Description : Benzene validation PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H
Num. Configurations : 1000
Num. Atoms : 29712
Num. Elements : 2
DFT_polymorphs_PNAS_2022_PBE_TS_glycine_test
Description : Glycine test PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and gly...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 500
Num. Atoms : 17710
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE_TS_glycine_train
Description : Glycine training PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, and...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 29070
Num. Atoms : 952650
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE_TS_glycine_validation
Description : Glycine validation PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, N, O
Num. Configurations : 500
Num. Atoms : 17800
Num. Elements : 4
DFT_polymorphs_PNAS_2022_PBE_TS_succinic_acid_test
Description : Succinic acid test PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic acid, a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 500
Num. Atoms : 14000
Num. Elements : 3
DFT_polymorphs_PNAS_2022_PBE_TS_succinic_acid_train
Description : Succinic acid training PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic aci...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 29212
Num. Atoms : 817936
Num. Elements : 3
DFT_polymorphs_PNAS_2022_PBE_TS_succinic_acid_validation
Description : Succinic acid validation PBE-TS dataset from "Semi-local and hybrid functional DFT data for thermalised snapshots of polymorphs of benzene, succinic a...
Authors : Venkat Kapil, Edgar A. Engel
Elements : C, H, O
Num. Configurations : 500
Num. Atoms : 14000
Num. Elements : 3
DP-GEN_Cu
Description : Approximately 15,000 configurations of copper used to demonstrate the DP-GEN data generator for PES machine learning models.
Authors : Yuzhi Zhang, Haidi Wang, Weijie Chen, Jinzhe Zeng,...
Elements : Cu
Num. Configurations : 15286
Num. Atoms : 297597
Num. Elements : 1
DeePMD_SE
Description : 127,000 configurations from a dataset used to benchmark and train a modified DeePMD model called DeepPot-SE, or Deep Potential - Smooth Edition
Authors : Linfeng Zhang, Jiequn Han, Han Wang, Wissam A. Sai...
Elements : Al, C, Co, Cr, Cu, Fe, Ge, H, Mn, Mo, N, Ni, O, Pt, S, S...
Num. Configurations : 127112
Num. Atoms : 26278380
Num. Elements : 17
Fe_nanoparticles_PRB_2023
Description : This iron nanoparticles database contains dimers; trimers; bcc, fcc, hexagonal close-packed (hcp), simple cubic, and diamond crystalline structures. A...
Authors : Richard Jana, Miguel A. Caro
Elements : Fe
Num. Configurations : 198
Num. Atoms : 20097
Num. Elements : 1
FitSNAP_Fe_NPJ_2021
Description : About 2,500 configurations of alpha-Fe used in the training and testing of a ML model with the goal of building magneto-elastic machine-learning inter...
Authors : Svetoslav Nikolov, Mitchell A. Wood, Attila Cangi,...
Elements : Fe
Num. Configurations : 2517
Num. Atoms : 61526
Num. Elements : 1
Forces_are_not_enough
Description : Approximately 300,000 benchmarking configurations derived partly from the MD-17 and LiPS datasets, partly from original simulated water and alanine di...
Authors : Xiang Fu, Zhenghao Wu, Wujie Wang, Tian Xie, Sinan...
Elements : C, H, Li, N, O, P, S
Num. Configurations : 295001
Num. Atoms : 23735083
Num. Elements : 7
GDB_9_nature_2014
Description : 133,855 configurations of stable small organic molecules composed of CHONF. A subset of GDB-17, with calculations of energies, dipole moment, polariza...
Authors : Raghunathan Ramakrishnan, Pavlo O. Dral, Matthias ...
Elements : C, F, H, N, O
Num. Configurations : 133885
Num. Atoms : 2407753
Num. Elements : 5
GFN-xTB_JCIM_2021
Description : 10,000 configurations of organosilicon compounds with energies predicted by an improved GFN-xTB Hamiltonian parameterization, using revPBE.
Authors : Leonid Komissarov, Toon Verstraelen
Elements : Br, C, Cl, F, H, N, O, P, S, Si
Num. Configurations : 157367
Num. Atoms : 4022149
Num. Elements : 10
GST_GAP_22_extended
Description : The extended training dataset for GST_GAP_22, calculated using the PBEsol functional. New configurations, simulated under external electric fields, we...
Authors : Yuxing Zhou, Wei Zhang, Evan Ma, Volker L. Deringe...
Elements : Ge, Sb, Te
Num. Configurations : 2916
Num. Atoms : 399247
Num. Elements : 3
GST_GAP_22_main
Description : The main training dataset for GST_GAP_22, calculated using the PBEsol functional. GST-GAP-22 contains configurations of phase-change materials on the ...
Authors : Yuxing Zhou, Wei Zhang, Evan Ma, Volker L. Deringe...
Elements : Ge, Sb, Te
Num. Configurations : 2692
Num. Atoms : 341068
Num. Elements : 3
GST_GAP_22_refitted
Description : The training dataset for GST_GAP_22, recalculated using the PBE functional. GST-GAP-22 contains configurations of phase-change materials on the quasi-...
Authors : Yuxing Zhou, Wei Zhang, Evan Ma, Volker L. Deringe...
Elements : Ge, Sb, Te
Num. Configurations : 2692
Num. Atoms : 341004
Num. Elements : 3
HDNNP_H2O
Description : Approximately 28,000 configurations split into 4 datasets, each using a different functional, used in the training of a high-dimensional neural networ...
Authors : Tobias Morawietz, Jörg Behler
Elements : H, O
Num. Configurations : 28678
Num. Atoms : 2327628
Num. Elements : 2
HEA25S_high_entropy_alloys
Description : Dataset from "Surface segregation in high-entropy alloys from alchemical machine learning: dataset HEA25S". Includes 10000 bulk HEA structures (Datase...
Authors : Arslan Mazitov, Maximilian A. Springer, Nataliya L...
Elements : Ag, Au, Co, Cr, Cu, Fe, Hf, Ir, Lu, Mn, Mo, Nb, Ni, Pd, ...
Num. Configurations : 15004
Num. Atoms : 633387
Num. Elements : 25
HEA25_high_entropy_transition-metal_alloys
Description : Dataset from "Modeling high-entropy transition-metal alloys with alchemical compression". Includes 25,000 structures utilized for fitting the aforemen...
Authors : Nataliya Lopanitsyna, Guillaume Fraux, Maximilian ...
Elements : Ag, Au, Co, Cr, Cu, Fe, Hf, Ir, Lu, Mn, Mo, Nb, Ni, Pd, ...
Num. Configurations : 25627
Num. Atoms : 1063680
Num. Elements : 25
HME21_test
Description : The test set from HME21. The high-temperature multi-element 2021 (HME21) dataset comprises approximately 25,000 configurations, including 37 elements...
Authors : So Takamoto, Chikashi Shinagawa, Daisuke Motoki, K...
Elements : Ag, Al, Au, Ba, C, Ca, Cl, Co, Cr, Cu, F, Fe, H, In, Ir,...
Num. Configurations : 2495
Num. Atoms : 69572
Num. Elements : 37
HME21_train
Description : The training set from HME21. The high-temperature multi-element 2021 (HME21) dataset comprises approximately 25,000 configurations, including 37 elem...
Authors : So Takamoto, Chikashi Shinagawa, Daisuke Motoki, K...
Elements : Ag, Al, Au, Ba, C, Ca, Cl, Co, Cr, Cu, F, Fe, H, In, Ir,...
Num. Configurations : 19956
Num. Atoms : 555050
Num. Elements : 37
HME21_validation
Description : The validation set from HME21. The high-temperature multi-element 2021 (HME21) dataset comprises approximately 25,000 configurations, including 37 el...
Authors : So Takamoto, Chikashi Shinagawa, Daisuke Motoki, K...
Elements : Ag, Al, Au, Ba, C, Ca, Cl, Co, Cr, Cu, F, Fe, H, In, Ir,...
Num. Configurations : 2498
Num. Atoms : 69420
Num. Elements : 37
HO_LiMoNiTi_NPJCM_2020_LiMoNiTi_train
Description : Training configurations of Li8Mo2Ni7Ti7O32 from HO_LiMoNiTi_NPJCM_2020 used in the training of an ANN, whereby total energy is extrapolated by a Taylo...
Authors : April M. Cooper, Johannes Kästner, Alexander Urban...
Elements : Li, Mo, Ni, O, Ti
Num. Configurations : 824
Num. Atoms : 46144
Num. Elements : 5
HO_LiMoNiTi_NPJCM_2020_LiMoNiTi_validation
Description : Validation configurations of Li8Mo2Ni7Ti7O32 from HO_LiMoNiTi_NPJCM_2020 used in the training of an ANN, whereby total energy is extrapolated by a Tay...
Authors : April M. Cooper, Johannes Kästner, Alexander Urban...
Elements : Li, Mo, Ni, O, Ti
Num. Configurations : 1792
Num. Atoms : 100352
Num. Elements : 5
HO_LiMoNiTi_NPJCM_2020_bulk_water_train_test
Description : Training and testing configurations of bulk water from HO_LiMoNiTi_NPJCM_2020 used in the training of an ANN, whereby total energy is extrapolated by ...
Authors : April M. Cooper, Johannes Kästner, Alexander Urban...
Elements : H, O
Num. Configurations : 700
Num. Atoms : 134400
Num. Elements : 2
HO_LiMoNiTi_NPJCM_2020_bulk_water_validation
Description : Validation configurations of bulk water from HO_LiMoNiTi_NPJCM_2020 used in the training of an ANN, whereby total energy is extrapolated by a Taylor e...
Authors : April M. Cooper, Johannes Kästner, Alexander Urban...
Elements : H, O
Num. Configurations : 2112
Num. Atoms : 405504
Num. Elements : 2
HO_LiMoNiTi_NPJCM_2020_water_clusters
Description : Configurations of water clusters from HO_LiMoNiTi_NPJCM_2020 used in the training of an ANN, whereby total energy is extrapolated by a Taylor expansio...
Authors : April M. Cooper, Johannes Kästner, Alexander Urban...
Elements : H, O
Num. Configurations : 1848
Num. Atoms : 33264
Num. Elements : 2
HO_PNAS_2019
Description : 1590 configurations of H2O/water with total energy and forces calculated using a hybrid approach at DFT/revPBE0-D3 level of theory.
Authors : Bingqing Cheng, Edgar A. Engel, Jörg Behler, Chris...
Elements : H, O
Num. Configurations : 1588
Num. Atoms : 304896
Num. Elements : 2
HPt_NC_2022
Description : A training dataset of 90,000 configurations with interaction properties between H2 and Pt(111) surfaces.
Authors : Jonathan Vandermause, Yu Xie, Jin Soo Lim, Cameron...
Elements : H, Pt
Num. Configurations : 90740
Num. Atoms : 5706023
Num. Elements : 2
H_nature_2022
Description : Over 300,000 configurations in an expanded dataset of 19 hydrogen combustion reaction channels. Intrinsic reaction coordinate calculations (IRC) are c...
Authors : Xingyi Guan, Akshaya Das, Christopher J. Stein, Fa...
Elements : H, O
Num. Configurations : 350121
Num. Atoms : 1513654
Num. Elements : 2
HfO2_DPGEN_PRB_2021
Description : Approximately 28,500 configurations of hafnia (HfO2) used in the training of a DP model for the prediction of properties of various hafnia polymorphs,...
Authors : Jing Wu, Yuzhi Zhang, Linfeng Zhang, Shi Liu
Elements : Hf, O
Num. Configurations : 28564
Num. Atoms : 2741376
Num. Elements : 2
HfO2_NPJ_2020
Description : 6000 configurations of liquid and amorphous HfO2 generated for use with an active learning ML model.
Authors : Ganesh Sivaraman, Anand Narayanan Krishnamoorthy, ...
Elements : Hf, O
Num. Configurations : 6000
Num. Atoms : 576000
Num. Elements : 2
Hydrogen-induced_insulating_state_SmNiO3
Description : A dataset of DFT-calculated energies created to investigate the effect of hydrogen doping on the crystal structure and the electronic state in SmNiO3....
Authors : Kunihiko Yamauchi, Ikutaro Hamada
Elements : H, Ni, O, Sm
Num. Configurations : 3318
Num. Atoms : 156419
Num. Elements : 4
ISO17_NC_2017
Description : 129 molecules of composition C7O2H10 from the QM9 dataset with 5000 conformational geometries apiece. Molecular dynamics data was simulated using the ...
Authors : Jonathan Vandermause, Yu Xie, Jin Soo Lim, Cameron...
Elements : C, H, O
Num. Configurations : 640855
Num. Atoms : 12176245
Num. Elements : 3
In2Se3_2D_DPGEN
Description : Approximately 11,500 configurations of In2Se3, including monolayer (20-atom slab) and bulk (30-atom supercell) models.
Authors : Jing Wu, Liyi Bai, Jiawei Huang, Liyang Ma, Jian L...
Elements : In, Se
Num. Configurations : 11523
Num. Atoms : 248510
Num. Elements : 2
InP_JPCA2020
Description : This data set was used to generate a multi-element linear SNAP potential for InP, as published in Cusentino, M. A. et. al, J. Chem. Phys. (2020). Inte...
Authors : Mary Alice Cusentino, Mitchell A. Wood, Aidan P. T...
Elements : In, P
Num. Configurations : 1802
Num. Atoms : 106761
Num. Elements : 2
JARVIS-Polymer-Genome
Description : The JARVIS-Polymer-Genome dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contain...
Authors : Tran Doan Huan, Arun Mannodi-Kanakkithodi, Chiho K...
Elements : Al, C, Ca, Cd, Cl, F, H, Hf, Mg, N, O, Pb, S, Sn, Ti, Zn...
Num. Configurations : 1073
Num. Atoms : 34441
Num. Elements : 17
JARVIS-QM9-DGL
Description : The JARVIS-QM9-DGL dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains confi...
Authors : Raghunathan Ramakrishnan, Pavlo O. Dral, Matthias ...
Elements : C, F, H, N, O
Num. Configurations : 130831
Num. Atoms : 2358210
Num. Elements : 5
JARVIS_2DMatPedia
Description : The JARVIS-2DMatPedia dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This subset contains con...
Authors : Jun Zhou, Lei Shen, Miguel Dias Costa, Kristin A. ...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 6351
Num. Atoms : 66295
Num. Elements : 83
JARVIS_AGRA_CHO
Description : The JARVIS_AGRA_CHO dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This dataset contains ...
Authors : Zhi Wen Chen, Zachary Gariepy, Lixin Chen, Xue Yao...
Elements : C, Co, Cu, Fe, H, Mo, Ni, O
Num. Configurations : 216
Num. Atoms : 14472
Num. Elements : 8
JARVIS_AGRA_CO
Description : The JARVIS_AGRA_CO dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This dataset contains d...
Authors : Zhi Wen Chen, Zachary Gariepy, Lixin Chen, Xue Yao...
Elements : C, Co, Cu, Fe, Mo, Ni, O
Num. Configurations : 194
Num. Atoms : 12804
Num. Elements : 7
JARVIS_AGRA_COOH
Description : The JARVIS_AGRA_COOH dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This dataset contains...
Authors : Zhi Wen Chen, Zachary Gariepy, Lixin Chen, Xue Yao...
Elements : C, Co, Cu, Fe, H, Mo, Ni, O
Num. Configurations : 280
Num. Atoms : 19040
Num. Elements : 8
JARVIS_AGRA_O
Description : The JARVIS_AGRA_O dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This dataset contains da...
Authors : Thomas A.A. Batchelor, Jack K. Pedersen, Simon H. ...
Elements : Ir, O, Pd, Pt, Rh, Ru
Num. Configurations : 1000
Num. Atoms : 17000
Num. Elements : 6
JARVIS_AGRA_OH
Description : The JARVIS_AGRA_OH dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This dataset contains d...
Authors : Thomas A.A. Batchelor, Jack K. Pedersen, Simon H. ...
Elements : H, Ir, O, Pd, Pt, Rh, Ru
Num. Configurations : 877
Num. Atoms : 15786
Num. Elements : 7
JARVIS_ALIGNN_FF
Description : The JARVIS_ALIGNN_FF dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset is a subset ...
Authors : Kamal Choudhary, Brian DeCost, Lily Major, Keith B...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 306411
Num. Atoms : 3193703
Num. Elements : 89
JARVIS_C2DB
Description : The JARVIS-C2DB dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This subset contains configura...
Authors : Sten Haastrup, Mikkel Strange, Mohnish Pandey, Tho...
Elements : Ag, Al, As, Au, B, Ba, Bi, Br, C, Ca, Cd, Cl, Co, Cr, Cs...
Num. Configurations : 3520
Num. Atoms : 17990
Num. Elements : 61
JARVIS_CFID_3D_8_18_2022
Description : The JARVIS_CFID_3D_8_18_2022 dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset c...
Authors : Kamal Choudhary, Kevin F. Garrity, Andrew C. E. Re...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 55631
Num. Atoms : 561738
Num. Elements : 89
JARVIS_CFID_OQMD
Description : The JARVIS_CFID_OQMD dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains con...
Authors : Scott Kirklin, James E Saal, Bryce Meredig, Alex T...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 459991
Num. Atoms : 2366255
Num. Elements : 89
JARVIS_DFT_2D_3_12_2021
Description : The DFT-2D-3-12-2021 dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset contains ...
Authors : Kamal Choudhary, Kevin F. Garrity, Andrew C. E. Re...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 1074
Num. Atoms : 7903
Num. Elements : 81
JARVIS_DFT_3D_12_12_2022
Description : The DFT_3D_12_12_2022 dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset contains...
Authors : Kamal Choudhary, Kevin F. Garrity, Andrew C. E. Re...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 75909
Num. Atoms : 785250
Num. Elements : 89
JARVIS_DFT_3D_8_18_2021
Description : The JARVIS_DFT_3D_8_18_2021 dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset co...
Authors : Kamal Choudhary, Kevin F. Garrity, Andrew C. E. Re...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 55631
Num. Atoms : 561741
Num. Elements : 89
JARVIS_EPC_2D
Description : The JARVIS_EPC_2D dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset contains con...
Authors : Daniel Wines, Kamal Choudhary, Adam J. Biacchi, ev...
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cl, Co, Cr, Cu...
Num. Configurations : 161
Num. Atoms : 788
Num. Elements : 55
JARVIS_MEGNet
Description : The JARVIS-MEGNet dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This subset contains configu...
Authors : Chi Chen, Weike Ye, Yunxing Zuo, Chen Zheng, Shyue...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 69234
Num. Atoms : 2070948
Num. Elements : 89
JARVIS_MEGNet2
Description : The JARVIS-MEGNet2 dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This subset contains 133K m...
Authors : Chi Chen, Weike Ye, Yunxing Zuo, Chen Zheng, Shyue...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 133420
Num. Atoms : 3880497
Num. Elements : 89
JARVIS_Materials_Project_2020
Description : The JARVIS_Materials_Project_2020 dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This sub...
Authors : Anubhav Jain, Shyue Ping Ong, Geoffroy Hautier, We...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 126335
Num. Atoms : 3725727
Num. Elements : 89
JARVIS_Materials_Project_84K
Description : The JARVIS_Materials_Project_84K dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subs...
Authors : Anubhav Jain, Shyue Ping Ong, Geoffroy Hautier, We...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 83426
Num. Atoms : 2339932
Num. Elements : 89
JARVIS_OMDB
Description : The JARVIS_OMDB dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains configur...
Authors : Bart Olsthoorn, R. Matthias Geilhufe, Stanislav S....
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Cl, Co, Cr...
Num. Configurations : 12497
Num. Atoms : 1061362
Num. Elements : 65
JARVIS_OQMD_no_CFID
Description : The JARVIS_OQMD_no_CFID dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains ...
Authors : Scott Kirklin, James E Saal, Bryce Meredig, Alex T...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 811782
Num. Atoms : 5017701
Num. Elements : 89
JARVIS_Open_Catalyst_100K
Description : The JARVIS_Open_Catalyst_100K dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset ...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 124943
Num. Atoms : 9720870
Num. Elements : 56
JARVIS_Open_Catalyst_10K
Description : The JARVIS_Open_Catalyst_10K dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset c...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 34943
Num. Atoms : 2720316
Num. Elements : 56
JARVIS_Open_Catalyst_All
Description : The JARVIS_Open_Catalyst_All dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset c...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 485269
Num. Atoms : 37728919
Num. Elements : 56
JARVIS_QE_TB
Description : The QE-TB dataset is part of the joint automated repository for various integrated simulations (JARVIS) DFT database. This subset contains configurati...
Authors : Kevin F. Garrity, Kamal Choudhary
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Cl, Co, Cr...
Num. Configurations : 829576
Num. Atoms : 2578920
Num. Elements : 64
JARVIS_QM9_STD_JCTC
Description : The JARVIS_QM9_STD_JCTC dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains ...
Authors : Raghunathan Ramakrishnan, Pavlo O. Dral, Matthias ...
Elements : C, F, H, N, O
Num. Configurations : 130829
Num. Atoms : 2359192
Num. Elements : 5
JARVIS_QMOF
Description : The JARVIS_QMOF dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains configur...
Authors : Andrew S. Rosen, Shaelyn M. Iyer, Debmalya Ray, Zh...
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl, Co...
Num. Configurations : 20425
Num. Atoms : 2321633
Num. Elements : 79
JARVIS_SNUMAT
Description : The JARVIS_SNUMAT dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains band g...
Authors : Sangtae Kim, Miso Lee, Changho Hong, Youngchae Yoo...
Elements : Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 10481
Num. Atoms : 216749
Num. Elements : 73
JARVIS_TinNet_N
Description : The JARVIS_TinNet dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains config...
Authors : Shih-Han Wang, Hemanth Somarajan Pillai, Siwen Wan...
Elements : Ag, Au, Cd, Co, Cr, Cu, Fe, H, Hf, Ir, Mn, Mo, N, Nb, Ni...
Num. Configurations : 329
Num. Atoms : 6251
Num. Elements : 27
JARVIS_TinNet_O
Description : The JARVIS_TinNet dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains config...
Authors : Shih-Han Wang, Hemanth Somarajan Pillai, Siwen Wan...
Elements : Ag, Al, Au, Bi, Cd, Co, Cr, Cu, Fe, Ga, Hf, In, Ir, La, ...
Num. Configurations : 747
Num. Atoms : 12699
Num. Elements : 36
JARVIS_TinNet_OH
Description : The JARVIS_TinNet dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains config...
Authors : Shih-Han Wang, Hemanth Somarajan Pillai, Siwen Wan...
Elements : Ag, Al, Au, Bi, Cd, Co, Cr, Cu, Fe, Ga, H, Hf, In, Ir, L...
Num. Configurations : 748
Num. Atoms : 13464
Num. Elements : 37
JARVIS_mlearn
Description : The JARVIS_mlearn dataset is part of the joint automated repository for various integrated simulations (JARVIS) database. This dataset contains config...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Cu, Ge, Li, Mo, Ni, Si
Num. Configurations : 1566
Num. Atoms : 115742
Num. Elements : 6
LiGePS_SSE_PBE
Description : Approximately 6,500 configurations of Li10GeP2S12, based on crystal structures from the Materials Project database, material ID mp-696129. One of two ...
Authors : Jianxing Huang, Linfeng Zhang, Han Wang, Jinbao Zh...
Elements : Ge, Li, P, S
Num. Configurations : 6550
Num. Atoms : 1479000
Num. Elements : 4
LiGePS_SSE_PBEsol
Description : Approximately 2,800 configurations of Li10GeP2S12, based on crystal structures from the Materials Project database, material ID mp-696129. One of two ...
Authors : Jianxing Huang, Linfeng Zhang, Han Wang, Jinbao Zh...
Elements : Ge, Li, P, S
Num. Configurations : 2835
Num. Atoms : 504350
Num. Elements : 4
LiSiPS_SSE_PBE
Description : Approximately 9,100 configurations of Li10SiP2S12, based on crystal structures from the Materials Project database, material ID mp-696129. One of two ...
Authors : Jianxing Huang, Linfeng Zhang, Han Wang, Jinbao Zh...
Elements : Li, P, S, Si
Num. Configurations : 9169
Num. Atoms : 2101550
Num. Elements : 4
LiSiPS_SSE_PBEsol
Description : Approximately 2,300 configurations of Li10SiP2S12, based on crystal structures from the Materials Project database, material ID mp-696129. One of two ...
Authors : Jianxing Huang, Linfeng Zhang, Han Wang, Jinbao Zh...
Elements : Li, P, S, Si
Num. Configurations : 2357
Num. Atoms : 313150
Num. Elements : 4
LiTiO_Science_2020
Description : This dataset contains configurations of lithium titanate from the publication Kinetic Pathways of ionic transport in fast-charging lithium titanate. I...
Authors : Tina Chen, Dong-hwa Seo
Elements : Be, Li, O, Ti
Num. Configurations : 849
Num. Atoms : 150105
Num. Elements : 4
MD22_AT_AT
Description : Dataset containing MD trajectories of AT-AT DNA base pairs from the MD22 benchmark set. {DESC}
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H, N, O
Source Data : http://sgdml.org/
Num. Configurations : 20001
Num. Atoms : 1200060
Num. Elements : 4
MD22_AT_AT_CG_CG
Description : Dataset containing MD trajectories of AT-AT-CG-CG DNA base pairs from the MD22 benchmark set. MD22 represents a collection of datasets in a benchmark ...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H, N, O
Source Data : http://sgdml.org/
Num. Configurations : 10153
Num. Atoms : 1198054
Num. Elements : 4
MD22_Ac_Ala3_NHMe
Description : Dataset containing MD trajectories of the 42-atom tetrapeptide Ac-Ala3-NHMe from the MD22 benchmark set. MD22 represents a collection of datasets in a...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H, N, O
Source Data : http://sgdml.org/
Num. Configurations : 85109
Num. Atoms : 3574578
Num. Elements : 4
MD22_DHA
Description : Dataset containing MD trajectories of DHA (docosahexaenoic acid) from the MD22 benchmark set. MD22 represents a collection of datasets in a benchmark ...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 69753
Num. Atoms : 3906168
Num. Elements : 3
MD22_buckyball_catcher
Description : Dataset containing MD trajectories of the buckyball-catcher supramolecule from the MD22 benchmark set. MD22 represents a collection of datasets in a b...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 6102
Num. Atoms : 903096
Num. Elements : 2
MD22_double_walled_nanotube
Description : Dataset containing MD trajectories of the double-walled nanotube supramolecule from the MD22 benchmark set. MD22 represents a collection of datasets i...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 5032
Num. Atoms : 1861840
Num. Elements : 2
MD22_stachyose
Description : Dataset containing MD trajectories of the tetrasaccharide stachyose from the MD22 benchmark set. MD22 represents a collection of datasets in a benchma...
Authors : Stefan Chmiela, Valentin Vassilev-Galindo, Oliver ...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 27272
Num. Atoms : 2372664
Num. Elements : 3
MISPR
Description : Example dataset for MISPR (Materials Informatics for Structure-Property Relationships) materials science simulation software, with DFT-calculated conf...
Authors : Rasha Atwi, Matthew Bliss, Maxim Makeev, Nav Nidhi...
Elements : C, Cl, F, H, N, O, P, S, Si
Num. Configurations : 503
Num. Atoms : 8996
Num. Elements : 9
MTPu_2023
Description : A comprehensive database generated using density functional theory simulations, encompassing a wide range of crystal structures, point defects, extend...
Authors : Karim Zongo, Hao Sun, Claudiane Ouellet-Plamondon,...
Elements : O, Si
Num. Configurations : 1062
Num. Atoms : 71595
Num. Elements : 2
Matbench_mp_e_form
Description : Matbench v0.1 test dataset for predicting DFT formation energy from structure. Adapted from Materials Project database. Entries having formation energ...
Authors : Alexander Dunn, Qi Wang, Alex Ganose, Daniel Dopp,...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 132752
Num. Atoms : 3869658
Num. Elements : 84
Matbench_mp_gap
Description : The Matbench_mp_gap dataset is a Matbench v0.1 test dataset for predicting DFT PBE band gap from structure, adapted from the Materials Project databas...
Authors : Alexander Dunn, Qi Wang, Alex Ganose, Daniel Dopp,...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 106113
Num. Atoms : 3184984
Num. Elements : 84
Matbench_perovskites
Description : The Matbench_perovskites dataset is a Matbench v0.1 test dataset for predicting formation energy from crystal structure. Adapted from an original data...
Authors : Alexander Dunn, Qi Wang, Alex Ganose, Daniel Dopp,...
Elements : Ag, Al, As, Au, B, Ba, Be, Bi, Ca, Cd, Co, Cr, Cs, Cu, F...
Num. Configurations : 18928
Num. Atoms : 94640
Num. Elements : 56
Materials_Project
Description : Configurations from the Materials Project database: an online resource with the goal of computing properties of all inorganic materials.
Authors : Anubhav Jain, Shyue Ping Ong, Geoffroy Hautier, We...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 6342530
Num. Atoms : 200016161
Num. Elements : 89
Mg_edmonds_2022
Description : 16748 configurations of magnesium with gathered energy, stress and forces at the DFT level of theory.
Authors : Marvin Poul
Elements : Mg
Num. Configurations : 16874
Num. Atoms : 78617
Num. Elements : 1
MoNbTaVW_PRB2021
Description : This dataset was originally designed to fit a GAP model for the Mo-Nb-Ta-V-W quinary system that was used to study segregation and defects in the body...
Authors : Jesper Byggmästar, Kai Nordlund, Flyura Djurabekov...
Elements : Mo, Nb, Ta, V, W
Num. Configurations : 2329
Num. Atoms : 127913
Num. Elements : 5
Mo_PRM2019
Description : This dataset was designed to enable machine learning of Mo elastic, thermal, and defect properties, as well as surface energetics, melting, and the st...
Authors : Jesper Byggmästar, Kai Nordlund, Flyura Djurabekov...
Elements : Mo
Num. Configurations : 3785
Num. Atoms : 45667
Num. Elements : 1
NDSC_TUT_2022
Description : 500 configurations of Mg2 for MD prediction using a model fitted on Al, W, Mg and Si.
Authors : Connor Allen, Albert P. Bartok
Elements : Mg
Num. Configurations : 500
Num. Atoms : 1000
Num. Elements : 1
NENCI-2021
Description : NENCI-2021 is a database of approximately 8000 benchmark Non-Equilibirum Non-Covalent Interaction (NENCI) energies performed on molecular dimers;inter...
Authors : Zachary M. Sparrow, Brian G. Ernst, Paul T. Joo, K...
Elements : Br, C, Cl, F, H, Li, N, Na, O, P, S
Num. Configurations : 7763
Num. Atoms : 129402
Num. Elements : 11
NEP_PRB_2021
Description : Approximately 7,000 distinct configurations of 2D-silicene, silicon, and PbTe. Silicon data used from http://dx.doi.org/10.1103/PhysRevX.8.041048. Dat...
Authors : Zheyong Fan
Elements : Pb, Si, Te
Num. Configurations : 7426
Num. Atoms : 611808
Num. Elements : 3
NEP_qHPF_test
Description : The test set of a train and test set pair.The combined datasets comprise approximately 275 configurations of monolayer quasi-hexagonal-phase fullerene...
Authors : Penghua Ying
Elements : C
Num. Configurations : 39
Num. Atoms : 4680
Num. Elements : 1
NEP_qHPF_train
Description : The train set of a train and test set pair.The combined datasets comprise approximately 275 configurations of monolayer quasi-hexagonal-phase fulleren...
Authors : Penghua Ying
Elements : C
Num. Configurations : 238
Num. Atoms : 28560
Num. Elements : 1
NMD-18
Description : 3,000 Al-Ga-In sesquioxides with energies and band gaps. Relaxed and Vegard's Law geometries with formation energy and band gaps at DFT-PBE level of t...
Authors : Christopher Sutton, Luca M. Ghiringhelli, Takenori...
Elements : Al, Ga, In, O
Num. Configurations : 3000
Num. Atoms : 185070
Num. Elements : 4
NNIP_FeH_PRM_2021
Description : Approximately 20,000 configurations from a dataset of alpha-iron and hydrogen. Properties include forces and potential energy, calculated using VASP a...
Authors : Fan-Shun Meng, Jun-Ping Du, Shuhei Shinzato, Hidek...
Elements : Fe, H
Num. Configurations : 20920
Num. Atoms : 1870008
Num. Elements : 2
NNP-Ga2O3
Description : 9,200 configurations of beta-Ga2O3, including two configuration sets. One contains DFT data for 8400 configurations simulated between temperatures of ...
Authors : Ruiyang Li, Zeyu Liu, Andrew Rohskopf, Kiarash Gor...
Elements : Ga, O
Num. Configurations : 9200
Num. Atoms : 2944000
Num. Elements : 2
NVNMD_GeTe
Description : Approximately 5,000 configurations of GeTe used in training of a non-von Neumann multiplication-less DNN model.
Authors : Pinghui Mo, Chang Li, Dan Zhao, Yujia Zhang, Mengc...
Elements : Ge, Te
Num. Configurations : 5025
Num. Atoms : 321600
Num. Elements : 2
N_O_F_columns_non-bonded_vdW_potential_JCP2023
Description : This dataset contains structures of materials from the N (15th), O (16th) and F (16th) columns of the periodic table used for generating a 2-body non-...
Authors : Peng Geng, Sergey Zybin, Saber Naserifar, William ...
Elements : As, At, Bi, O, P, Po, S, Sb, Se, Te
Num. Configurations : 262
Num. Atoms : 1494
Num. Elements : 10
Nb_PRM2019
Description : This dataset was designed to enable machine-learning of Nb elastic, thermal, and defect properties, as well as surface energetics, melting, and the st...
Authors : Jesper Byggmästar, Kai Nordlund, Flyura Djurabekov...
Elements : Nb
Num. Configurations : 3787
Num. Atoms : 45641
Num. Elements : 1
NequIP_NC_2022
Description : Approximately 57,000 configurations from the evaluation datasets for NequIP graph neural network model for interatomic potentials. Trajectories have b...
Authors : Simon Batzner, Albert Musaelian, Lixin Sun, Mario ...
Elements : C, Cu, H, Li, O, P, S
Num. Configurations : 56856
Num. Atoms : 7631075
Num. Elements : 7
NiCoCr_NC2020
Description : The face-centered cubic medium-entropy alloy NiCoCr has received considerable attention for its good mechanical properties, uncertain stacking fault e...
Authors : Binglun Yin, William Curtin
Elements : Co, Cr, Ni
Num. Configurations : 428
Num. Atoms : 40624
Num. Elements : 3
OC20_IS2RES_train
Description : This dataset contains all frames from the trajectories for the training configurations in the OC20 initial structure to relaxed energy (IS2RE) and ini...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 92899663
Num. Atoms : 7522707627
Num. Elements : 56
OC20_IS2RES_val_id
Description : OC20_IS2RES_val_id is the in-domain validation set for the OC20 Initial Structure to Relaxed Structure (IS2RS) and Initial Structure to Relaxed Energy...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 5026511
Num. Atoms : 406702097
Num. Elements : 56
OC20_IS2RES_val_ood_ads
Description : OC20_IS2RES_ood_ads is the out-of-domain validation set for the OC20 Initial Structure to Relaxed Structure (IS2RS) and Initial Structure to Relaxed E...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 4883196
Num. Atoms : 390308139
Num. Elements : 56
OC20_IS2RES_val_ood_both
Description : OC20_IS2RES_ood_ads is the out-of-domain validation set for the OC20 Initial Structure to Relaxed Structure (IS2RS) and Initial Structure to Relaxed E...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe, G...
Num. Configurations : 3665193
Num. Atoms : 308297930
Num. Elements : 55
OC20_IS2RES_val_ood_cat
Description : OC20_IS2RES_val_ood_cat is the out-of-domain validation set for the OC20 Initial Structure to Relaxed Structure (IS2RS) and Initial Structure to Relax...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe, G...
Num. Configurations : 5151015
Num. Atoms : 411767380
Num. Elements : 55
OC20_S2EF_train_200K
Description : OC20_S2EF_train_200K is the 200K training split of the OC20 Structure to Energy and Forces (S2EF) task.
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 200000
Num. Atoms : 14631937
Num. Elements : 56
OC20_S2EF_train_20M
Description : OC20_S2EF_train_20M is the 20 million structure training subset of the OC20 Structure to Energy and Forces dataset. Features include potential energy,...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 20000000
Num. Atoms : 1465265878
Num. Elements : 56
OC20_S2EF_train_2M
Description : OC20_S2EF_train_2M is the 2 million structure training subset of the OC20 Structure to Energy and Forces dataset. Features include potential energy, f...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 2000000
Num. Atoms : 146496199
Num. Elements : 56
OC20_S2EF_train_all
Description : OC20_S2EF_train_all is the ~63 million structure full training set of the OC20 Structure to Energy and Forces (S2EF) dataset. Features include energy,...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 133934018
Num. Atoms : 9810895377
Num. Elements : 56
OC20_S2EF_val_id
Description : OC20_S2EF_val_id is the ~1-million structure in-domain validation set of the OC20 Structure to Energy and Forces (S2EF) dataset. Features include ener...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 999866
Num. Atoms : 73147343
Num. Elements : 56
OC20_S2EF_val_ood_ads
Description : OC20_S2EF_val_ood_ads is the out-of-domain validation set of the OC20 Structure to Energy and Forces (S2EF) dataset featuring unseen adsorbate. Featur...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, B, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe...
Num. Configurations : 999838
Num. Atoms : 72858155
Num. Elements : 56
OC20_S2EF_val_ood_both
Description : OC20_S2EF_val_ood_both is the out-of-domain validation set of the OC20 Structure to Energy and Forces (S2EF) dataset featuring both unseen catalyst co...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe, G...
Num. Configurations : 999944
Num. Atoms : 84604635
Num. Elements : 55
OC20_S2EF_val_ood_cat
Description : OC20_S2EF_val_ood_cat is the out-of-domain validation set of the OC20 Structure to Energy and Forces (S2EF) dataset featuring unseen catalyst composit...
Authors : Lowik Chanussot, Abhishek Das, Siddharth Goyal, Th...
Elements : Ag, Al, As, Au, Bi, C, Ca, Cd, Cl, Co, Cr, Cs, Cu, Fe, G...
Num. Configurations : 999809
Num. Atoms : 74059718
Num. Elements : 55
OC22-IS2RE-Train
Description : Training configurations for the initial structure to relaxed total energy (IS2RE) task of OC22. Open Catalyst 2022 (OC22) is a database of training tr...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Ag, Al, As, Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, C...
Num. Configurations : 7878688
Num. Atoms : 634845874
Num. Elements : 57
OC22-IS2RE-Validation-in-domain
Description : In-domain validation configurations for the initial structure to relaxed total energy (IS2RE) task of OC22. Open Catalyst 2022 (OC22) is a database of...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Ag, Al, As, Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, C...
Num. Configurations : 442843
Num. Atoms : 35275211
Num. Elements : 57
OC22-IS2RE-Validation-out-of-domain
Description : Out-of-domain validation configurations for the initial structure to relaxed total energy (IS2RE) task of OC22. Open Catalyst 2022 (OC22) is a databas...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, Cu, Fe, Ga, G...
Num. Configurations : 521827
Num. Atoms : 42219955
Num. Elements : 52
OC22-S2EF-Train
Description : Training configurations for the structure to total energy and forces task (S2EF) of OC22. Open Catalyst 2022 (OC22) is a database of training trajecto...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Ag, Al, As, Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, C...
Num. Configurations : 8389365
Num. Atoms : 669615870
Num. Elements : 57
OC22-S2EF-Validation-in-domain
Description : In-domain validation configurations for the structure to total energy and forces (S2EF) task of OC22. Open Catalyst 2022 (OC22) is a database of train...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Ag, Al, As, Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, C...
Num. Configurations : 407195
Num. Atoms : 31919751
Num. Elements : 57
OC22-S2EF-Validation-out-of-domain
Description : Out-of-domain validation configurations for the structure to total energy and forces (S2EF) task of OC22. Open Catalyst 2022 (OC22) is a database of t...
Authors : Richard Tran, Janice Lan, Muhammed Shuaibi, Brando...
Elements : Au, Ba, Be, Bi, C, Ca, Cd, Ce, Co, Cr, Cs, Cu, Fe, Ga, G...
Num. Configurations : 459594
Num. Atoms : 36999141
Num. Elements : 52
OMat24_train_aimd_from_PBE_1000_npt
Description : The aimd-from-PBE-1000-npt training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 21269486
Num. Atoms : 179930890
Num. Elements : 89
OMat24_train_aimd_from_PBE_1000_nvt
Description : The aimd-from-PBE-1000-nvt training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 20256650
Num. Atoms : 169879539
Num. Elements : 86
OMat24_train_aimd_from_PBE_3000_npt
Description : The aimd-from-PBE-3000-npt training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 6076290
Num. Atoms : 411540573
Num. Elements : 89
OMat24_train_aimd_from_PBE_3000_nvt
Description : The aimd-from-PBE-3000-nvt training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 7839846
Num. Atoms : 530963613
Num. Elements : 86
OMat24_train_rattled_1000
Description : The rattled-1000 training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations....
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 11388475
Num. Atoms : 161511768
Num. Elements : 89
OMat24_train_rattled_1000_subsampled
Description : The rattled-1000-subsampled training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) ca...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 3879731
Num. Atoms : 55648760
Num. Elements : 89
OMat24_train_rattled_300
Description : The rattled-300 training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations. ...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 6319086
Num. Atoms : 89791956
Num. Elements : 88
OMat24_train_rattled_300_subsampled
Description : The rattled-300-subsampled training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 3463993
Num. Atoms : 49674369
Num. Elements : 88
OMat24_train_rattled_500
Description : The rattled-500 training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations. ...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 6922152
Num. Atoms : 98860290
Num. Elements : 88
OMat24_train_rattled_500_subsampled
Description : The rattled-500-subsampled training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) cal...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 3975399
Num. Atoms : 56846329
Num. Elements : 89
OMat24_train_rattled_relax
Description : The rattled-relax training split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 9433298
Num. Atoms : 78952123
Num. Elements : 87
OMat24_validation_aimd-from-PBE-1000-npt
Description : The val_aimd-from-PBE-1000-npt validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DF...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 202758
Num. Atoms : 1710254
Num. Elements : 85
OMat24_validation_aimd-from-PBE-1000-nvt
Description : The val_aimd-from-PBE-1000-nvt validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DF...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 195575
Num. Atoms : 1643554
Num. Elements : 85
OMat24_validation_aimd-from-PBE-3000-npt
Description : The val_aimd-from-PBE-3000-npt validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DF...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 59516
Num. Atoms : 4036396
Num. Elements : 85
OMat24_validation_aimd-from-PBE-3000-nvt
Description : The val_aimd-from-PBE-3000-nvt validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DF...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 76478
Num. Atoms : 5186115
Num. Elements : 84
OMat24_validation_rattled_1000
Description : The rattled-1000 validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculation...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce...
Num. Configurations : 117004
Num. Atoms : 1657765
Num. Elements : 86
OMat24_validation_rattled_1000_subsampled
Description : The rattled-1000-subsampled validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) ...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 38271
Num. Atoms : 549832
Num. Elements : 87
OMat24_validation_rattled_300
Description : The rattled-300 validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 62451
Num. Atoms : 883431
Num. Elements : 84
OMat24_validation_rattled_300_subsampled
Description : The rattled-300-subsampled validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) c...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 34244
Num. Atoms : 490880
Num. Elements : 85
OMat24_validation_rattled_500
Description : The rattled-500 validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculations...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 68830
Num. Atoms : 985338
Num. Elements : 85
OMat24_validation_rattled_500_subsampled
Description : The rattled-500-subsampled validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) c...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 39464
Num. Atoms : 564068
Num. Elements : 85
OMat24_validation_rattled_relax
Description : The rattled-relax validation split of OMat24 (Open Materials 2024). OMat24 is a large-scale open dataset of density functional theory (DFT) calculatio...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 91043
Num. Atoms : 764266
Num. Elements : 84
OMol25_neutral_validation
Description : The neutral validation set from OMol25. From the dataset creator: OMol25 represents the largest high quality molecular DFT dataset spanning biomolecul...
Authors : Daniel S. Levine, Muhammed Shuaibi, Evan Walter Cl...
Elements : B, Br, C, Ca, Cl, F, H, I, K, Li, Mg, N, Na, O, P, S, Si
Num. Configurations : 27697
Num. Atoms : 1238644
Num. Elements : 17
OMol25_train_4M
Description : The Train 4M set from OMol25 (~4 million structure training subset). From the dataset creator: OMol25 represents the largest high quality molecular DF...
Authors : Daniel S. Levine, Muhammed Shuaibi, Evan Walter Cl...
Elements : Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 3986754
Num. Atoms : 218680957
Num. Elements : 83
OMol25_train_neutral
Description : The Train neutral set from OMol25. From the dataset creator: OMol25 represents the largest high quality molecular DFT dataset spanning biomolecules, m...
Authors : Daniel S. Levine, Muhammed Shuaibi, Evan Walter Cl...
Elements : B, Br, C, Ca, Cl, F, H, I, K, Li, Mg, N, Na, O, P, S, Si
Num. Configurations : 34335828
Num. Atoms : 929562799
Num. Elements : 17
OMol25_validation
Description : The validation set from OMol25. From the dataset creator: OMol25 represents the largest high quality molecular DFT dataset spanning biomolecules, meta...
Authors : Daniel S. Levine, Muhammed Shuaibi, Evan Walter Cl...
Elements : Ag, Al, Ar, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 2762021
Num. Atoms : 283298012
Num. Elements : 83
OrbNet_Denali
Description : All DFT single-point calculations for the OrbNet Denali training set were carried out in Entos Qcore version 0.8.17 at the ωB97X-D3/def2-TZVP level of...
Authors : Anders S. Christensen, Sai Krishna Sirumalla, Zhuo...
Elements : B, Br, C, Ca, Cl, F, H, I, K, Li, Mg, N, Na, O, P, S, Si
Num. Configurations : 2338215
Num. Atoms : 104958650
Num. Elements : 17
PWMLFF_feature_comparison_NPJ2023
Description : Partial dataset for "Accuracy evaluation of different machine learning force field features". The included data is limited to that hosted directly on ...
Authors : Ting Han, Jie Li, Liping Liu, Fengyu Li, Lin-Wang ...
Elements : C, H, Mg, Ni, O, Si
Num. Configurations : 17255
Num. Atoms : 918240
Num. Elements : 6
Paramagnetic_lanthanide_compounds
Description : This dataset is composed of fully-deuterated Gd(III) analogue d-[GdL] in a variety of solvent materials, including MeOH, D2O and d6-DMSO.
Authors : Barak Alnami, Jon G. C. Kragskow, Jakob K. Staab, ...
Elements : C, Gd, H, N, O, S
Num. Configurations : 41748
Num. Atoms : 28419876
Num. Elements : 6
PropMolFlow_QM9_CNOFH_2025
Description : This DFT dataset is curated in response to the growing interest in property-guided molecule genaration using generative AI models. Typically, the prop...
Authors : Cheng Zeng, Jirui Jin, George Karypis, Mark Transt...
Elements : C, F, H, N, O
Num. Configurations : 10773
Num. Atoms : 205304
Num. Elements : 5
PtNi_alloy_NPJ2022
Description : DFT dataset consisting of 6828 resampled Pt-Ni alloys used for training an NNP. The energy and forces of each structure in the resampled database are ...
Authors : Shuang Han, Giovanni Barcaro, Alessandro Fortunell...
Elements : Ni, Pt
Num. Configurations : 6828
Num. Atoms : 1074161
Num. Elements : 2
QM-22
Description : Includes CHON molecules of 4-15 atoms, developed in counterpoint to the MD17 dataset, run at higher total energies (above 500 K) and with a broader co...
Authors : Joel M. Bowman, Chen Qu, Riccardo Conte, Apurba Na...
Elements : C, H, O
Num. Configurations : 6762
Num. Atoms : 101430
Num. Elements : 3
QM7b_AlphaML
Description : Energy, computed with LR-CCSD, hybrid DFT (B3LYP & SCAN0) for 7211 molecules in QM7b and 52 molecules in AlphaML showcase database.
Authors : Yang Yang, Ka Un Lao, David M. Wilkins, Andrea Gri...
Elements : C, Cl, H, N, O, S
Num. Configurations : 29033
Num. Atoms : 408865
Num. Elements : 6
QM9x
Description : Dataset containing DFT calculations of energy and forces for all configurations in the QM9 dataset, recalculated with the ωB97X functional and 6-31G(d...
Authors : Mathias Schreiner, Arghya Bhowmik, Tejs Vegge, Jon...
Elements : C, F, H, N, O
Num. Configurations : 133885
Num. Atoms : 2407753
Num. Elements : 5
QM_hamiltonian_nature_2019
Description : ~100,000 configurations of water, ethanol, malondialdehyde and uracil gathered at the PBE/def2-SVP level of theory using ORCA.
Authors : Kristof T. Schütt, Michael Gastegger, Alexandre Tk...
Elements : C, H, N, O
Num. Configurations : 91977
Num. Atoms : 887799
Num. Elements : 4
REANN_CO2_Ni100
Description : Approximately 9,850 configurations of CO2 with a movable Ni(100) surface.
Authors : Yaolong Zhang, Junfan Xia, Bin Jiang
Elements : C, Ni, O
Num. Configurations : 9850
Num. Atoms : 384150
Num. Elements : 3
SAIT_semiconductors_ACS_2023_HfO_out-of-domain
Description : Out-of-domain configurations from the SAIT_semiconductors_ACS_2023_HfO dataset. This dataset contains HfO configurations from the SAIT semiconductors ...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : Hf, O
Num. Configurations : 6996
Num. Atoms : 671616
Num. Elements : 2
SAIT_semiconductors_ACS_2023_HfO_raw
Description : Structures from the SAIT_semiconductors_ACS_2023_HfO dataset, separated into crystal, out-of-domain, and random (generated by randomly distributing 32...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : Hf, O
Num. Configurations : 192000
Num. Atoms : 18431808
Num. Elements : 2
SAIT_semiconductors_ACS_2023_HfO_test
Description : Test configurations from the SAIT_semiconductors_ACS_2023_HfO dataset. This dataset contains HfO configurations from the SAIT semiconductors datasets....
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : Hf, O
Num. Configurations : 3510
Num. Atoms : 336960
Num. Elements : 2
SAIT_semiconductors_ACS_2023_HfO_train
Description : Training configurations from the SAIT_semiconductors_ACS_2023_HfO dataset. This dataset contains HfO configurations from the SAIT semiconductors datas...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : Hf, O
Num. Configurations : 27960
Num. Atoms : 2684160
Num. Elements : 2
SAIT_semiconductors_ACS_2023_HfO_validation
Description : Validation configurations from the SAIT_semiconductors_ACS_2023_HfO dataset. This dataset contains HfO configurations from the SAIT semiconductors dat...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : Hf, O
Num. Configurations : 3510
Num. Atoms : 336960
Num. Elements : 2
SAIT_semiconductors_ACS_2023_SiN_out-of-domain
Description : Out-of-domain configurations from the SAIT_semiconductors_ACS_2023_SiN dataset. This dataset contains SiN, Si and N configurations from the SAIT semic...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : N, Si
Num. Configurations : 1235
Num. Atoms : 129675
Num. Elements : 2
SAIT_semiconductors_ACS_2023_SiN_raw
Description : Structures from the SAIT_semiconductors_ACS_2023_SiN dataset, separated into N-only, Si-only, SiN, and out-of-domain melt, quench and relax configurat...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : N, Si
Num. Configurations : 88163
Num. Atoms : 5204822
Num. Elements : 2
SAIT_semiconductors_ACS_2023_SiN_test
Description : Test configurations from the SAIT_semiconductors_ACS_2023_SiN dataset. This dataset contains SiN, Si and N configurations from the SAIT semiconductors...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : N, Si
Num. Configurations : 2866
Num. Atoms : 165559
Num. Elements : 2
SAIT_semiconductors_ACS_2023_SiN_train
Description : Training configurations from the SAIT_semiconductors_ACS_2023_SiN dataset. This dataset contains SiN, Si and N configurations from the SAIT semiconduc...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : N, Si
Num. Configurations : 22510
Num. Atoms : 1284467
Num. Elements : 2
SAIT_semiconductors_ACS_2023_SiN_validation
Description : Validation configurations from the SAIT_semiconductors_ACS_2023_SiN dataset. This dataset contains SiN, Si and N configurations from the SAIT semicond...
Authors : Geonu Kim, Byunggook Na, Gunhee Kim, Hyuntae Cho, ...
Elements : N, Si
Num. Configurations : 2822
Num. Atoms : 159951
Num. Elements : 2
SIMPLE_NN_SiO2
Description : 10,000 configurations of SiO2 used as an example for the SIMPLE-NN machine learning model. Dataset includes three types of crystals: quartz, cristobal...
Authors : Kyuhyun Lee, Dongsun Yoo, Wonseok Jeong, Seungwu H...
Elements : O, Si
Num. Configurations : 10000
Num. Atoms : 600000
Num. Elements : 2
SN2_JCTC_2019
Description : The SN2 dataset was generated as a partner benchmark dataset, along with the 'solvated protein fragments' dataset, for measuring the performance of ma...
Authors : Oliver T. Unke, Markus Meuwly
Elements : Br, C, Cl, F, H, I
Num. Configurations : 394684
Num. Atoms : 2194246
Num. Elements : 6
SPICE_2023
Description : SPICE (Small-Molecule/Protein Interaction Chemical Energies) is a collection of quantum mechanical data for training potential functions. The emphasis...
Authors : Peter Eastman, Pavan Kumar Behara, David L. Dotson...
Elements : Br, C, Ca, Cl, F, H, I, K, Li, N, Na, O, P, S
Num. Configurations : 116504
Num. Atoms : 3382829
Num. Elements : 14
Si-H-GAP_reference
Description : A reference set of configurations of hydrogenated liquid and amorphous silicon from the datasets for Si-H-GAP. These configurations were used to evalu...
Authors : Davis Unruh, Reza Vatan Meidanshahi, Stephen M. Go...
Elements : H, Si
Num. Configurations : 114
Num. Atoms : 24895
Num. Elements : 2
Si-H-GAP_training
Description : A set of training configurations of hydrogenated liquid and amorphous silicon from the datasets for Si-H-GAP. Includes virial sigmas used for configur...
Authors : Davis Unruh, Reza Vatan Meidanshahi, Stephen M. Go...
Elements : H, Si
Num. Configurations : 392
Num. Atoms : 65909
Num. Elements : 2
Si-H-GAP_validation
Description : A set of validation configurations of hydrogenated liquid and amorphous silicon from the datasets for Si-H-GAP. These configurations served to augment...
Authors : Davis Unruh, Reza Vatan Meidanshahi, Stephen M. Go...
Elements : H, Si
Num. Configurations : 150
Num. Atoms : 23000
Num. Elements : 2
Si_Al_Ti_Seko_PRB_2019_test
Description : Training sets from Si_Al_Ti_Seko_PRB_2019. This dataset is compiled of 10,000 selected structures from the ICSD, divided into training and test sets. ...
Authors : Atsuto Seko, Atsushi Togo, Isao Tanaka
Elements : Al, Si, Ti
Num. Configurations : 3989
Num. Atoms : 197628
Num. Elements : 3
Si_Al_Ti_Seko_PRB_2019_train
Description : Test sets from Si_Al_Ti_Seko_PRB_2019. This dataset is compiled of 10,000 selected structures from the ICSD, divided into training and test sets. The ...
Authors : Atsuto Seko, Atsushi Togo, Isao Tanaka
Elements : Al, Si, Ti
Num. Configurations : 36155
Num. Atoms : 1774664
Num. Elements : 3
Si_JCP_2017
Description : A dataset of 64-atom silicon configurations in four phases: cubic-diamond, (beta)-tin, R8, and liquid. MD simulations are run at 300, 600 and 900 K fo...
Authors : Ekin D. Cubuk, Brad D. Malone, Berk Onat, Amos Wat...
Elements : Si
Num. Configurations : 1117
Num. Atoms : 71424
Num. Elements : 1
Si_PRX_GAP
Description : The original DFT training data for the general-purpose silicon interatomic potential described in the associated publication. The kinds of configurati...
Authors : Albert P. Bartók, James Kermode, Noam Bernstein, G...
Elements : Si
Num. Configurations : 2472
Num. Atoms : 171164
Num. Elements : 1
Silica_NPJCM_2022
Description : This dataset was created for the purpose of training an MLIP for silica (SiO2). For initial DFT computations, GPAW (in combination with ASE) was used ...
Authors : Linus C. Erhard, Jochen Rohrer, Karsten Albe, Volk...
Elements : O, Si
Num. Configurations : 3074
Num. Atoms : 268118
Num. Elements : 2
Sn-SCAN_PRM_2023
Description : Approximately 6,500 configurations of Sn, including Sn8, Sn16 and Sn32, used in developing a deep potential that predicts the phase diagram of Sn.
Authors : Tao Chen, Fengbo Yuan, Jianchuan Liu, Huayun Geng,...
Elements : Sn
Num. Configurations : 6721
Num. Atoms : 113584
Num. Elements : 1
TSFF_PLOS_2022
Description : One configuration of an enzyme: training data for a quantum-guided molecular mechanics model.
Authors : Taylor R. Quinn, Himani N. Patel, Kevin H. Koh, Br...
Elements : C, H, N, O, S
Num. Configurations : 1
Num. Atoms : 117
Num. Elements : 5
Ta_Linear_JCP2014
Description : This data set was originally used to generate a linear SNAP potential for solid and liquid tantalum as published in Thompson, A.P. et. al, J. Comp. Ph...
Authors : Aidan P. Thompson, Laura P. Swiler, Christian R. T...
Elements : Ta
Num. Configurations : 363
Num. Atoms : 4224
Num. Elements : 1
Ta_PINN_2021
Description : A dataset consisting of the energies of supercells containing from 1 to 250 atoms. The supercells represent energy-volume relations for 8 crystal stru...
Authors : Yi-Shen Lin, Ganga P. Purja Pun, Yuri Mishin
Elements : Ta
Num. Configurations : 3196
Num. Atoms : 136037
Num. Elements : 1
Ta_PRM2019
Description : This dataset was designed to enable machine-learning of Ta elastic, thermal, and defect properties, as well as surface energetics, melting, and the st...
Authors : Jesper Byggmästar, Kai Nordlund, Flyura Djurabekov...
Elements : Ta
Num. Configurations : 3775
Num. Atoms : 45439
Num. Elements : 1
TdS-PdV_Atari5200
Description : Approximately 45,000 configurations of metal oxides of Mg, Ag, Pt, Cu and Zn, with initial training structures taken from the Materials Project databa...
Authors : Pandu Wisesa, Christopher M. Andolina, Wissam A. S...
Elements : Ag, Cu, Mg, O, Pt, Zn
Num. Configurations : 44404
Num. Atoms : 1987604
Num. Elements : 6
TiMoS_alloys_CMS2021
Description : Training set (DFT output) for CE models and MC simulation output for the manuscript 'Phase behaviour of (Ti:Mo)S2binary alloys arising from electron-l...
Authors : Andrea Silva, Tomas Polcar, Denis Kramer
Elements : Mo, S, Ti
Num. Configurations : 259
Num. Atoms : 3996
Num. Elements : 3
TiO2_CMS2016
Description : TiO2 dataset that was designed to build atom neural network potentials (ANN) by Artrith et al. using the AENET package. This dataset includes various ...
Authors : Nongnuch Artrith, Alexander Urban
Elements : O, Ti
Num. Configurations : 7812
Num. Atoms : 165114
Num. Elements : 2
TiZrHfTa_APS2021
Description : A dataset used to train machine-learning interatomic potentials (moment tensor potentials) for multicomponent alloys to ab initio data in order to inv...
Authors : Konstantin Gubaev, Yuji Ikeda, Ferenc Tasnádi, Jör...
Elements : Hf, Ta, Ti, Zr
Num. Configurations : 3623
Num. Atoms : 223984
Num. Elements : 4
Ti_NPJCM_2021
Description : Approximately 7,400 configurations of titanium used for training a deep potential using the DeePMD-kit molecular dynamics package and DP-GEN training ...
Authors : Tongqi Wen, Rui Wang, Lingyu Zhu, Linfeng Zhang, H...
Elements : Ti
Num. Configurations : 7378
Num. Atoms : 143856
Num. Elements : 1
Transition1x-test
Description : The test split of the Transition1x dataset. Transition1x is a benchmark dataset containing 9.6 million Density Functional Theory (DFT) calculations of...
Authors : Mathias Schreiner, Arghya Bhowmik, Tejs Vegge, Jon...
Elements : C, H, N, O
Num. Configurations : 190277
Num. Atoms : 2106770
Num. Elements : 4
Transition1x-validation
Description : The validation split of the Transition1x dataset. Transition1x is a benchmark dataset containing 9.6 million Density Functional Theory (DFT) calculati...
Authors : Mathias Schreiner, Arghya Bhowmik, Tejs Vegge, Jon...
Elements : C, H, N, O
Num. Configurations : 264996
Num. Atoms : 3743476
Num. Elements : 4
Transition1x_train
Description : The training split of the Transition1x dataset. Transition1x is a benchmark dataset containing 9.6 million Density Functional Theory (DFT) calculation...
Authors : Mathias Schreiner, Arghya Bhowmik, Tejs Vegge, Jon...
Elements : C, H, N, O
Num. Configurations : 62990
Num. Atoms : 536010
Num. Elements : 4
UNEP_v1_2023_test
Description : The test set for UNEP-v1 (version 1 of Unified NeuroEvolution Potential), a model implemented in GPUMD.
Authors : Keke Song, Rui Zhao, Jiahui Liu, Yanzhou Wang, Eri...
Elements : Ag, Al, Au, Cr, Cu, Mg, Mo, Ni, Pb, Pd, Pt, Ta, Ti, V, W...
Num. Configurations : 4411
Num. Atoms : 318910
Num. Elements : 16
UNEP_v1_2023_train
Description : The training set for UNEP-v1 (version 1 of Unified NeuroEvolution Potential), a model implemented in GPUMD.
Authors : Keke Song, Rui Zhao, Jiahui Liu, Yanzhou Wang, Eri...
Elements : Ag, Al, Au, Cr, Cu, Mg, Mo, Ni, Pb, Pd, Pt, Ta, Ti, V, W...
Num. Configurations : 104799
Num. Atoms : 6840534
Num. Elements : 16
V_PRM2019
Description : This dataset was designed to enable machine-learning of V elastic, thermal, and defect properties, as well as surface energetics, melting, and the str...
Authors : Jesper Byggmästar, Kai Nordlund, Flyura Djurabekov...
Elements : V
Num. Configurations : 3802
Num. Atoms : 46466
Num. Elements : 1
W-14
Description : 158,000 diverse atomic environments of elemental tungsten.Includes DFT-PBE energies, forces and stresses for tungsten; periodic unit cells in the rang...
Authors : Wojciech J. Szlachta, Albert P. Bartók, Gábor Csán...
Elements : W
Num. Configurations : 9693
Num. Atoms : 158515
Num. Elements : 1
WBe_PRB2019
Description : This data set was originally used to generate a multi-component linear SNAP potential for tungsten and beryllium as published in Wood, M. A., et. al. ...
Authors : Mitchell A. Wood, Mary Alice Cusentino, Brian D. W...
Elements : Be, W
Num. Configurations : 25120
Num. Atoms : 525915
Num. Elements : 2
WS22_acrolein
Description : Configurations of acrolein from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geomet...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, O
Num. Configurations : 120000
Num. Atoms : 960000
Num. Elements : 3
WS22_alanine
Description : Configurations of alanine from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometr...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 1560000
Num. Elements : 4
WS22_dmabn
Description : Configurations of dmabn from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometrie...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N
Num. Configurations : 120000
Num. Atoms : 2520000
Num. Elements : 3
WS22_nitrophenol
Description : Configurations of nitrophenol from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geo...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 1800000
Num. Elements : 4
WS22_o-hbdi
Description : Configurations of o-hbdi from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometri...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 2640000
Num. Elements : 4
WS22_sma
Description : Configurations of sma from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometries ...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120040
Num. Atoms : 2280760
Num. Elements : 4
WS22_thymine
Description : Configurations of o-hbdi from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometri...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 1800000
Num. Elements : 4
WS22_toluene
Description : Configurations of toluene from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometr...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H
Num. Configurations : 100000
Num. Atoms : 1500000
Num. Elements : 2
WS22_urea
Description : Configurations of urea from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geometries...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 960000
Num. Elements : 4
WS22_urocanic
Description : Configurations of urocanic from WS22. The WS22 database combines Wigner sampling with geometry interpolation to generate 1.18 million molecular geomet...
Authors : Max Pinheiro Jr, Shuang Zhang, Pavlo O. Dral, Mari...
Elements : C, H, N, O
Num. Configurations : 120000
Num. Atoms : 1920000
Num. Elements : 4
W_LML-retrain_bulk_MD_test
Description : Test set from W_LML-retrain dataset, containing bulk tungsten calculations. The W_LML-retrain dataset contains DFT calculations used in testing a line...
Authors : Berk Onat, Christoph Ortner, James R. Kermode
Elements : W
Num. Configurations : 8
Num. Atoms : 1996
Num. Elements : 1
W_PRB2019
Description : This dataset was originally designed to fit a GAP potential with a specific focus on properties relevant for simulations of radiation-induced collisio...
Authors : Jesper Byggmästar, Ali Hamedani, Kai Nordlund, Fly...
Elements : W
Num. Configurations : 3528
Num. Atoms : 42068
Num. Elements : 1
Yttrium-catalyzed_benzylic_C-H_alkylations_of_alkylpyridines_with_olefins
Description : This data was assembled to investigate rare-earth-catalyzed benzylic C(sp3)-H addition of pyridines to olefins. All calculations were performed with t...
Authors : Guangli Zhou, Gen Luo, Xiaohui Kang, Zhaomin Hou, ...
Elements : C, H, N, Y
Num. Configurations : 68
Num. Atoms : 4110
Num. Elements : 4
ZIF-4_Amorphous_Zeolitic_Imidazolate_Frameworks_2023
Description : This dataset contains four trajectories of amorphous zeolitic imidazolate frameworks (ZIF-4), liquids calculated at four different volumes and at tem...
Authors : Nicolas Castel, Dune Andre, Connor Edwards, Jack D...
Elements : C, H, N, Zn
Num. Configurations : 1189836
Num. Atoms : 323635392
Num. Elements : 4
Zeo-1_SD_2022
Description : 130,000 configurations of zeolite from the Database of Zeolite Structures. Calculations performed using Amsterdam Modeling Suite software.
Authors : Leonid Komissarov, Toon Verstraelen
Elements : Al, Ba, Be, C, Ca, Cs, F, Ge, H, K, Li, N, Na, O, Si
Num. Configurations : 12930
Num. Atoms : 1841742
Num. Elements : 15
Zn_MTP_CMS2023
Description : A training dataset of diverse atomic configurations of Zn, varying in aggregation states, crystal structures, defect types, and sizes. The aim was to ...
Authors : Haojie Mei, Luyao Cheng, Liang Chen, Feifei Wang, ...
Elements : Zn
Num. Configurations : 13552
Num. Atoms : 278996
Num. Elements : 1
Zr_Sn_JNM_2024
Description : This dataset contains data from density functional theory calculations of various atomic configurations of pure Zr, pure Sn, and Zr-Sn alloys with dif...
Authors : Haojie Mei, Liang Chen, Feifei Wang, Guisen Liu, J...
Elements : Sn, Zr
Num. Configurations : 23611
Num. Atoms : 688087
Num. Elements : 2
a-AlOx_JCP_2020
Description : This dataset was used for the training of an MLIP for amorphous alumina (a-AlOx). Two configurations sets correspond to i) the actual training data an...
Authors : Wenwen Li, Yasunobu Ando, Satoshi Watanabe
Elements : Al, O
Num. Configurations : 123586
Num. Atoms : 4541918
Num. Elements : 2
aC_JCP_2023
Description : The amorphous carbon dataset was generated using ab initio calculations with VASP software. We utilized the LDA exchange-correlation functional and th...
Authors : Emi Minamitani, Ippei Obayashi, Koji Shimizu, Sato...
Elements : C
Num. Configurations : 20195
Num. Atoms : 5192400
Num. Elements : 1
aC_JCP_2023_test
Description : Test split from the 216-atom amorphous portion of the aC_JCP_2023 dataset. The amorphous carbon dataset was generated using ab initio calculations wit...
Authors : Emi Minamitani, Ippei Obayashi, Koji Shimizu, Sato...
Elements : C
Num. Configurations : 3366
Num. Atoms : 727056
Num. Elements : 1
aC_JCP_2023_train
Description : Train split from the 216-atom amorphous portion of the aC_JCP_2023 dataset. The amorphous carbon dataset was generated using ab initio calculations wi...
Authors : Emi Minamitani, Ippei Obayashi, Koji Shimizu, Sato...
Elements : C
Num. Configurations : 13464
Num. Atoms : 2908224
Num. Elements : 1
adatoms_on_single-layer_graphene_PRR2021
Description : This dataset consists of graphene superlattices with tungsten adatoms with properties calculated at the DFT level of theory. The authors modeled the p...
Authors : Anastasiia Skurativska, Stepan S. Tsirkin, Fabian ...
Elements : C, Cr, Ir, Mo, Nb, Os, Re, Rh, Ru, Ta, W
Num. Configurations : 18
Num. Atoms : 774
Num. Elements : 11
aleatoric_epistemic_error_AIC2023
Description : Dataset for H2CO, with and without added noise for testing the effects of noise on quality of fit. Configurations sets are included for clean energy v...
Authors : Sugata Goswami, Silvan Käser, Raymond J. Bemish, M...
Elements : C, H, O
Num. Configurations : 28808
Num. Atoms : 115232
Num. Elements : 3
alkali-metal_intercalation_in_disordered_carbon_anode_materials_JMCA2019
Description : A dataset created as part of a combination DFT-ML approach to study three alkali metals (K, Li, Na) in model carbon systems at a range of densities an...
Authors : Jian-Xing Huang, Gábor Csányi, Jin-Bao Zhao, Jun C...
Elements : C, K, Li, Na
Num. Configurations : 1365
Num. Atoms : 298050
Num. Elements : 4
alpha_brass_nanoparticles
Description : 53,841 structures of alpha-brass (less than 40% Zinc). Includes atomic forces and total energy. Calculated using VASP at the DFT level of theory.
Authors : Jan Weinreich, Anton Römer, Martín Leandro Paleico...
Elements : Cu, Zn
Num. Configurations : 53696
Num. Atoms : 2956679
Num. Elements : 2
cG-SchNet
Description : Configurations from a cG-SchNet trained on a subset of the QM9dataset. Model was trained with the intention of providing molecules withspecified funct...
Authors : Niklas W.A. Gebauer, Michael Gastegger, Stefaan S....
Elements : C, F, H, N, O
Num. Configurations : 79772
Num. Atoms : 1467492
Num. Elements : 5
calcium_ferrites_as_cathodes_ca4fe9o17
Description : Dataset for "Appraisal of calcium ferrites as cathodes for calcium rechargeable batteries: DFT, synthesis, characterization and electrochemistry of Ca...
Authors : M. Elena Arroyo-de Dompablo, José Luis Casals
Elements : Ca, Fe, O
Num. Configurations : 345
Num. Atoms : 35462
Num. Elements : 3
cathode_materials_for_rechargeable_Ca_batteries_CM2021
Description : Data from the publication "Enlisting Potential Cathode Materials for Rechargeable Ca Batteries". The development of rechargeable batteries based on a ...
Authors : M. Elena Arroyo-de Dompablo, Jose Luis Casals
Elements : Ca, Co, Fe, Mn, N, Ni, O, P, S, Si, V
Num. Configurations : 10840
Num. Atoms : 1034770
Num. Elements : 11
datasets_for_magnetic_MTP_NatSR2024_training
Description : This dataset comprises a training dataset for magnetic multi-component machine-learning potentials for Fe-Al systems, including different concentratio...
Authors : Alexey S. Kotykhov, Konstantin Gubaev, Max Hodapp,...
Elements : Al, Fe
Num. Configurations : 2012
Num. Atoms : 11440
Num. Elements : 2
datasets_for_magnetic_MTP_NatSR2024_verification
Description : This is the verification dataset (see companion training dataset: datasets_for_magnetic_MTP_NatSR2024_training) used in training a magnetic multi-comp...
Authors : Alexey S. Kotykhov, Konstantin Gubaev, Max Hodapp,...
Elements : Al, Fe
Num. Configurations : 336
Num. Atoms : 3696
Num. Elements : 2
defected_phosphorene_ACS_2023
Description : This dataset contains pristine monolayer phosphorene as well as structures with monovacancies which were used to train an artificial neural network (A...
Authors : Lukáš Kývala, Andrea Angeletti, Cesare Franchini, ...
Elements : P
Num. Configurations : 5091
Num. Atoms : 722311
Num. Elements : 1
discrepencies_and_error_metrics_NPJ_2023_enhanced_validation_set
Description : Structures from discrepencies_and_error_metrics_NPJ_2023 validation set, enhanced by inclusion of rare events. The full discrepencies_and_error_metric...
Authors : Yunsheng Liu, Xingfeng He, Yifei Mo
Elements : Si
Num. Configurations : 50
Num. Atoms : 3198
Num. Elements : 1
discrepencies_and_error_metrics_NPJ_2023_interstitial_enhanced_training_set
Description : Structures from discrepencies_and_error_metrics_NPJ_2023 training set, enhanced by inclusion of interstitials. The full discrepencies_and_error_metric...
Authors : Yunsheng Liu, Xingfeng He, Yifei Mo
Elements : Si
Num. Configurations : 218
Num. Atoms : 13629
Num. Elements : 1
discrepencies_and_error_metrics_NPJ_2023_interstitial_re_testing_set
Description : Structures from discrepencies_and_error_metrics_NPJ_2023 test set; these include an interstitial. The full discrepencies_and_error_metrics_NPJ_2023 da...
Authors : Yunsheng Liu, Xingfeng He, Yifei Mo
Elements : Si
Num. Configurations : 100
Num. Atoms : 6500
Num. Elements : 1
discrepencies_and_error_metrics_NPJ_2023_vacancy_enhanced_training_set
Description : Structures from discrepencies_and_error_metrics_NPJ_2023 training set; includes some structures with vacancies. The full discrepencies_and_error_metri...
Authors : Yunsheng Liu, Xingfeng He, Yifei Mo
Elements : Si
Num. Configurations : 218
Num. Atoms : 13389
Num. Elements : 1
discrepencies_and_error_metrics_NPJ_2023_vacancy_re_testing_set
Description : Structures from discrepencies_and_error_metrics_NPJ_2023 test set; these include a single migrating vacancy. The full discrepencies_and_error_metrics_...
Authors : Yunsheng Liu, Xingfeng He, Yifei Mo
Elements : Si
Num. Configurations : 100
Num. Atoms : 6300
Num. Elements : 1
disordered_transition_metal_oxyfluorides_EA2021
Description : Data from "On-the-fly assessment of diffusion barriers of disordered transition metal oxyfluorides using local descriptors". The dataset contains the ...
Authors : Jin Hyun Chang, Peter Bjørn Jørgensen, Simon Lofta...
Elements : F, Li, O, V
Num. Configurations : 233
Num. Atoms : 20670
Num. Elements : 4
doped_CsPbI3_energetics_test
Description : The test set from the doped CsPbI3 energetics dataset. This dataset was created to explore the effect of Cd and Pb substitutions on the structural sta...
Authors : Roman A. Eremin, Innokentiy S. Humonen, Alexey A. ...
Elements : Cd, Cs, I, Pb, Zn
Num. Configurations : 60
Num. Atoms : 9600
Num. Elements : 5
doped_CsPbI3_energetics_train_validate
Description : The training + validation set from the doped CsPbI3 energetics dataset. This dataset was created to explore the effect of Cd and Pb substitutions on t...
Authors : Roman A. Eremin, Innokentiy S. Humonen, Alexey A. ...
Elements : Cd, Cs, I, Pb, Zn
Num. Configurations : 142
Num. Atoms : 22720
Num. Elements : 5
electrode_materials_for_ca-based_rechargeable_batteries
Description : Dataset for "Analysis of minerals as electrode materials for Ca-based rechargeable batteries". Includes DFT structures of pyroxenes, garnet and carbon...
Authors : M. Elena Arroyo-de Dompablo, Jose Luis Casals
Elements : C, Ca, Cr, Mn, O, Si
Num. Configurations : 4726
Num. Atoms : 550074
Num. Elements : 6
ferroelectricity_and_metallicity_in_BaTiO3_JMCC2021
Description : Dataset for "Interplay between ferroelectricity and metallicity in BaTiO3", exploring properties of ferroelectric barium titanate (BaTiO3), including ...
Authors : Veronica F. Michel, Tobias Esswein, Nicola A. Spal...
Elements : Al, Ba, K, La, Nb, O, Sc, Ti, V
Num. Configurations : 1127
Num. Atoms : 19125
Num. Elements : 9
flexible_molecules_JCP2021
Description : Configurations of azobenzene featuring a cis to trans thermal inversion through three channels: inversion, rotation, and rotation assisted by inversio...
Authors : Valentin Vassilev-Galindo, Gregory Fonseca, Igor P...
Elements : C, H, N, O
Num. Configurations : 69182
Num. Atoms : 1520340
Num. Elements : 4
glass-ceramic_lithium_thiophosphate_electrolytes_
Description : This database contains computationally generated atomic structures of glass-ceramics lithium thiophosphates (gc-LPS) with the general composition (Li2...
Authors : Haoyue Guo, Nongnuch Artrith
Elements : Li, P, S
Num. Configurations : 6055
Num. Atoms : 264604
Num. Elements : 3
linear_magnetic_coefficient_in_Cr2O3_JPCM2024
Description : We establish the sign of the linear magnetoelectric (ME) coefficient, α, in chromia, Cr₂O₃. Cr₂O₃ is the prototypical linear ME material, in which an ...
Authors : Eric Bousquet, Eddy Lelièvre-Berna, Navid Qureshi,...
Elements : Cr, O
Num. Configurations : 165
Num. Atoms : 1650
Num. Elements : 2
local_polarization_in_oxygen-deficient_LaMnO3_PRR2020
Description : This dataset contains structural calculations of LaMnO3 carried out in Quantum ESPRESSO at the DFT-PBEsol+U level of theory. The dataset was built to ...
Authors : Chiara Ricca, Nicolas Niederhauser, Ulrich Aschaue...
Elements : Ba, La, Mn, O, Ti
Num. Configurations : 4514
Num. Atoms : 174337
Num. Elements : 5
mbGDML_maldonado_2023
Description : Configurations of water, acetonitrile and methanol, simulated with ASE and modeled using a variety of software and methods: GAP, SchNet, GDML, ORCA an...
Authors : Alex M. Maldonado
Elements : C, H, N, O
Num. Configurations : 24543
Num. Atoms : 712134
Num. Elements : 4
mlearn_Cu_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Cu
Num. Configurations : 31
Num. Atoms : 3178
Num. Elements : 1
mlearn_Cu_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Cu
Num. Configurations : 262
Num. Atoms : 27416
Num. Elements : 1
mlearn_Ge_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Ge
Num. Configurations : 25
Num. Atoms : 1568
Num. Elements : 1
mlearn_Ge_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Ge
Num. Configurations : 228
Num. Atoms : 14072
Num. Elements : 1
mlearn_Li_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Li
Num. Configurations : 29
Num. Atoms : 1320
Num. Elements : 1
mlearn_Li_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Li
Num. Configurations : 241
Num. Atoms : 11576
Num. Elements : 1
mlearn_Mo_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Mo
Num. Configurations : 23
Num. Atoms : 1189
Num. Elements : 1
mlearn_Mo_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Mo
Num. Configurations : 194
Num. Atoms : 10087
Num. Elements : 1
mlearn_Ni_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Ni
Num. Configurations : 31
Num. Atoms : 3158
Num. Elements : 1
mlearn_Ni_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Ni
Num. Configurations : 263
Num. Atoms : 27420
Num. Elements : 1
mlearn_Si_test
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Si
Num. Configurations : 25
Num. Atoms : 1525
Num. Elements : 1
mlearn_Si_train
Description : A comprehensive DFT data set was generated for six elements - Li, Mo, Ni, Cu, Si, and Ge. These elements were chosen to span a variety of chemistries ...
Authors : Yunxing Zuo, Chi Chen, Xiangguo Li, Zhi Deng, Yimi...
Elements : Si
Num. Configurations : 214
Num. Atoms : 13233
Num. Elements : 1
oxygen-vacancy_defects_in_Cu2O(111)
Description : This dataset investigates the effect of defects, such as copper and oxygen vacancies, in cuprous oxide films. Structures include oxygen vacancies form...
Authors : Nanchen Dongfang, Marcella Iannuzzi, Yasmine Al-Ha...
Elements : Cu, O
Num. Configurations : 864
Num. Atoms : 606270
Num. Elements : 2
pure_magnesium_DFT_PRM2020
Description : This dataset provides DFT (as implemented in VASP) calculations for pure magnesium. Configuration sets include bulk, generalized stacking fault energi...
Authors : Binglun Yin, Markus Stricker, W. A. Curtin
Elements : Mg
Num. Configurations : 405
Num. Atoms : 10730
Num. Elements : 1
q-AQUA
Description : The a-AQUA dataset was generated to address the need for a training set for a water PES that includes 2-body, 3-body and 4-body interactions calculate...
Authors : Qi Yu, Chen Qu, Paul L. Houston, Riccardo Conte, A...
Elements : H, O
Num. Configurations : 120372
Num. Atoms : 878157
Num. Elements : 2
rMD17
Description : A dataset of 10 molecules (aspirin, azobenzene, benzene, ethanol, malonaldehyde, naphthalene, paracetamol, salicylic, toluene, uracil) with 100,000 st...
Authors : Anders S. Christensen, O. Anatole von Lilienfeld
Elements : C, H, N, O
Num. Configurations : 999988
Num. Atoms : 15599712
Num. Elements : 4
reactive_hydrogen_ACS_2023
Description : This dataset contains structures of Cu, including Cu(111), Cu(100), Cu(110), and Cu(211). Slab settings are as follows: 3 x 3, 6-layered slabs for Cu(...
Authors : Wojciech G. Stark, Julia Westermayr, Oscar A. Doug...
Elements : Cu, H
Num. Configurations : 3413
Num. Atoms : 191104
Num. Elements : 2
reduced-perovskite_and_oxidized-marokite_oxides
Description : Dataset contains DFT calculations of oxygen-deficient perovskites from the Ca2Fe2O5-brownmillerite and Ca2Mn2O5 structures; and tunnel CaMn4O8, a deri...
Authors : M. Elena Arroyo-de Dompablo, José Luis Casals
Elements : Ca, Fe, Mn, O
Num. Configurations : 2919
Num. Atoms : 387258
Num. Elements : 4
sAlex_validation
Description : The validation split of sAlex. sAlex is a subsample of the Alexandria dataset that was used to fine tune the OMat24 (Open Materials 2024) models. From...
Authors : Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Br...
Elements : Ac, Ag, Al, As, Au, B, Ba, Be, Bi, Br, C, Ca, Cd, Ce, Cl...
Num. Configurations : 547885
Num. Atoms : 5670890
Num. Elements : 86
sGDML_Aspirin_ccsd_NC2018_test
Description : The test set of a train/test pair from the aspirin dataset from sGDML. To create the coupled cluster datasets, the data used for training the models w...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 500
Num. Atoms : 10500
Num. Elements : 3
sGDML_Aspirin_ccsd_NC2018_train
Description : The train set of a train/test pair from the aspirin dataset from sGDML. To create the coupled cluster datasets, the data used for training the models ...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 1000
Num. Atoms : 21000
Num. Elements : 3
sGDML_Benzene_DFT_NC2018
Description : The data used for training the DFT models were created running ab initio MD in the NVT ensemble using the Nosé-Hoover thermostat at 500 K during a 200...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 49863
Num. Atoms : 598356
Num. Elements : 2
sGDML_Benzene_ccsdt_NC2018_test
Description : The test set of a train/test pair from the benzene dataset from sGDML. To create the coupled cluster datasets, the data used for training the models w...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 500
Num. Atoms : 6000
Num. Elements : 2
sGDML_Benzene_ccsdt_NC2018_train
Description : The train set of a train/test pair from the benzene dataset from sGDML. To create the coupled cluster datasets, the data used for training the models ...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 1000
Num. Atoms : 12000
Num. Elements : 2
sGDML_Ethanol_ccsdt_NC2018_test
Description : The test set of a train/test pair from the ethanol dataset from sGDML. To create the coupled cluster datasets, the data used for training the models w...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 1000
Num. Atoms : 9000
Num. Elements : 3
sGDML_Ethanol_ccsdt_NC2018_train
Description : The train set of a train/test pair from the ethanol dataset from sGDML. To create the coupled cluster datasets, the data used for training the models ...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 998
Num. Atoms : 8982
Num. Elements : 3
sGDML_Malonaldehyde_ccsdt_NC2018_test
Description : The test set of a train/test pair from the malonaldehyde dataset from sGDML. To create the coupled cluster datasets, the data used for training the mo...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 500
Num. Atoms : 4500
Num. Elements : 3
sGDML_Malonaldehyde_ccsdt_NC2018_train
Description : The train set of a train/test pair from the malonaldehyde dataset from sGDML. To create the coupled cluster datasets, the data used for training the m...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H, O
Source Data : http://sgdml.org/
Num. Configurations : 1000
Num. Atoms : 9000
Num. Elements : 3
sGDML_Toluene_ccsdt_NC2018_test
Description : The test set of a train/test pair from the toluene dataset from sGDML. To create the coupled cluster datasets, the data used for training the models w...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 501
Num. Atoms : 7515
Num. Elements : 2
sGDML_Toluene_ccsdt_NC2018_train
Description : The train set of a train/test pair from the toluene dataset from sGDML. To create the coupled cluster datasets, the data used for training the models ...
Authors : Stefan Chmiela, Huziel E. Sauceda, Klaus-Robert Mü...
Elements : C, H
Source Data : http://sgdml.org/
Num. Configurations : 1000
Num. Atoms : 15000
Num. Elements : 2
solute_strengthening_of_prism_edge_locations_in_Mg_alloys
Description : This dataset includes Mg and Mg-Zn alloy structures with solute atoms at the prism edge locations. The dataset was created to study the strengthening ...
Authors : Masoud Rahbar Niazi, W. A Curtin
Elements : Mg, Zn
Num. Configurations : 94
Num. Atoms : 28615
Num. Elements : 2
solvated_protein_fragments_JCTC_2019
Description : The solvated protein fragments dataset was generated as a partner benchmark dataset, along with SN2, for measuring the performance of machine learning...
Authors : Oliver T. Unke, Markus Meuwly
Elements : C, H, N, O, S
Num. Configurations : 2731180
Num. Atoms : 58395272
Num. Elements : 5
stable_and_metastable_phases_in_sputtered_CuInS2
Description : The chalcopyrite Cu(In,Ga)S2 has gained renewed interest in recent years due to its potential application in tandem solar cells. In this contribution,...
Authors : Jes Larsen, Kostiantyn Sopiha, Clas Persson, Charl...
Elements : Cu, In, Na, S
Num. Configurations : 3105
Num. Atoms : 117948
Num. Elements : 4
tmQM_wB97MV
Description : tmQM_wB97MV contains configurations from the tmQM dataset, with several structures from tmQM that were found to be missing hydrogens filtered out, and...
Authors : Aaron G. Garrison, Javier Heras-Domingo, John R. K...
Elements : Ag, As, Au, B, Br, C, Cd, Cl, Co, Cr, Cu, F, Fe, H, Hf, ...
Num. Configurations : 86507
Num. Atoms : 5710877
Num. Elements : 44
vanadium_in_high_entropy_alloys_AM2020
Description : Dataset created for "Vanadium is an optimal element for strengthening in both fcc and bcc high-entropy alloys", to explore the effect of V in the high...
Authors : Binglun Yin, Francesco Maresca, W. A. Curtin
Elements : Ni, V
Num. Configurations : 261
Num. Atoms : 22196
Num. Elements : 2
water_and_Cu+_synergy_in_selective_CO2_hydrogenation_to_methanol_over_Cu/MgO_catalysts
Description : This dataset was created to investigate the role of surface water and hydroxyl groups in facilitating spontaneous CO₂ activation at Cu⁺ sites and the ...
Authors : Estefanía Fernández Villanueva, Pablo Germán Luste...
Elements : C, Cu, H, Mg, O
Num. Configurations : 14962
Num. Atoms : 1043709
Num. Elements : 5
water_ice_JCP_2020
Description : Starting from a single reference ab initio simulation, we use active learning to expand into new state points and to describe the quantum nature of th...
Authors : Christoph Schran, Kyrstof Brezina, Ondrej Marsalek
Elements : H, O
Num. Configurations : 8000
Num. Atoms : 2088000
Num. Elements : 2
water_ice_NEP_2023
Description : The main part of the dataset consists of structures of liquid water at 300 K from first-principles molecular dynamics (FPMD) simulations using a hybri...
Authors : Zekun Chen, Margaret L. Berrens, Kam-Tung Chan, Zh...
Elements : H, O
Num. Configurations : 814
Num. Atoms : 216144
Num. Elements : 2
water_ice_PNAS_2021
Description : Dataset generated using a committee-based active learning strategy to build a training dataset for modeling complex aqueous systems.
Authors : Christoph Schran, Fabian L. Thiemann, Patrick Rowe...
Elements : B, C, F, H, Mo, N, O, S, Ti
Num. Configurations : 1786
Num. Atoms : 681912
Num. Elements : 9
xxMD-CASSCF_test
Description : Test dataset from xxMD-CASSCF. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic trajectorie...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 65100
Num. Atoms : 707552
Num. Elements : 5
xxMD-CASSCF_train
Description : Training dataset from xxMD-CASSCF. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic traject...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 130179
Num. Atoms : 1411656
Num. Elements : 5
xxMD-CASSCF_validation
Description : Validation dataset from xxMD-CASSCF. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic traje...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 64848
Num. Atoms : 701516
Num. Elements : 5
xxMD-DFT_test
Description : Test dataset from xxMD-DFT. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic trajectories e...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 21661
Num. Atoms : 402856
Num. Elements : 5
xxMD-DFT_train
Description : Training dataset from xxMD-DFT. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic trajectori...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 43395
Num. Atoms : 807416
Num. Elements : 5
xxMD-DFT_validation
Description : Validation dataset from xxMD-DFT. The xxMD (Extended Excited-state Molecular Dynamics) dataset is a comprehensive collection of non-adiabatic trajecto...
Authors : Zihan Pengmei, Yinan Shu, Junyu Liu
Elements : C, H, N, O, S
Num. Configurations : 21606
Num. Atoms : 402151
Num. Elements : 5