Dataset
solvated_protein_fragments_JCTC_2019
Download Dataset XYZ files Download Dataset Parquet files
Name | solvated_protein_fragments_JCTC_2019 |
---|---|
Extended ID | solvated_protein_fragments_JCTC_2019__Unke-Meuwly__DS_ctjgc03xdauc_0 |
Description | The solvated protein fragments dataset was generated as a partner benchmark dataset, along with SN2, for measuring the performance of machine learning models, in particular PhysNet, at describing chemical reactions, long-range interactions, and condensed phase systems. The dataset contains structures for all possible "amons" (hydrogen-saturated covalently bonded fragments) of up to eight heavy atoms (C, N, O, S) that can be derived from chemical graphs of proteins containing the 20 natural amino acids connected via peptide bonds or disulfide bridges. For amino acids that can occur in different charge states due to (de)protonation (i.e., carboxylic acids that can be negatively charged or amines that can be positively charged), all possible structures with up to a total charge of +-2e are included. In total, the dataset provides reference energies, forces, and dipole moments for 2,731,180 structures calculated at the revPBE-D3(BJ)/def2-TZVP level of theory using ORCA 4.0.1. |
Authors |
Oliver T. Unke Markus Meuwly |
DOI |
10.60732/c4731f07
https://commons.datacite.org/doi.org/10.60732/c4731f07 https://doi.datacite.org/dois/10.60732%2Fc4731f07 https://doi.org/10.60732/c4731f07 Cite as: Unke, O. T., and Meuwly, M. "solvated protein fragments JCTC 2019." ColabFit, 2023. https://doi.org/10.60732/c4731f07. For other citation formats, see the DataCite Fabrica page for this dataset. |
Calculated Property Types |
atomic_forces cauchy_stress energy |
Elements |
C (19.27%) H (63.04%) N (4.88%) O (11.69%) S (1.13%) |
Number of Configurations | 2,731,180 |
Number of Atoms | 58,395,272 |
Links |
https://doi.org/10.5281/zenodo.2605372 https://doi.org/10.1021/acs.jctc.9b00181 |
Configuration Sets by Name | (None) |
Configuration Sets by ID | (None) |
Calculated Properties | |
ColabFit ID | DS_ctjgc03xdauc_0 |
Files | colabfitspec.json |
No uploaded content is transferred in ownership from the original creators to ColabFit. All content is distributed under the license specified by its contributor who has stated that he or she has the authority to share it under the specified license.