Dataset

ANI-1xBB




Species content of dataset


Name :
ANI-1xBB
Authors :
Shuhao Zhang, Roman Zubatyuk, Yinuo Yang, Adrian Roitberg, Olexandr Isayev
Description :
ANI-1xBB is a dataset of approximately 13.1 million nonequilibrium conformers of small organic molecules (H, C, N, O only; up to 7 heavy atoms; up to 23 atoms total), designed to support the training of reactive machine learning interatomic potentials. Single-point quantum chemistry properties were computed at three electronic temperatures (T_el = 0, 1000, and 5000 K) using B97-3c composite DFT in ORCA 4.2.1 via finite-temperature DFT (Fermi smearing). All geometries were treated as closed-shell (charge = 0, mult = 1); Fermi smearing at T_el = 5000 K approximates the superposition of closed- and open-shell states during bond dissociation and is the primary labeling scheme used for model training in the associated publication. This dataset contains the T_el = 5000 K (b973c_etemp5000) energies and forces; data at T_el = 0 K and 1000 K are available in the original source files. Configuration sets represent: constrained geometry optimization steps (snap_source='opt', ~9% of data) and fixed-distance NVT MD snapshots (snap_source='md', ~91% of data).
Cite As :
Zhang, S., Zubatyuk, R., Yang, Y., Roitberg, A., and Isayev, O. "ANI-1xBB." ColabFit, 2025. https://doi.org/None.
ColabFit ID :
Date Added :
2026-05-24
License :
CC-BY-NC-ND-4.0
Downloads :
0
Num. Configurations :
13,144,877
Num. Atoms :
184,872,744
Calculated Property Types :
atomic_forces energy
Elements :
C (28.89%) H (53.71%) N (10.09%) O (7.31%)
Methods :
DFT-B97-3c
Software :
ORCA 4.2.1
Spec File :
Configuration Sets by Name :
Configuration Sets by ID :
Dataset viewer powered by Hugging Face

No uploaded content is transferred in ownership from the original creators to ColabFit. All content is distributed under the license specified by its contributor who has stated that he or she has the authority to share it under the specified license.