You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This issue proposes some ideas for ML tutorial content using the data on QCArchive. These examples should focus on use cases for quantum chemical data in ML, including e.g. supervised learning of relationships between:
structure and QC properties
QC properties and function
and e.g. unsupervised learning of:
molecule clusters on the basis of QC properties
interest in different classes of molecules or theory methods based on distributions found in the QCArchive data
These examples should demonstrate the key advantages of QCArchive as a distribution method for ML data versus the current model of SI and Figshare: uniform data formats, interoperability/composability, trusted provenance, and discovery of new datasets.
Some specific ideas for examples:
Train a model on QM7b to predict DFT energy from molecular geometries using Coulomb, SLATM, and SOAP features with a kernel method. Test the model on QM9 or GDB-13. Train a model with a combination of datasets (e.g. QM7b + QM9).
Fit a water model using THG's water cluster dataset to the TIP-4P functional form, perhaps using bayesian regression.
Placeholder for something using a generative model.
The text was updated successfully, but these errors were encountered:
This issue proposes some ideas for ML tutorial content using the data on QCArchive. These examples should focus on use cases for quantum chemical data in ML, including e.g. supervised learning of relationships between:
and e.g. unsupervised learning of:
These examples should demonstrate the key advantages of QCArchive as a distribution method for ML data versus the current model of SI and Figshare: uniform data formats, interoperability/composability, trusted provenance, and discovery of new datasets.
Some specific ideas for examples:
The text was updated successfully, but these errors were encountered: