About Data to Knowledge
Request To Join
Machine Learning in materials modelling has revolutionised and transformed the field in recent years. Based on availability of good data from computations and/or experiment one can use machine learning to train expert systems using large language models (à la chatGPT), train surrogate models to predict properties or structure avoiding the use of simulations or simply speed up simulations by using machine learnt interatomic potentials (MLIPs). However, these advances require development of new infrastructure to capture, store and distribute ML models. Work in this pathfinder is dedicated to making possible the creation of these models by providing data infrastructure, workflows for creation and exploitation of machine learnt models.
This work begins with exploring the field of machine learning interatomic potentials which is a key technique in atomistic modelling. The development focuses around three key areas:
- Production of aiida-mlip plugin for AiiDA – enhances provenance tracking, reproducibility and sharing of processes and data
- Janus-core tools for integration of MLIPs
- abcd – specialised database for MLIP training data, faster searching through implementation of opensearch