Author: Xu, Zhao; Luo, Youzhi; Zhang, Xuan; Xu, Xinyi; Xie, Yaochen; Liu, Meng; Dickerson, Kaleb; Deng, Cheng; Nakata, Maho; Ji, Shuiwang
Title: Molecule3D: A Benchmark for Predicting 3D Geometries from Molecular Graphs Cord-id: 75k11bn3 Document date: 2021_9_30
ID: 75k11bn3
Snippet: Graph neural networks are emerging as promising methods for modeling molecular graphs, in which nodes and edges correspond to atoms and chemical bonds, respectively. Recent studies show that when 3D molecular geometries, such as bond lengths and angles, are available, molecular property prediction tasks can be made more accurate. However, computing of 3D molecular geometries requires quantum calculations that are computationally prohibitive. For example, accurate calculation of 3D geometries of
Document: Graph neural networks are emerging as promising methods for modeling molecular graphs, in which nodes and edges correspond to atoms and chemical bonds, respectively. Recent studies show that when 3D molecular geometries, such as bond lengths and angles, are available, molecular property prediction tasks can be made more accurate. However, computing of 3D molecular geometries requires quantum calculations that are computationally prohibitive. For example, accurate calculation of 3D geometries of a small molecule requires hours of computing time using density functional theory (DFT). Here, we propose to predict the ground-state 3D geometries from molecular graphs using machine learning methods. To make this feasible, we develop a benchmark, known as Molecule3D, that includes a dataset with precise ground-state geometries of approximately 4 million molecules derived from DFT. We also provide a set of software tools for data processing, splitting, training, and evaluation, etc. Specifically, we propose to assess the error and validity of predicted geometries using four metrics. We implement two baseline methods that either predict the pairwise distance between atoms or atom coordinates in 3D space. Experimental results show that, compared with generating 3D geometries with RDKit, our method can achieve comparable prediction accuracy but with much smaller computational costs. Our Molecule3D is available as a module of the MoleculeX software library (https://github.com/divelab/MoleculeX).
Search related documents:
Co phrase search for related documents- loss function and lung infection: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
- loss function and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
- loss function and machine learning model: 1, 2, 3
- low dimensional and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
- low dimensional and machine learning model: 1
- low dimensional representation and machine learning: 1, 2
- low energy and lung infection: 1
- low energy and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9
- low energy and machine learning model: 1
- lung infection and machine learn: 1
- lung infection and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
- lung infection and machine learning model: 1, 2
- lung infection and mae absolute error: 1
- machine learning and mae absolute error: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
- machine learning model and mae absolute error: 1, 2, 3, 4
Co phrase search for related documents, hyperlinks ordered by date