Abstract
Accurately determining the global minima of a molecular structure is important in diverse scientific fields, including drug design, materials science, and chemical synthesis. Conformational search engines serve as valuable tools for exploring the extensive conformational space of molecules and identifying energetically favorable conformations. In this study, we present a comprehensive comparison of Auto3D, CREST, Balloon, and RDKit, which are freely available conformational search engines, to evaluate their effectiveness in locating the global minima. These engines employ distinct methodologies, including machine learning (ML) potential-based, semiempirical, and force field (FF) based approaches. Through rigorous analyses and employing novel approaches for validation, including the utilization of a unique physical property known as collisional cross section (CCS), which characterizes the molecular shape, size, and charge, we thoroughly assess the capabilities of these engines in generating conformation ensembles that effectively capture the global minima. To accomplish this, we created the gas-phase conformation library (GPCL) which currently consists of the full ensembles of 20 small molecules, which can be used by the community to validate any conformational search engine. Further members of the GPCL can be readily created for any molecule of interest using our standard workflow used to compute CCS values expanding the ability of the GPCL in validation exercises. These innovative validation techniques enhance our understanding of the conformational landscape and provide valuable insights into the performance of conformation generation engines. Our findings shed light on the strengths and limitations of each search engine, enabling informed decisions for their utilization in various scientific fields, where accurate molecular structure determination is crucial for understanding biological activity and designing targeted interventions. By facilitating the identification of reliable conformations, this study significantly contributes to enhancing the efficiency and accuracy of molecular structure determination, with a particular focus on metabolite structure elucidation. The findings of this research also provide valuable insights for developing effective workflows in predicting the structures of unknown compounds with high precision.
Supplementary materials
Title
Molecular Gas Phase Conformational Ensembles
Description
The supplementary information encompasses the specifics of Root Mean Square Deviation (RMSD) computation and the relative energy values associated with the various conformations of the 20 metabolites analyzed in this investigation.
Actions