Theoretical and Computational Chemistry

Beam Search Sampling for Molecular Design and Intrinsic Prioritization with Machine Intelligence

Abstract

Chemical language models enable de novo drug design without the requirement for explicit molecular construction rules. While such models have been applied to generate novel compounds with desired bioactivity, the actual prioritization and selection of the most promising computational designs remains challenging. In this work, we leveraged the probabilities learnt by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring. Prospective application of this method yielded three novel inverse agonists of retinoic acid receptor-related orphan receptors (RORs). Each design was synthesizable in three reaction steps and presented low-micromolar to nanomolar potency towards RORg. This model-intrinsic sampling technique eliminates the strict need for external compound scoring functions, thereby further extending the applicability of generative artificial intelligence to data-driven drug discovery.

Content

Thumbnail image of Beam_Search_for_Molecular_Design.pdf

Supplementary material

Thumbnail image of Supplementary_Information.pdf
Supplementary Information