Abstract
Molecular generative models based on deep learning have increasingly gained attention for their ability in de novo polymer design. However, there remains a knowledge gap in the thorough evaluation of these models. This benchmark study explores de novo polymer design using six popular deep generative models: Variational Autoencoder (VAE), Adversarial Autoencoder (AAE), Objective-Reinforced Generative Adversarial Networks (ORGAN), Character-level Recurrent Neural Network (CharRNN), REINVENT, and GraphINVENT. Various metrics highlighted the excellent performance of CharRNN, REINVENT, and GraphINVENT, particularly when applied to the real polymer dataset, while VAE and AAE show more advantages in generating hypothetical polymers. The CharRNN, REINVENT, and GraphINVENT models were further trained on real polymers utilizing reinforcement learning methods, targeting the generation of hypothetical polymers with high glass transition temperatures. The findings of this study provide critical insights into the capabilities and limitations of each generative model, offering valuable guidance for future endeavors in polymer design and discovery.