Abstract
Developing amorphous polymers with desirable thermal conductivity has significant implications, as they are ubiquitous in applications where thermal transport is critical. Conventional Edisonian approaches are slow and without guarantee of success in material development. In this work, using a reinforcement learning scheme, we design polymers with thermal conductivity above 0.4 W/m- K. We leverage a machine learning model trained against 469 thermal conductivity data calculated from high-throughput molecular dynamics (MD) simulations as the surrogate for thermal conductivity prediction, and we use a recurrent neural network trained with around one million virtual polymer structures as a polymer generator. For all newly generated polymers with thermal conductivity > 0.400 W/m-K, we have evaluated their synthesizability by calculating the synthesis accessibility score and validated the thermal conductivity of selected polymers using MD simulations. The best thermally conductive polymer designed has a MD-calculated thermal conductivity of 0.693 W/m-K, which is also estimated to be easily synthesizable. Our demonstrated inverse design scheme based on reinforcement learning may advance polymer development with target properties, and the scheme can also be generalized to other materials development tasks for different applications.