Organic light-emitting diodes (OLEDs) have gained widespread commercial use, yet there is a continuous need to identify innovative emitters that offer higher efficiency and broader color gamut. To effectively screen out promising OLED molecules that are yet to be synthesized, we perform a representation learning aided high throughput virtual screening (HTVS) over millions of Ir(III) complexes, a prototypical type of phosphorescent OLED material, constructed via a random combination of 278 reported ligands. We successfully screen out a decent amount of promising candidates for both display and lighting purposes, which are worth further experimental investigation. The high efficiency and accuracy of our model are largely attributed to the pioneering attempt of using representation learning to organic luminescent molecules, which is initiated by a pre-training procedure with over 1.6 million 3D molecular structures and frontier orbital energies predicted via semi-empirical methods, followed by a fine-tune scheme via the quantum mechanical computed properties over around 1500 candidates. Such workflow enables an effective model construction process that is otherwise hindered by the scarcity of labeled data, and can be straightforwardly extended to the discovery of other novel materials.
Supporting Information for “Automatic Screen-out of Ir(III) Complex emitters by Combined Machine Learning and Computational Analysis”