Materials Chemistry

Interpretable Machine Learning of Two-Photon Absorption



Molecules with strong two-photon absorption (TPA) are important in many advanced applications such as upconverted laser and photodynamic therapy, but their design is hampered by the high cost of experimental screening and accurate quantum chemical (QC) calculations. Here we perform a systematic study by collecting and analyzing with interpretable machine learning (ML) experimental TPA database with ca. 900 molecules. We uncovered that only very few molecular features are sufficient to explain the TPA magnitudes. The most important feature is conjugation length (rather than area as believed before) followed by features reflecting effects of donor and acceptor substitution and coplanarity. These features are used to create a very fast ML model with prediction errors of similar magnitude compared to experimental and affordable QC meth-ods errors. Our ML model has the potential for high-throughput screening as additionally validated with our new experimental measurements.

Version notes

We modified the title, abstract, and key words of the paper to more accurately represent its content and highlights the major finding.


Thumbnail image of ML-TPA-220821-CHemRxiv.pdf

Supplementary material

Thumbnail image of ML-TPA si220820.pdf
Supplementary information
Supplementary figures; details of the models; details of the descriptors
Thumbnail image of
dataset for this study, including the descriptors and code