Design of CO2-philic Molecular Units with Large Language Models

08 April 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The integration of large language models (LLMs) into chemical sciences presents a transformative approach for molecular design. In this study, we explore the capabilities of LLMs for generating novel molecular structures with enhanced CO₂ affinity for the development of novel physisorptionbased carbon capture technologies. A hybrid workflow was employed, combining pretrained LLMs (GPT-4o, Llama-3, and Gemini 2.0) with computational chemistry techniques. LLMs were prompted to propose molecular candidates, which were subsequently evaluated using density functional theory (DFT) to assess their interaction energies with CO₂. Through iterative model refinement and expert-guided modifications, a dataset of 95 molecular units was generated, revealing several promising candidates with CO₂ interaction energies exceeding the -7 kcal/mol threshold for optimal physisorption. Notably, LLM-generated structures showcased emergent design strategies, such as cooperative binding motifs, that aligned with domain knowledge and experimental precedent. This study highlights the potential of LLMs as powerful tools for molecular discovery while underscoring the need for expert oversight and rigorous validation in AI-driven chemical research.

Keywords

LLMs
CO2 capture
noncovalent interactions
DFT

Supplementary materials

Title
Description
Actions
Title
Supporting Information for Design of CO2-philic Molecular Units with Large Language Models
Description
Supporting Information for Design of CO2-philic Molecular Units with Large Language Models
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.