Multi-Modal Contrastive Learning for Chemical Structure Elucidation with VibraCLIP

23 April 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Multi-modal contrastive learning offers a powerful framework for molecular structure elucidation from vibrational spectra. VibraCLIP applies this approach by embedding infrared (IR), Raman spectra, and molecular graphs into a unified latent space, enabling accurate, on-the-fly, and scalable molecular identification. Aligning IR and Raman modalities boosts Top-1 retrieval accuracy from 12.4% to 62.9%, while incorporating standardized molecular mass elevates Top-25 accuracy to 98.9%, underscoring the value of chemically grounded anchoring features. VibraCLIP’s three-way alignment strategy captures the complementary nature of vibrational modes, offering substantial improvements over single-modality baselines. A lightweight fine-tuning protocol, updating only the final projection layer, enables robust generalization from theoretical to experimental datasets. This flexible, data-efficient framework transforms vibrational spectroscopy into a high-precision tool for molecular discovery. VibraCLIP establishes a new standard for AI-driven spectral interpretation, bridging molecular structure and spectroscopy with broad impact in fields as separated as drug discovery or astrochemical observations.

Keywords

Multi-Modal Learning
Smart Characterization
Vibrational Spectroscopy

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
The supporting information includes a detailed table of hyperparameter optimization, model implementation details and visualization of elucidated structures across different retrieval accuracies and model’s strategies for the experimental dataset. Providing additional insights into the performance and interpretability of the proposed approach.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.