Intelligent Molecular Identification for High Performance Organosulfide Capture Using Active Machine Learning Algorithm

Yuxiang  Chen; Chuanlei Liu; Yang An; Yue  Lou; Yang Zhao; Cheng Qian; Hao Jiang; Kongguo Wu; Xianghui Zhang; Hui Sun; Di Wu; Benxian Shen; Fahai Cao

doi:10.26434/chemrxiv-2021-hczsl

Chemical Engineering and Industrial Chemistry

Search within Chemical Engineering and Industrial Chemistry

Intelligent Molecular Identification for High Performance Organosulfide Capture Using Active Machine Learning Algorithm

26 November 2021, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning and computer-aided approaches significantly accelerate molecular design and discovery in scientific and industrial fields increasingly relying on data science for efficiency. The typical method used is supervised learning which needs huge datasets. Semi-supervised machine learning approaches are effective to train unlabeled data with improved modeling performance, whereas they are limited by the accumulation of prediction errors. Here, to screen solvents for removal of methyl mercaptan, a type of organosulfur impurities in natural gas, we constructed a computational framework by integrating molecular similarity search and active learning methods, namely, molecular active selection machine learning (MASML). This new model framework identifies the optimal molecules set by molecular similarity search and iterative addition to the training dataset. Among all 126,068 compounds in the initial dataset, 3 molecules were identified to be promising for methyl mercaptan (MeSH) capture, including benzylamine (BZA), p-methoxybenzylamine (PZM), and N,N-diethyltrimethylenediamine (DEAPA). Further experiments confirmed the effectiveness of our modeling framework in efficient molecular design and identification for capturing methyl mercaptan, in which DEAPA presents a Henry's law constant 89.4% lower than that of methyl diethanolamine (MDEA).

Supplementary materials

Title

Description

Actions

Title

Supporting Information - Intelligent Molecular Identification for High Performance Organosulfide Capture Using Active Machine Learning Algorithm

Description

1. Methods 2. Tables 3. Figures 4. References

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Intelligent Molecular Identification Approach to High-Efficiency Solvents for Organosulfide Capture Using the Active Machine Learning Framework

Yuxiang Chen, Chuanlei Liu, Yang An, Yue Lou, Yang Zhao, Cheng Qian, Hao Jiang, Kongguo Wu, Benxian Shen, Xianghui Zhang, Fahai Cao, Di Wu, Hui Sun journal article

Energy & Fuels , Volume 37, Issue 16

Online publication date: Aug 02, 2023

Version History

Nov 26, 2021 Version 1

Metrics

1,111

287

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2021-hczsl

Funding

National Natural Science Foundation of China

2187809

National Natural Science Foundation of China

22178109

Natural Science Foundation of Shanghai

21ZR1417700

Washington State University

institutional funds from the Gene and Linda Voiland School of Chemical Engineering and Bioengineering

Washington State University

institutional funds from the Alexandra Navrotsky Institute for Experimental Thermodynamics

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) declare that they have sought and gained approval from the relevant ethics committee/IRB for this research and its publication.

Intelligent Molecular Identification for High Performance Organosulfide Capture Using Active Machine Learning Algorithm

Authors

Abstract

Supplementary materials

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share