A consistent set of thermophysical properties of methane curated with machine learning

Matheus Maximo-Canadas; Rubens Caio Souza; Julio Cesar Duarte; Jakler Nichele; Leonardo Santos de Brito Alves; Luis Octavio Vieira Pereira; Ligia Gaigher Franco; Itamar Borges Jr

doi:10.26434/chemrxiv-2024-gthhs-v2

Chemical Engineering and Industrial Chemistry


10 April 2025, Version 2

This is not the most recent version. There is a newer version of this content available.

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Accurately predicting thermophysical properties across different physical states is essential for industrial and scientific applications. However, experimental measurements often exhibit variability and noise, requiring robust modeling approaches. In this work, we employ machine learning (ML) techniques to predict methane’s thermophysical properties in the liquid, vapor, and supercritical phases, including isobaric and isochoric heat capacities, density, volume, Joule-Thomson coefficients, enthalpies, sound speed, and viscosities, applying a recently developed approach (ACS Eng. Au, DOI: 10.1021/acsengineeringau.5c00001). We explored different ML algorithms and approaches, including Adaptive Boosting, Bagging, Decision Trees, Extra Trees, Gradient Boosting, Histogram-based Gradient Boosting Regression Tree, K-Nearest Neighbors, Light Gradient Boosting Machine, Nu-Support Vector Regression, Random Forest, Extreme Gradient Boosting, and Artificial Neural Networks. The ML models produced predictions that aligned more closely with the statistically treated National Institute of Standards and Technology (NIST) data than with the raw experimental data used to train them. These results highlight ML’s potential to identify and generalize complex patterns, smooth inherent noise, and manage the variability of different thermophysical properties. They indicate that ML models, particularly Extra Trees and Gradient Boosting, can provide a scalable alternative for thermophysical property predictions, with gains in consistency and efficiency over traditional methods. Although our approach does not eliminate preprocessing, it demonstrates that ML can effectively manage noisy data independently, offering a more efficient and cost-effective alternative to conventional pre- and post-processing techniques.
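The noise-smoothing behavior described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it uses a hand-written K-Nearest Neighbors regressor (one of the algorithm families listed above) on synthetic one-dimensional data. The smooth curve `reference`, the noise level, and the names `knn_predict`, `mean_abs_error`, and `smoothing_demo` are all assumptions made for illustration only, and no methane or NIST data are involved.

```python
import math
import random


def knn_predict(train_x, train_y, x, k=5):
    """Average the targets of the k training points nearest to x (1-D input)."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in nearest) / len(nearest)


def mean_abs_error(a, b):
    """Mean absolute difference between two equally long sequences."""
    return sum(abs(u - v) for u, v in zip(a, b)) / len(a)


def reference(t):
    """Hypothetical smooth curve standing in for a statistically treated reference."""
    return 2.0 + math.sin(t)


def smoothing_demo(n=200, noise_sd=0.5, k=10, seed=0):
    """Train on noisy samples, then compare predictions with both targets.

    Returns (error against the smooth reference, error against noisy held-out data).
    """
    rng = random.Random(seed)
    xs = [rng.uniform(0.0, 6.0) for _ in range(n)]
    # Synthetic "raw measurements": smooth curve plus Gaussian noise (assumption).
    ys = [reference(x) + rng.gauss(0.0, noise_sd) for x in xs]

    # Alternate points between a training set and a held-out set.
    train_x, train_y = xs[::2], ys[::2]
    test_x, test_y = xs[1::2], ys[1::2]

    preds = [knn_predict(train_x, train_y, x, k) for x in test_x]
    err_vs_reference = mean_abs_error(preds, [reference(x) for x in test_x])
    err_vs_noisy = mean_abs_error(preds, test_y)
    return err_vs_reference, err_vs_noisy


if __name__ == "__main__":
    err_ref, err_noisy = smoothing_demo()
    print(f"MAE vs smooth reference: {err_ref:.3f}")
    print(f"MAE vs noisy held-out data: {err_noisy:.3f}")
```

Because averaging over neighbors cancels part of the random scatter, the predictions in this toy setting land closer to the smooth curve than to the noisy held-out points. This is the same qualitative pattern the abstract reports for models trained on raw experimental data and compared against NIST values.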

Keywords

Methane

thermophysical properties

machine learning

data noise reduction

Supplementary materials

Supporting materials for methane

The Supporting Information is organized into four main sections. Section S1 (“Number of Data and Machine Learning Models”) summarizes the amount of experimental data from the literature that the National Institute of Standards and Technology (NIST) used to produce its equations. It also presents the ML models applied to each thermophysical property. Section S2 (“Experimental Data Visualization”) investigates the diversity of the input data and identifies potential anomalies, which are also considered during the model training process. Section S3 (“Best metrics”) reports the best performance metrics for each thermophysical property in each physical state. The last section, Section S4 (“Others relevant data”), provides guidance on how to access additional data generated in this study, namely the detailed performance metrics for all ML models applied to each property and the final hyperparameter configurations selected via GridSearchCV for each ML model and thermodynamic phase.

Supplementary weblinks

Zenodo: data and the computer codes


Version History

Jun 09, 2025 Version 3

Apr 10, 2025 Version 2

Nov 15, 2024 Version 1

Version Notes

Minor corrections

Metrics

Views: 699

Downloads: 205

License

The content is available under CC BY-NC-ND 4.0.

Funding

Conselho Nacional de Desenvolvimento Científico e Tecnológico: 304148/2018–0 and 409447/2018–8

Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro: 26/201.197/2021, E-26/211.046/2021, E-26/201.251/2022, and E-26/201.190/2021

Petrobras: code 2021/00093-5

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
