Data Mining Crystallization Kinetics

Cameron Brown; Diego Maldonado; Antony Vassileiou; Blair Johnston; Alastair Florence

doi:10.26434/chemrxiv.11708286.v3

Chemical Engineering and Industrial Chemistry

Search within Chemical Engineering and Industrial Chemistry

Data Mining Crystallization Kinetics

11 August 2020, Version 3

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Population balance model is a valuable modelling tool which facilitates the optimization and understanding of crystallization processes. However, in order to use this tool, it is necessary to have previous knowledge of the crystallization kinetics, specifically crystal growth and nucleation. The majority of approaches to achieve proper estimations of kinetic parameters required experimental data. Across time, a vast literature about the estimation of kinetic parameters and population balances have been published. Considering the availability of data, this work built a database with information on solute, solvent, kinetic expression, parameters, crystallization method and seeding. Correlations were assessed and clusters structures identified by hierarchical clustering analysis. The final database contains 336 data of kinetic parameters from 185 different sources. The data were analysed using kinetic parameters of the most common expressions. Subsequently, clusters were identified for each kinetic model. With these clusters, classification random forest models were made using solute descriptors, seeding, solvent, and crystallization methods as classifiers. Random forest models had an overall classification accuracy higher than 70% whereby they were useful to provide rough estimates of kinetic parameters, although these methods have some limitations.

Keywords

crystallization kinetics

population balance model

data mining

machine learning

random forest

Supplementary materials

Title

Description

Actions

Title

moe descriptors

Description

Actions

Title

dataset raw v2

Description

Actions

Title

dataset preprocessed v2

Description

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Data mining crystallization kinetics

Diego A. Maldonado, Antony Vassileiou, Blair Johnston, Alastair J. Florence, Cameron J. Brown journal article

Digital Discovery , Volume 1, Issue 5

Online publication date: 2022

Version History

Aug 11, 2020 Version 3

Jan 31, 2020 Version 2

Jan 29, 2020 Version 1

Version Notes

Updates to datasets: id 326,327, 328, 329, 330 - order of magnitude of kg corrected id 220 - order of magnitude of kg corrected id 268 - reference updated to that of original experimental work. Value of kg updated assuming published units of micron/min.

Metrics

5,371

1,173

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv.11708286.v3

Funding

EP/P006965/1

Author’s competing interest statement

No conflict of interest

Data Mining Crystallization Kinetics

Authors

Abstract

Keywords

Supplementary materials

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Share