Machine learning has been emerging as a promising tool in the chemical and materials domain. In this paper, we introduce a framework to automatically perform rational model selection and hyperparameter optimization that are important concerns for the efficient and successful use of machine learning, but have so far largely remained unexplored by this community. The framework features four variations of genetic algorithm and is implemented in the chemml program package. Its performance is benchmarked against popularly used algorithms and packages in the data science community and the results show that our implementation outperforms these methods both in terms of time and accuracy. The effectiveness of our implementation is further demonstrated via a scenario involving multi-objective optimization for model selection.
download asset GA_hpo.pdf 0.64 MB [opens in a new tab] cloud_download
pdf : 0.64 MB
download asset GA_hpo-SI.pdf 0.08 MB [opens in a new tab] cloud_download
pdf : 0.08 MB