Yoked Learning in Molecular Data Science

16 August 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Active machine learning is an established and increasingly popular experimental design technique where the machine learning model can request additional data to improve the model’s predictive performance. It is generally assumed that this data is optimal for the machine learning model since it relies on the model’s predictions or model architecture and therefore cannot be transferred to other models. Inspired by research in pedagogy, we here introduce the concept of yoked machine learning where a second machine learning model learns from the data selected by another model. We found that in 48% of the benchmarked combinations, yoked learning performed similar or better than active learning. We analyze distinct cases in which yoked learning can improve active learning performance. In particular, we prototype Yoked Deep Learning (YoDeL) where a classic machine learning model provides data to a deep neural network, thereby mitigating challenges of active deep learning such as slow refitting time per learning iteration and poor performance on small datasets. In summary, we expect the new concept of yoked (deep) learning to provide a competitive option to boost the performance of active learning and benefit from distinct capabilities of multiple machine learning models during data acquisition, training, and deployment.

Keywords

machine learning
active learning
molecular data science

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supplemental Figures and Tables
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.