Abstract
As Machine-Learned CVs (MLCVs) are becoming increasingly relevant in the molecular simulation literature, we discuss the necessary conditions to enable reproducibility in the calculation and representation of free energy surfaces (FES). We note that the variability of the training process, as well as the roughness of the hyperparameter space, impose inherent limits on the reproducibility of results even when the mathematical structure of the model defining a CV is consistent. To this end, we propose the adoption of a Geometric (gauge invariant) Free Energy representation to obtain consistent free energy differences across training instances and architectures. Further, we introduce a normalisation factor to model gradients for biased enhanced sampling. This factor, effectively unifies Free Energy definitions and addresses practical issues preventing the widespread use and deployment of MLCVs.