Lipophilicity is a physicochemical property with wide relevance in drug design and is applied in areas such as food chemistry, environmental chemistry, and computational biology. This descriptor strongly influences the absorption, distribution, permeability, bioaccumulation, protein-binding, and biological activity of bioorganic compounds. Lipophilicity is commonly expressed as the n-octanol/water partition coefficient (PN) for neutral molecules, whereas for molecules with ionizable groups, the distribution coefficient (D) at a given pH is used. The logDpH is usually predicted using a pH correction over the logPN using the pKa of ionizable molecules, while often ignoring the apparent ionic partition (PIapp) because of the challenge of predicting the partitioning of the charged species and/or related species (e.g., ion-pairs, counterions, molecular aggregates). In this work, we studied the impact of "P" _"I" ^"app" on the prediction of both the experimental lipophilicity of small molecules and experimental lipophilicity-based applications and metrics such as lipophilic efficiency (LipE), distribution of spiked drugs in milk products, and pH-dependent partition of water contaminants in synthetic passive samples such as silicones. Our findings show that better predictions are obtained by considering the apparent ionic partition, whereas ignoring its contribution can lead to inadequate experimental simplifications and/or computational predictions. In this context, we developed machine learning algorithms to determine the cases that "PIapp" should be considered. The results indicate that small, rigid, and unsaturated molecules with logPN close to zero, which present a significant proportion of ionic species in the aqueous phase, were better modeled using the apparent ionic partition (PIapp). In addition, we validated our findings using a test and two external sets, which included small molecules and amino acid analogs, where the logistic regressions, random forest classifications, and support vector machine models predicted better formalism to determine the logDpH for each molecule with high accuracies, sensitivities, and specificities. Finally, our findings can serve as guidance to the scientific community working in early-stage drug design, food, and environmental chemistry who deal with ionizable molecules, to determine a priori which pH-dependent lipophilicity profile should be used in their research and applications depending on the structure of a substance.
Tables and Figures