Machine Learning-Assisted Parameter Dataset Establishment and Query System Design of the “POSTParamQuery” Web Application within the Preferential Occupancy Site Theory (POST) Framework: Practical Applications in Doped Luminescent Materials

10 April 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Identifying the crystallographic positions of dopant ions in doped inorganic materials is a longstanding challenge that has impeded precise control over material properties. To address this, we introduce a robust theoretical framework along with detailed methodologies to evaluate dopant ion occupancy within a host lattice, taking into account the associated energetic factors. The preferential occupancy site theory (POST) provides a robust framework for predicting lattice sites of the dopant in doped inorganic materials. To facilitate the application of POST in inorganic material design, a comprehensive approach combining parameter dataset and the creation of a user-friendly web application query system named “POSTParamQuery” is presented. First, a standardized dataset of critical parameters, including atomization energy coefficients Jij for various bond types along with empirical bond valence constants (dij0 and bij0) necessary for accurate POST evaluations, was established. Comparing five distinct machine learning (ML) models, identifying XGBoost and Linear Regression model as the one with the best predictive ability, achieving an accuracy (R2) of 0.9229 and 0.9615 in predicting Jij and dij0, respectively. XGBoost and Linear Regression were utilized to predict and establish 4,698 groups of bond parameters for different types of bonds, generating a total of 14,094 standardized parameter data used by POST. Upon this foundation, an accessible web application parameter query system was established, enabling efficient retrieval and analysis of parameter data, thereby streamlining the optimization of material properties. To validate the framework, POST was applied to a series of technologically significant luminescent materials, including Y2O3:Eu3+, Zn3(PO4)2:Mn2+, Sr4Al14O25:Eu2+, Sr2MgSi2O7:Eu2+,Dy3+, Sr2Al6O11:Eu2+, ZnAl2O4:Fe3+, CaGa4O7:Mn2+, and Ca5(PO4)3Cl:Sb3+, Mn2+. Through computational analyses, specific lattice sites occupied by dopant ions such as Eu3+, Mn2+, Eu2+, Dy3+, Fe3+, Mn2+, and Sb3+ were accurately predicted. Notably, The calculated results demonstrate a high degree of consistency with previously reported experimental findings, suggesting that further experiments will likely reinforce this agreement. This outcome highlights the reliability and predictive power of the POST method. This work not only establishes a practical toolset for optimizing existing luminescent materials but also provides a theoretical foundation for the rational design of new functional materials. By integrating standardized parameters dataset, advanced computational methods, and an accessible web application query system, our study advances the field of materials science by bridging fundamental theory with practical applications.

Supplementary materials

Title
Description
Actions
Title
Supporting material
Description
Supporting Information for ML-Assisted Parameter Dataset Establishment and Query System Design of the “POSTParamQuery” Web Application within the Preferential Occupancy Site Theory (POST) Framework: Practical Applications in Doped Luminescent Materials
Actions
Title
Excel_1_True_Jij
Description
the standard true Jij
Actions
Title
Excel_2_POST_Jij_True_INPUT_processed_data
Description
standard true Jij data
Actions
Title
Excel_3_Jij_predict_1_input_processed_data
Description
the standard input features of compounds with XGBoost prediction 1 model
Actions
Title
Excel_4_Jij_predicted_1_result
Description
Predicted result of Jij using the XGBoost prediciton 1 method
Actions
Title
Excel_5_Jij_predicted_2_input_processed_data
Description
Input data of compounds for predicting Jij using the XGBoost Prddiction 2 method
Actions
Title
Excel_6_Jij_predicted_2_result
Description
Predicted result of Jij using the XGBoost prediction 2 method
Actions
Title
Excel_7_POST_d0_true
Description
the original data of the valence bond parameter d0
Actions
Title
Excel_8_POST_d0_true_clear
Description
the original data of d0 after delete some bonds without the features data
Actions
Title
Excel_9_d0_predicted_1_input_processed_data
Description
Input data of prediciton d0 using the LR prediction method
Actions
Title
Excel_10_d0_predicted_1_result
Description
Prediction result of the valence bond parameter d using the LR prediciton 1 method
Actions
Title
Excel_11_d0_predicted_2_input_processed_data
Description
Input data of the valence bond parameter d using the LR prediciton 2 method
Actions
Title
Excel_12_d0_predicted_2_result
Description
Prediciton result of the valence bond parameter d using the LR prediciton 2 method
Actions
Title
Excel_13_b
Description
the database of the valence bond parameter b0
Actions
Title
Excel_14_POST_b0_d0_Jij
Description
the final dataset of the POST parameters
Actions
Title
POSTParamQuery
Description
the “POSTParamQuery” Web Application for query POST parameters
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.