DHDIP : An Interpretable Model for Hypertension and Hyperlipidemia Prediction Based on EMR Data
Background and Objective: Traditional hypertension and hyperlipidemia prediction models suffer from uneven modeling data sources, short sample sizes, and a lack of uniform standards for the index system, resulting in the model failing to fulfill clinical applications. To address this issue, this work will offer DHDIP, an interpretable hypertension and hyperlipidemia prediction model based on EMR data.Methods: First, we will select massive high-dimensional, unstructured EMR data as a unified modeling data source, and propose a pre-processing algorithm for EMR data to solve the problem that EMR data cannot be directly processed by machine learning algorithms. Second, a variety of mainstream models such as XGBoost, CatBoost, and RandomForest are selected for modeling, and the best adaptation algorithms are identified by performance comparison. Finally, the SHAP framework was introduced into the DHDIP model, thus identifying the main factors contributing to hypertension and hyperlipidemia, effectively enhancing the interpretability of the model.Results: The DHDIP model's MSE value is 0.0285, and its LOSS value is 0.0054, both of which are better than previous studies.Conclusion: The model balances performance and interpretability. Multi-objective learning allows for a more thorough analysis and prediction of the condition, which not only lowers the cost of disease prediction but also aids physicians in clinical diagnosis
Year of publication: |
[2022]
|
---|---|
Authors: | Liao, Bin ; Jia, Xiaoyao ; Zhang, Tao ; Sun, Ruina |
Publisher: |
[S.l.] : SSRN |
Saved in:
freely available
Saved in favorites
Similar items by person
-
ZHOU, KUI, (2008)
-
Wang, Ting, (2022)
-
Xie, Wenling, (2022)
- More ...