BMC Public Health. 2025 May 23;25(1):1904. doi: 10.1186/s12889-025-23075-7.
ABSTRACT
BACKGROUND: The incidence of cardiovascular metabolic diseases (CMD) continues to rise among middle-aged and elderly populations, affecting not only physical health but also significantly increasing the risk of depression. This study aims to construct a machine learning model to predict the risk of depression in middle-aged and elderly patients with CMD and to identssify key risk factors.
METHODS: Based on data from the China Health and Retirement Longitudinal Study (CHARLS) from 2018 to 2020, 4,477 patients aged 45 and above were included. LASSO regression was used to screen for risk factors, and three machine learning algorithms-logistic regression (LR), random forest (RF), and XGBoost-were employed to build predictive models. The performance of the models was evaluated using ROC curves, calibration curves, and decision curves.
RESULTS: The study found several risk factors significantly associated with depression, including disability status, pain, retirement status, number of chronic diseases, education level, age, gender, place of residence, life satisfaction, optimism about the future, and self-rated health status. The incidence of depression was significantly higher among women (56%), rural residents (64%), individuals with disabilities, non-retirees (85%), and those with chronic illnesses (73%). The LR model demonstrated the best predictive performance, with an AUC of 0.69. Key predictive factors included self-rated health, residence, education level, gender, pain, life satisfaction, age, and hope for the future.
CONCLUSION: This study developed a depression risk prediction model based on logistic regression, providing important references for psychological health interventions in middle-aged and elderly patients with CMD. Identifying and intervening in high-risk populations is crucial for improving patients’ quality of life.
PMID:40410764 | DOI:10.1186/s12889-025-23075-7
Recent Comments