Reliability, Item Functioning, and Gender Bias of the CES-D Scale in Community-Dwelling Elderly: Findings from the ELSA Cohort

Braz J Psychiatry. 2025 Oct 5. doi: 10.47626/1516-4446-2025-4401. Online ahead of print.

ABSTRACT

OBJECTIVE: To evaluate the item performance of the Center for Epidemiologic Studies Depression (CES-D) scale.

METHODS: Participants were adults aged 50 and older from the English Longitudinal Study of Ageing (ELSA). Using classical test theory and item response theory, data from 11,612 participants were analyzed to estimate reliability, item discrimination (a), and item difficulty (b). Differential Item Functioning (DIF) analyses assessed whether individuals from different gender groups responded differently to items despite similar depressive symptom levels.

RESULTS: The CES-D demonstrated adequate internal consistency (α = 0.80; ω = 0.85), with a lower marginal reliability (0,65). Around 60% of participants endorsed at least one depressive symptom. All items showed moderate to higher levels of discrimination (a > 0.66), with “slept restlessly” most frequently endorsed (b = 0.43), and “felt lonely” the hardest to endorse (b = 1.59). Four items – “slept restlessly”, “felt lonely”, “felt sad”, and “could not get going” – exhibited significant DIF, with women more likely to endorse these items than men at equivalent symptom levels.

CONCLUSIONS: CES-D items showed acceptable reliability and effectively captured varying depression severity. Despite some DIF, no substantial gender-related measurement bias was found, supporting the scale’s use for screening in older adult populations.

PMID:41046569 | DOI:10.47626/1516-4446-2025-4401

Reliability, Item Functioning, and Gender Bias of the CES-D Scale in Community-Dwelling Elderly: Findings from the ELSA Cohort

Submit a Comment Cancel reply

Recent Posts

Recent Comments