Publication: The effectiveness of a sentence completion test for depression screening using large language models
dc.contributor.author | Porkaew P. | |
dc.contributor.author | Zhu T. | |
dc.contributor.author | Li A. | |
dc.contributor.author | Chuenphitthayavut K. | |
dc.contributor.correspondence | Porkaew P. | |
dc.contributor.other | Srinakharinwirot University | |
dc.date.accessioned | 2025-08-26T19:00:02Z | |
dc.date.issued | 2025-09-01 | |
dc.description.abstract | Depressive symptoms pose a significant mental health challenge globally, including in Thailand. The aim of this study was to assess the effectiveness of a sentence completion test for depression using large language models (LLMs). To improve objectivity and reduce bias in assessments, this study detects and classifies the trends in depression, modernizing the screening process with a newly developed depression sentence completion test. This research examines the four key areas: 1) family, 2) society, 3) health, and 4) self-concept among 373 participants, aged 20 to 40. Additionally, four models were applied to test: 1) LLAMA 3.1-8B, 2) Gemma2-9B, 3) Qwen2-7B, and 4) Typhoon1.5-7B. The result revealed that health, self-concept, and DIFF (Difference) were strongly related to sentiment levels with strong positive values of 0.48, 0.49, and 0.54, respectively, which might mean that they are significant indicators of depression risk. Family and society had positive but lower values of 0.27 and 0.19, respectively. Statistical validation confirms model reliability with a 0.78 lower bound accuracy (p ≤ .05). In the evaluation of all Thai-compatible LLMs, random forest models consistently performed better than decision tree classifiers in the classification of depression risk. LLaMA3.1 and Gemma2 produced the highest sensitivity. Ethical problems have to be considered when using LLMs in mental health. Embedding diverse populations and dynamic updating to sample data in future studies will assure greater accuracy and generalization across different demographic groups. | |
dc.identifier.citation | Acta Psychologica Vol.259 (2025) | |
dc.identifier.doi | 10.1016/j.actpsy.2025.105425 | |
dc.identifier.eissn | 18736297 | |
dc.identifier.issn | 00016918 | |
dc.identifier.scopus | 2-s2.0-105013662119 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14740/50354 | |
dc.rights.holder | SCOPUS | |
dc.subject | Arts and Humanities | |
dc.subject | Psychology | |
dc.title | The effectiveness of a sentence completion test for depression screening using large language models | |
dc.type | Article | |
dspace.entity.type | Publication | |
oaire.citation.title | Acta Psychologica | |
oaire.citation.volume | 259 | |
oairecerif.author.affiliation | University of Chinese Academy of Sciences | |
oairecerif.author.affiliation | Beijing Forestry University | |
oairecerif.author.affiliation | Institute of Psychology Chinese Academy of Sciences | |
oairecerif.author.affiliation | Srinakharinwirot University | |
oairecerif.author.affiliation | Thailand National Electronics and Computer Technology Center | |
swu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105013662119&origin=inward |