Developing Deep Learning Models for Depression Detection in Texts

dc.contributor.advisorEick, Christoph F.
dc.contributor.committeeMemberChen, Guoning
dc.contributor.committeeMemberTsekos, Nikolaos V.
dc.contributor.committeeMemberYuan, Xiaojing
dc.creatorAigbe, Steve Aibuedefe
dc.date.accessioned2024-01-27T01:54:06Z
dc.date.createdDecember 2023
dc.date.issued2023-12
dc.date.updated2024-01-27T01:54:07Z
dc.description.abstractDepression is a major mental health disorder affecting a significant portion of the world population. Methods mostly being employed for depression detection are clinical interviews and questionnaire surveys where psychiatric assessment tables are used to establish mental disorder prognosis. Analyzing texts written by an individual can serve as an additional knowledge source to diagnose depression. Consequently, using deep learning models to detect depressed and non-depressed individuals based on social media posts, by analyzing the words being posted, has become the focus of recent research. The lack of big-sized depression-labeled datasets for training models for depression detection in texts is a major challenge. Also, selecting a data augmentation (DA) method to augment the available small-sized datasets is difficult. So, we developed a methodology, named DAMEVAL, for the evaluation of DA methods for text classification. In DAMEVAL, we proposed a set of evaluation measures and benchmark NLP datasets for the evaluation and comparison of DA methods to create a reference for easier selection of DA methods by users. In this dissertation, we extracted and analyzed the textual depression symptoms indicators present in texts posted in online forums and the distribution of these indicators with respect to depressed and non-depressed social media users. Also, we computed weights, using the TFIDF method, based on the extracted depression symptoms’ indicators present in users' posts. Subsequently, we introduced a weighted deep learning model named DEP-BERTCNN, based on the computed depression indicators’ weights, for depression detection in text in online forums. DEP-BERTCNN uses a combination of a pre-trained BERT language model, an attention model and convolutional neural network to classify forum users as depressed or non-depressed. The DEP-BERTCNN model was trained and evaluated on the large-scale Reddit Self-reported Depression Dataset (RSDD). Our results outperform several baseline methods for depression detection in texts, demonstrating the effectiveness of combining deep learning model with linguistic indicators associated with depression symptoms. In summary, we aim to develop a beneficial system that can easily be used to detect depression in texts and enable policy makers to respond to mental health escalations easily and promptly.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.citationPortions of this document appear in: Aigbe, Steve Aibuedefe, and Christoph Eick. "Learning domain-specific word embeddings from covid-19 tweets." In 2021 IEEE International Conference on Big Data (Big Data), pp. 4307-4312. IEEE, 2021.
dc.identifier.urihttps://hdl.handle.net/10657/16219
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. UH Libraries has secured permission to reproduce any and all previously published materials contained in the work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectDepression Detection
dc.subjectBERT
dc.subjectCNN
dc.subjectAttention
dc.subjectWeights
dc.subjectTF-IDF
dc.subjectTexts
dc.subjectNLP
dc.titleDeveloping Deep Learning Models for Depression Detection in Texts
dc.type.dcmitext
dc.type.genreThesis
dcterms.accessRightsThe full text of this item is not available at this time because the student has placed this item under an embargo for a period of time. The Libraries are not authorized to provide a copy of this work during the embargo period.
local.embargo.lift2025-12-01
local.embargo.terms2025-12-01
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: