Deep learning with anaphora resolution for the detection of tweeters with depression: Algorithm development and validation study

Akkapon Wongkoblap*, Miguel A. Vadillo, Vasa Curcin

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

24 Citations (Scopus)

Abstract

Background: Mental health problems are widely recognized as a major public health challenge worldwide. This concern highlights the need to develop effective tools for detecting mental health disorders in the population. Social networks are a promising source of data wherein patients publish rich personal information that can be mined to extract valuable psychological cues; however, these data come with their own set of challenges, such as the need to disambiguate between statements about oneself and third parties. Traditionally, natural language processing techniques for social media have looked at text classifiers and user classification models separately, hence presenting a challenge for researchers who want to combine text sentiment and user sentiment analysis. Objective: The objective of this study is to develop a predictive model that can detect users with depression from Twitter posts and instantly identify textual content associated with mental health topics. The model can also address the problem of anaphoric resolution and highlight anaphoric interpretations. Methods: We retrieved the data set from Twitter by using a regular expression or stream of real-time tweets comprising 3682 users, of which 1983 self-declared their depression and 1699 declared no depression. Two multiple instance learning models were developed—one with and one without an anaphoric resolution encoder—to identify users with depression and highlight posts related to the mental health of the author. Several previously published models were applied to our data set, and their performance was compared with that of our models. Results: The maximum accuracy, F1 score, and area under the curve of our anaphoric resolution model were 92%, 92%, and 90%, respectively. The model outperformed alternative predictive models, which ranged from classical machine learning models to deep learning models. Conclusions: Our model with anaphoric resolution shows promising results when compared with other predictive models and provides valuable insights into textual content that is relevant to the mental health of the tweeter.

Original languageEnglish
Article numbere19824
JournalJMIR Mental Health
Volume8
Issue number8
DOIs
Publication statusPublished - Aug 2021

Keywords

  • Anaphora resolution
  • Deep learning
  • Depression
  • Depression markers
  • Mental health
  • Multiple-instance learning
  • Social media
  • Twitter

Fingerprint

Dive into the research topics of 'Deep learning with anaphora resolution for the detection of tweeters with depression: Algorithm development and validation study'. Together they form a unique fingerprint.

Cite this