¹ Instituto Politécnico Nacional, Centro de Investigación en Computación, Mexico City, Mexico
email: {iameer2019,mariff2021,sidorov,gelbukh}@cic.ipn.mx
² Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Mexico City, Mexico
email: helena.gomez@iimas.unam.mx

Mental Illness Classification on Social Media Texts using Deep Learning and Transfer Learning

Iqra Ameer¹ [0000-0002-1134-9713], Muhammad Arif¹ [0000-0001-06141-02047], Grigori Sidorov¹ [0000-0003-3901-3522], Helena Gómez-Adorno² [0000-0002-6966-9912], Alexander Gelbukh¹ [0000-0001-7845-9039]

Abstract

Given the social distancing restrictions currently in place across the world, most individuals now use social media as their major medium of communication. As a result, millions of people suffering from mental illnesses have been isolated and are unable to get help in person. They have become more reliant on online venues to express themselves and seek advice on dealing with their mental disorders. According to the World Health Organization (WHO), approximately 450 million people are affected. Mental illnesses such as depression and anxiety are very common and also affect an individual’s physical health. Recently, Artificial Intelligence (AI) methods have been proposed to help mental health providers, including psychiatrists and psychologists, make decisions based on patients’ authentic information (e.g., medical records, behavioral data, social media use). AI methods have demonstrated strong performance in numerous real-world applications ranging from computer vision to healthcare. This study analyzes unstructured user data from the Reddit platform and classifies five common mental illnesses: depression, anxiety, bipolar disorder, ADHD, and PTSD. We trained traditional machine learning, deep learning, and transfer learning multi-class models to detect mental disorders of individuals. This effort can benefit the public health system by automating the detection process and informing appropriate authorities about people who require emergency assistance.

Keywords:

Mental Illnesses Classification, Machine Learning, Deep Learning, Transfer Learning, Reddit

1 Introduction

Mental illness is a type of health condition that changes a person’s thinking, feelings, or behavior (or all three) and has been shown to affect an individual’s physical health [28, 23]. Mental health issues including depression, schizophrenia, attention-deficit hyperactivity disorder (ADHD), and autism spectrum disorder (ASD) are highly prevalent today, and it is estimated that around 450 million people worldwide suffer from such problems [28].

To better understand mental health conditions and provide care, early detection of mental health problems is an essential step. Unlike the diagnosis of other chronic conditions, which relies on laboratory tests and measurements, mental illnesses are typically diagnosed based on an individual’s self-report on specific questionnaires designed to detect particular patterns of feelings or social interactions [18].

During these uncertain times, with COVID-19 afflicting the world, many people have shown signs of clinical anxiety or depression. This could be due to lockdowns, limited social activities, a higher unemployment rate, economic depression, and work-related fatigue. The American Foundation for Suicide Prevention reported that individuals experience anxiety (53%) and sadness (51%) more often now than before the COVID-19 pandemic.

Within the past decade, social media has changed social interaction. Along with sharing data and news, individuals actively communicate their day-to-day activities, experiences, hopes, and emotions, and generate large amounts of data online. This textual data provides information that can be used to design systems that predict people’s mental health. Moreover, the current state of limited social interaction has pushed people to express their thoughts on social media. It gives people an open platform to share their opinions with others and, in many cases, to seek help [25].

A previous study explored the application of Machine Learning (ML) techniques in mental health [31]. The authors reviewed the literature by grouping it into four main application domains: (i) detection and diagnosis, (ii) prognosis, treatment, and support, (iii) public health applications, and (iv) research and clinical administration. Another study explored the emerging application of deep learning (DL) techniques in psychiatry. It focused on DL by embedding semantically interpretable computational models of brain dynamics or behavior into a statistical machine learning context [15].

This study uses reddit.com (footnote 1: https://www.reddit.com/, last visited: 23-01-2022) user data proposed by Murarka and Radhakrishnan [25] to determine mental illnesses; sample instances of the dataset are shown in Table 1. We applied traditional machine learning, deep learning, and transfer learning approaches to automatically detect mental disorders in social media texts. Our extensive experiments demonstrate that machine learning, deep learning, and transfer learning techniques have the potential to complement clinical procedures in the prediction of mental health for two groups of individuals: those seeking help online and those who are unaware of their condition.

Table 1: Sample instances of Reddit corpus

No.	Reddit Post	Label
1	all the ideas that normally disappear as soon as we reach for a writing device will be captured and started. imagine all the projects we will begin and never finish!	adhd
2	i know this is long and i don’t know if a lot of people will read this but i really just want to help. i had 2 panic attacks over the end of february and first day of march. i went to the doctor and had my blood work	anxiety
3	for example, did you ever notice that you had manic, hypomanic, depressive, etc. episodes? did you ever notice that sometimes you were ”sad” and other times you were ”excessively happy”? i’m in a sticky	bipolar
4	i just feel so trapped and i have to do something about it. i don’t know where i’ll go or what i’ll do to get by. i just can’t stay here any longer.	depression
5	synesthesia. what is synesthesia? according to google, ”synesthesia is a condition in which one sense (for example, hearing) is simultaneously perceived as if by one or more additional senses such as sight.	none
6	this is probably going to incite a lot of disagreement, maybe even anger, but that’s okay; i’m going to say it anyway. anyone else tired of being told that just talking about your problems will solve your ptsd?	ptsd

The rest of the paper is organized as follows: Section 2 describes studies on mental illness in the literature. Section 3 explains the problem and gives dataset insights. Section 4 gives details of the methodology applied to detect mental disorders. Section 5 presents the results and their analysis. Section 6 concludes the paper with possible future work.

2 Related Work

In recent years, people have been using social media to communicate and seek advice on mental health issues. This has motivated researchers to use this information and apply a variety of NLP and ML approaches to help individuals who may want assistance. Initially, many researchers focused on Twitter text [27, 10, 12]; later, the focus shifted to the Reddit platform [21, 17, 10, 34].

A wide range of approaches has been applied to mental health text analysis, from traditional ML to advanced DL. In [12], character-level language models were employed to estimate how likely a user with mental health concerns is to produce a given sequence of characters. [10] determined different types of mental health disorders by applying neural multi-task learning (MTL), regression, and multi-layer perceptron single-task learning (STL) models. [1] trained SVMs to classify 200 text messages into two classes: “ADHD or not.” The most crucial step was the removal of the acronym ADHD from the messages before learning; further information concerning attention disorders was also removed from the texts. The goal was to see how well the SVMs learn when keywords and even semantically relevant material are unavailable.

Deep feedforward neural networks have outperformed typical ML models in a variety of data mining tasks [9, 7], and they have been used in the study of clinical and genetic data to predict mental health disorders. To diagnose depression, [27] used word embeddings in combination with a range of neural network models such as CNNs and RNNs. To conduct binary classification on mental health textual posts, [17] used feed-forward neural networks, CNNs, and traditional machine learning models such as SVMs and linear classifiers. [30] detected depression, ADHD, anxiety, and other types of mental illnesses by training a binary classifier for each disease with Hierarchical Attention Networks. The most recent work in this line was a CNN-based classification model [21], in which the team trained a separate binary classifier for each type of mental disorder. Reference [19] identified factors that potentially influence a person’s mental health during the COVID-19 pandemic by applying machine learning classifiers such as Naive Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and Gradient Boosting. They also presented an analysis using LIME.

Transfer learning plays an important role in current research. Researchers use several types of transformer models in an attempt to achieve better accuracy and performance. The authors of [25] examined three approaches for identifying and diagnosing mental illness on Reddit: LSTM, BERT, and RoBERTa. Among these three methods, RoBERTa performed best. Reference [14] employed RoBERTa to categorize COVID-19-related informative tweets, and their method yielded the best results.

3 Problem Description and Dataset

Murarka et al. [25] developed a benchmark multi-class dataset from the Reddit social media platform for mental illness detection.

3.1 Mental Illness Problem

This study treats mental illness detection as a multi-class classification problem. Given a text post from the Reddit platform, the task is to classify the post into one of the following six classes (five mental disorders and a none class):

• ADHD (Attention Deficit Hyperactivity Disorder): A brain illness that affects how you pay attention, sit still, and control your behavior (common in children) (footnote 2: https://www.cdc.gov/ncbddd/adhd/facts.html, last visited: 24-01-2022).
• Anxiety: A feeling of uneasiness, fear, and dread (footnote 3: https://medlineplus.gov/anxiety.html#:~:text=Anxiety%20is%20a%20feeling%20of,before%20making%20an%20important%20decision, last visited: 24-01-2022).
• Bipolar: Extreme mood swings, including emotional highs and lows, are a symptom of a mental health issue (footnote 4: https://www.mayoclinic.org/diseases-conditions/bipolar-disorder/symptoms-causes/syc-20355955, last visited: 24-01-2022).
• Depression: A widespread and significant medical condition that has a negative impact on how someone feels, thinks, and acts (footnote 5: https://www.psychiatry.org/patients-families/depression/what-is-depression, last visited: 24-01-2022).
• PTSD (Post-traumatic stress disorder): A disorder that affects certain people after they have been through a traumatic, frightening, or dangerous incident (footnote 6: https://www.nimh.nih.gov/health/topics/post-traumatic-stress-disorder-ptsd#:~:text=Post%2Dtraumatic%20stress%20disorder%20(PTSD,danger%20or%20to%20avoid%20it., last visited: 24-01-2022).
• None: No mental illness.

3.2 Dataset

The Reddit post dataset was developed to classify posts into one of five mental disorder classes or a none class. The dataset comprises a total of 16,930 posts. The posts were further divided into train, dev, and test sets, with 13,726 posts in the training set, 1,716 posts in the dev set, and 1,488 posts in the test set. Table 2 presents the number of posts for each mental illness class.

Table 2: Number of posts for each mental illness class in whole dataset

Class	Number of posts
ADHD	3,023
Anxiety	2,973
Bipolar	2,956
Depression	3,004
PTSD	2,499
None	2,478
Total	16,930

The dataset had already been pre-processed by eliminating URLs and usernames containing sensitive material. In addition, we lower-cased the post text, removed punctuation marks, removed stop words, and normalized elongated words [3, 32, 4]; these are the most commonly used pre-processing techniques in classification tasks. Figure 1 shows the number of instances in the training and test sets for each class.

Figure 1: Mental Illness Dataset Statistics
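The pre-processing steps listed in Section 3.2 (lower-casing, punctuation removal, stop-word removal, and normalization of elongated words) can be illustrated with a minimal sketch. The stop-word list, the rule that collapses three or more repeated characters to two, and the function name `preprocess` are assumptions made for illustration; the paper does not specify which stop-word list or normalization rule was used.

```python
import re
import string

# Hypothetical, deliberately tiny stop-word list (the paper does not name one).
STOP_WORDS = {"a", "an", "the", "and", "or", "but", "i", "is", "to", "of", "so"}

# Assumption: a character repeated three or more times is collapsed to two,
# e.g. "sooooo" -> "soo". Other normalization rules are equally plausible.
_ELONGATED = re.compile(r"(.)\1{2,}")

# Translation table that deletes ASCII punctuation only; typographic quotes
# such as the curly apostrophe are not covered by string.punctuation.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def preprocess(post: str) -> str:
    """Lower-case, strip punctuation, normalize elongated words, drop stop words."""
    text = post.lower()
    text = text.translate(_PUNCT_TABLE)
    text = _ELONGATED.sub(r"\1\1", text)
    tokens = [tok for tok in text.split() if tok not in STOP_WORDS]
    return " ".join(tokens)


if __name__ == "__main__":
    print(preprocess("I am SOOOOO tired, and the day is looong!!!"))
```

The order of the steps matters slightly: punctuation is removed before elongation is normalized, so runs such as "!!!" never reach the elongation rule.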

4 Methods for Mental Illness Classification

This section describes the machine learning, deep learning, and transfer learning models we applied to the multi-class mental illness classification problem.

4.1 Machine Learning Classifiers

ML aims to create computational algorithms or statistical models that can automatically infer hidden patterns from data [29, 33]. Recent years have seen an increasing number of ML models being created to analyze healthcare data [26, 11]. Conventional ML approaches require a significant amount of feature engineering (an essential step for good performance in most application scenarios) as well as time [16]. Words help to create contextual content. Their sequence and structure can give important insights for classifying texts [6, 2]. In earlier studies, several researchers extracted word n-grams to classify user content on social media. [21] used word n-grams to detect mental illness from Reddit posts. Another study [20] utilized word n-grams to generate and evaluate artificial mental health records for NLP. We applied four different machine learning classifiers: Random Forest, Linear Support Vector Machine, Multinomial Naive Bayes, and Logistic Regression.

The maximum number of features for each experiment was 1,000, i.e., we used the n-grams with the highest TF-IDF values. For the combination of word n-grams, $N$ ranged from a minimum of 1 to a maximum of 3. (We also tried $N$ = 1 and 2, and the results did not improve.)
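To make the feature extraction concrete, the following dependency-free sketch builds word n-grams for $N$ = 1 to 3 and keeps the highest-scoring ones. It is an illustration, not the implementation used in the experiments: the TF-IDF variant (relative term frequency times `log(n_docs / df) + 1`), the choice to rank each n-gram by its maximum score over all documents, and the names `word_ngrams` and `tfidf_top_features` are assumptions.

```python
import math
from collections import Counter


def word_ngrams(tokens, n_min=1, n_max=3):
    """Return all word n-grams with n_min <= n <= n_max as space-joined strings."""
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


def tfidf_top_features(docs, max_features=1000, n_min=1, n_max=3):
    """Rank n-grams by their highest TF-IDF score in any document; keep the top ones."""
    doc_grams = [Counter(word_ngrams(d.split(), n_min, n_max)) for d in docs]
    n_docs = len(docs)

    # Document frequency: in how many documents each n-gram occurs.
    doc_freq = Counter()
    for grams in doc_grams:
        doc_freq.update(grams.keys())

    best = {}
    for grams in doc_grams:
        total = sum(grams.values())
        for gram, count in grams.items():
            tf = count / total
            idf = math.log(n_docs / doc_freq[gram]) + 1.0  # assumed IDF variant
            best[gram] = max(best.get(gram, 0.0), tf * idf)

    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(best.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:max_features]]


if __name__ == "__main__":
    corpus = ["panic attack again", "panic again"]
    print(tfidf_top_features(corpus, max_features=3))
```

The selected n-grams would then serve as the columns of the feature matrix passed to the four classifiers. Library vectorizers may rank candidate features differently (for example by corpus frequency), so the selected vocabulary can differ from this sketch.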

4.2 Deep Learning Methods

The second type of method is deep learning-based, in which different state-of-the-art neural network models were applied. The series of CLPsych shared tasks (footnote 7: https://clpsych.org/, last visited: 24-01-2022) has played an important role in the development of mental health detection. We noticed that the most widely used models were Convolutional Neural Networks, Recurrent Neural Networks, Long Short-Term Memory, and Bidirectional Long Short-Term Memory.

In [25], the authors applied Long Short-Term Memory (LSTM) to detect mental illness from Reddit posts and achieved promising results. The authors of [24] applied LSTM with an attention mechanism to estimate suicidal intent by utilizing temporal psycholinguistic cues. We applied several pre-trained deep learning algorithms for multi-class mental illness detection: Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU), Bidirectional Gated Recurrent Unit (Bi-GRU), LSTM, and Bidirectional LSTM (Bi-LSTM).

We used the Scikit-learn implementation of deep learning models with the following parameters, which are usually the defaults: hidden layers = 3, hidden units = 64, no. of epochs = 10, batch size = 64, and dropout = 0.001. The parameters of the CNN model are as follows: activation function = Rectified Linear Units (ReLU), optimizer = Adam, hidden layers = 3, loss function = sigmoid, no. of epochs = 10, batch size = 64, dropout = 0.001.

4.3 Transfer Learning Methods

Bidirectional Encoder Representations from Transformers (BERT) is one of the best-known advanced methods for NLP problems. The BERT model has provided state-of-the-art performance on different NLP tasks without any critical task-specific design changes [22, 13]. The BERT model, which was pre-trained by Google [13], was used in [5] for multi-label text classification. The model was developed to enable transfer learning, which is why it went through a pre-training procedure on both the BookCorpus and the English Wikipedia to help the model learn English. This pre-training procedure consumes a lot of resources and time; therefore, fine-tuning a pre-trained model on a specific downstream task is more efficient.

The pre-trained uncased version of the BERT base model was used in this study, which means that the text was converted to lowercase before the word tokenization stage. Each of the 12 encoder layers in the BERT base model contains two sub-layers: a multi-head self-attention layer and a feed-forward layer.

In this study, the pre-trained XLNet was also used. The XLNet base model consists of 12 transformer layers with a hidden size of 768 and 12 attention heads. The XLNet tokenizer was used to tokenize the sequences. The tokens were then padded, and classification was performed.

RoBERTa (Robustly Optimized BERT Pretraining Approach) is a state-of-the-art language model that improves on BERT by modifying key hyperparameters and training on additional data. It outperforms BERT on a number of benchmark tasks and serves as the foundation for our proposed solution. We applied a pre-trained RoBERTa base model. The RoBERTa base model consists of 12 transformer layers with a hidden size of 768 and 12 attention heads. The RoBERTa tokenizer was used to tokenize the sequences. The tokens were then padded, and classification was performed.
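The padding step mentioned for the transformer models can be sketched as follows. This is a library-free illustration under stated assumptions: the token ids are made up, `max_len` and `pad_id` are hypothetical parameters (the paper does not report a maximum sequence length, and each real tokenizer defines its own padding id and special tokens), and `pad_batch` is not a function of any tokenizer library.

```python
from typing import List, Tuple


def pad_batch(
    sequences: List[List[int]], max_len: int, pad_id: int = 0
) -> Tuple[List[List[int]], List[List[int]]]:
    """Truncate or right-pad token-id sequences to max_len and build attention masks.

    The attention mask holds 1 for real tokens and 0 for padding, so that a
    transformer can ignore the padded positions.
    """
    input_ids, attention_mask = [], []
    for seq in sequences:
        kept = seq[:max_len]              # truncate sequences that are too long
        n_pad = max_len - len(kept)       # number of padding positions needed
        input_ids.append(kept + [pad_id] * n_pad)
        attention_mask.append([1] * len(kept) + [0] * n_pad)
    return input_ids, attention_mask


if __name__ == "__main__":
    # Made-up token ids standing in for three tokenized posts.
    ids, mask = pad_batch([[5, 6, 7], [8], [1, 2, 3, 4, 5, 6]], max_len=4)
    print(ids)
    print(mask)
```

After padding, every sequence in a batch has the same length, which allows the batch to be stacked into a single tensor and fed to the classifier head on top of the pre-trained encoder.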

5 Results and Analysis

Our machine learning, deep learning, and transfer learning results are reported in Table 3. In this table, “ML Algorithms” refers to traditional machine learning algorithms: “LinearSVC” denotes the Linear Support Vector Classifier, “LR” denotes Logistic Regression, “NB” denotes Naive Bayes, and “RF” denotes the Random Forest classifier. “DL Algorithms” refers to the deep learning algorithms used in this study, i.e., GRU, Bi-GRU, CNN, LSTM, and Bi-LSTM. “TL Algorithms” refers to the pre-trained transfer learning algorithms applied to the Reddit corpus, i.e., BERT, XLNet, and RoBERTa.

Our pre-trained transfer learning RoBERTa model outperformed the traditional machine learning and deep learning algorithms with an accuracy score of 0.80, which is quite good on this challenging multi-class mental illness detection problem. The performance of the XLNet model is close to that of RoBERTa, with a difference of 0.01.

Among the deep learning algorithms, the best overall results were obtained with Bi-LSTM. This shows that Bi-LSTM is the most suitable of the deep learning algorithms for detecting mental illness. Interestingly, the Bi-LSTM results are similar to the ones obtained with BERT. This indicates that Bi-LSTM performs on par with an advanced pre-trained transfer learning model on the multi-class mental illness detection problem. The accuracy scores of GRU, Bi-GRU, and CNN are not very high, highlighting the fact that multi-class mental illness detection on the text of Reddit posts is a challenging task.

Using traditional machine learning algorithms, the best overall results were obtained using the combination of word n-grams with N ranging from a minimum of 1 to a maximum of 3 (Accuracy = 0.78, F1 = 0.67). This shows that combinations of word n-grams (N = 1-3) were the most suitable features when we trained the model on data from the Reddit social media platform.

The RoBERTa model’s detailed class-wise results are shown in Table 4. The first notable outcome is the model’s ability to recognize non-illness-related posts with high accuracy. The model can categorize the none class with an F1 score of about 0.98. This indicates that, when it comes to detecting mental illness on social media, this model will suffer from very few false positives.

The two highest performing classes among the mental disorders are adhd and ptsd, whereas the two poorest performing classes are depression and anxiety.

A number of factors contribute to the results for the depression and anxiety classes. First, depression and anxiety have the lowest average number of tokens (words) per post among all classes.

Table 3: Results obtained by applying various classical machine learning, deep learning, and transfer learning techniques

Classical Machine Learning
ML Algorithm	Accuracy	F1-score
LinearSVC	0.79	0.80
LR	0.79	0.80
NB	0.74	0.75
RF	0.75	0.76
Deep Learning
DL Algorithm	Accuracy	F1-score
GRU	0.62	0.64
Bi-GRU	0.63	0.65
CNN	0.64	0.65
LSTM	0.76	0.77
Bi-LSTM	0.78	0.79
Transfer Learning
TL Algorithm	Accuracy	F1-score
BERT	0.78	0.80
XLNet	0.79	0.80
RoBERTa	0.83	0.83

For example, when comparing the depression and ptsd classes, posts in the depression class have around 53% less textual content. Furthermore, researchers have demonstrated that depression is frequently associated with other mental disorders, and our findings support this. The term depression appears in 12% of ptsd posts, 12% of anxiety posts, and 31% of bipolar posts. The term anxiety appears in 12% of adhd posts, 14% of bipolar posts, and 20% of ptsd posts. This means that, unlike with the rest of the disorders, the model is unable to place a high value on the mention of these class names, making the classification of these two labels challenging.
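The two corpus statistics used in this analysis (average number of tokens per post and the share of posts in a class that mention a given term) can be computed with a short sketch like the one below. The function name `class_statistics`, the whitespace tokenization, the exact-token matching rule, and the toy posts are assumptions for illustration; the paper does not state how mentions were counted.

```python
from collections import defaultdict


def class_statistics(posts, term):
    """Compute per-class average token count and share of posts mentioning `term`.

    posts: iterable of (text, label) pairs, assumed to be already pre-processed
           (lower-cased, punctuation removed).
    Returns a dict mapping label -> (avg_tokens, share_mentioning_term).
    """
    token_totals = defaultdict(int)
    mention_counts = defaultdict(int)
    post_counts = defaultdict(int)

    for text, label in posts:
        tokens = text.lower().split()
        post_counts[label] += 1
        token_totals[label] += len(tokens)
        if term in tokens:  # assumption: exact token match counts as a mention
            mention_counts[label] += 1

    return {
        label: (token_totals[label] / n, mention_counts[label] / n)
        for label, n in post_counts.items()
    }


if __name__ == "__main__":
    # Toy posts, not taken from the dataset.
    toy_posts = [
        ("i have depression and flashbacks", "ptsd"),
        ("nightmares again", "ptsd"),
        ("cannot focus today", "adhd"),
    ]
    for label, (avg_len, share) in class_statistics(toy_posts, "depression").items():
        print(label, avg_len, share)
```

Running the same function once per class name would give a term-by-class co-occurrence table of the kind summarized in the paragraph above.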

Table 4: RoBERTa class-wise results

Class	Precision	Recall	F1-score
adhd	0.85	0.83	0.84
anxiety	0.73	0.78	0.76
bipolar	0.83	0.76	0.80
depression	0.76	0.83	0.70
ptsd	0.90	0.87	0.88
none	0.99	0.96	0.98
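Class-wise precision, recall, and F1-score of the kind reported in Table 4 are derived from a confusion matrix. The sketch below shows the standard definitions on a small hypothetical two-class matrix; the counts are invented for illustration and are not the paper's results, and the helper names `per_class_metrics` and `macro_f1` are assumptions.

```python
def per_class_metrics(confusion, labels):
    """Compute (precision, recall, F1) per class.

    confusion[i][j] is the number of posts whose true class is labels[i]
    and whose predicted class is labels[j].
    """
    results = {}
    for k, label in enumerate(labels):
        tp = confusion[k][k]
        predicted = sum(row[k] for row in confusion)  # column sum: predicted as k
        actual = sum(confusion[k])                    # row sum: truly k
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        results[label] = (precision, recall, f1)
    return results


def macro_f1(results):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1 for _, _, f1 in results.values()) / len(results)


if __name__ == "__main__":
    # Hypothetical counts: rows are true classes, columns are predictions.
    labels = ["ptsd", "none"]
    confusion = [[3, 1],
                 [0, 4]]
    metrics = per_class_metrics(confusion, labels)
    for label, (p, r, f1) in metrics.items():
        print(f"{label}: precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
    print(f"macro F1 = {macro_f1(metrics):.2f}")
```

Because F1 is the harmonic mean of precision and recall, a class with high precision but low recall (or the reverse) is penalized more than it would be by a simple average.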

Figure 2 shows the confusion matrix for the RoBERTa model. The terms depression and anxiety are mentioned in more instances of the adhd and ptsd classes than the names of these classes themselves. One could expect poor outcomes as a result, but these classes outperform all others. This demonstrates the actual potential of our approach, since it does not depend solely on the mention of class names in the post but also captures the post’s context.

6 Conclusion

The present COVID-19 outbreak and globally enforced isolation are our primary motivations for this multi-class mental illness detection effort. We believe that social media platforms have become the most widely used communication medium for individuals, allowing them to express themselves without fear of being judged. We applied state-of-the-art traditional machine learning, deep learning, and transfer learning-based methods to the multi-class mental illness detection problem. The best results (see Table 3) were obtained using the pre-trained RoBERTa transfer learning model (accuracy = 0.83, F1-score = 0.83).

In the future, we plan to develop a multi-label dataset for mental illness problems, which would be more reflective of the real situation than a multi-class dataset, as a post can indicate more than one mental disorder (e.g., depression and anxiety) instead of only one. We can also apply data augmentation techniques on top of the existing mental health data [8]. We also plan to apply other transfer learning-based models such as DistilBERT. Ensemble modeling will be considered to improve classification performance.

Acknowledgements

The work was done with support from the Mexican Government through the grant A1-S-47854 of the CONACYT, Mexico, grants 20211784, 20211884, and 20211178 of the Secretaría de Investigación y Posgrado of the Instituto Politécnico Nacional, Mexico, and grants of the PAPIIT-UNAM project TA101722. The authors used computing resources provided by the CONACYT through the Plataforma de Aprendizaje Profundo para Tecnologías del Lenguaje of the Laboratorio de Supercómputo of the INAOE, Mexico.

References

[1] Abusaa, M., Diederich, J., Al-Ajmi, A., et al.: Machine learning, text classification and mental health. HIC 2004: Proceedings p. 102 (2004)
[2] Ameer, I., Ashraf, N., Sidorov, G., Gómez Adorno, H.: Multi-label emotion classification using content-based features in twitter. Computación y Sistemas 24(3) (2020)
[3] Ameer, I., Siddiqui, M.H.F., Sidorov, G., Gelbukh, A.: Cic at semeval-2019 task 5: Simple yet very efficient approach to hate speech detection, aggressive behavior detection, and target classification in twitter. In: Proceedings of the 13th International Workshop on Semantic Evaluation. pp. 382–386 (2019)
[4] Ameer, I., Sidorov, G.: Author profiling using texts in social networks. In: Handbook of Research on Natural Language Processing and Smart Service Systems, pp. 245–265. IGI Global (2021)
[5] Ameer, I., Sidorov, G., Gómez-Adorno, H., Nawab, R.M.A.: Multi-label emotion classification on code-mixed text: Data and methods. IEEE Access 10, 8779–8789 (2022). https://doi.org/10.1109/ACCESS.2022.3143819
[6] Ameer, I., Sidorov, G., Nawab, R.M.A.: Author profiling for age and gender using combinations of features of various types. Journal of Intelligent & Fuzzy Systems 36(5), 4833–4843 (2019)
[7] Amjad, M., Ashraf, N., Zhila, A., Sidorov, G., Zubiaga, A., Gelbukh, A.: Threatening language detection and target identification in urdu tweets. IEEE Access 9, 128302–128313 (2021)
[8] Amjad, M., Sidorov, G., Zhila, A.: Data augmentation using machine translation for fake news detection in the urdu language. In: Proceedings of the 12th language resources and evaluation conference. pp. 2537–2542 (2020)
[9] Amjad, M., Sidorov, G., Zhila, A., Gómez-Adorno, H., Voronkov, I., Gelbukh, A.: “bend the truth”: Benchmark dataset for fake news detection in urdu language and its evaluation. Journal of Intelligent & Fuzzy Systems 39(2), 2457–2469 (2020)
[10] Benton, A., Mitchell, M., Hovy, D.: Multi-task learning for mental health using social media text. arXiv preprint arXiv:1712.03538 (2017)
[11] Bishop, C.M.: Pattern recognition and machine learning (information science and statistics) (2007)
[12] Coppersmith, G., Dredze, M., Harman, C., Hollingshead, K., Mitchell, M.: Clpsych 2015 shared task: Depression and ptsd on twitter. In: Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality. pp. 31–39 (2015)
[13] Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
[14] Dhanalaxmi, S., Agarwal, R., Sinha, A.: Detection of covid-19 informative tweets using roberta. arXiv preprint arXiv:2010.11238 (2020)
[15] Durstewitz, D., Koppe, G., Meyer-Lindenberg, A.: Deep neural networks in psychiatry. Molecular psychiatry 24(11), 1583–1598 (2019)
[16] Dwyer, D.B., Falkai, P., Koutsouleris, N.: Machine learning approaches for clinical psychology and psychiatry. Annual review of clinical psychology 14, 91–118 (2018)
[17] Gkotsis, G., Oellrich, A., Velupillai, S., Liakata, M., Hubbard, T.J., Dobson, R.J., Dutta, R.: Characterisation of mental health conditions in social media using informed deep learning. Scientific reports 7(1), 1–11 (2017)
[18] Hamilton, M.: Development of a rating scale for primary depressive illness. British journal of social and clinical psychology 6(4), 278–296 (1967)
[19] Hu, Y., Sokolova, M.: Explainable multi-class classification of the camh covid-19 mental health data. arXiv preprint arXiv:2105.13430 (2021)
[20] Ive, J., Viani, N., Kam, J., Yin, L., Verma, S., Puntis, S., Cardinal, R.N., Roberts, A., Stewart, R., Velupillai, S.: Generation and evaluation of artificial mental health records for natural language processing. NPJ digital medicine 3(1), 1–9 (2020)
[21] Kim, J., Lee, J., Park, E., Han, J.: A deep learning model for detecting mental illness from user content on social media. Scientific reports 10(1), 1–6 (2020)
[22] Li, X., Fu, X., Xu, G., Yang, Y., Wang, J., Jin, L., Liu, Q., Xiang, T.: Enhancing bert representation with context-aware embedding for aspect-based sentiment analysis. IEEE Access 8, 46868–46876 (2020)
[23] Marcus, M., Yasamy, M.T., van Ommeren, M.v., Chisholm, D., Saxena, S.: Depression: A global public health concern (2012)
[24] Mathur, P., Sawhney, R., Chopra, S., Leekha, M., Shah, R.R.: Utilizing temporal psycholinguistic cues for suicidal intent estimation. Advances in Information Retrieval 12036, 265 (2020)
[25] Murarka, A., Radhakrishnan, B., Ravichandran, S.: Classification of mental illnesses on social media using roberta. In: Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis. pp. 59–68 (2021)
[26] Murphy, K.P.: Machine learning: a probabilistic perspective. MIT press (2012)
[27] Orabi, A.H., Buddhitha, P., Orabi, M.H., Inkpen, D.: Deep learning for depression detection of twitter users. In: Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic. pp. 88–97 (2018)
[28] Organization, W.H.: The world health report 2001: Mental health: new understanding, new hope (2001)
[29] Pervaz, I., Ameer, I., Sittar, A., Nawab, R.M.A.: Identification of author personality traits using stylistic features: Notebook for pan at clef 2015. In: CLEF (Working Notes). Citeseer (2015)
[30] Sekulić, I., Strube, M.: Adapting deep learning methods for mental health prediction on social media. arXiv preprint arXiv:2003.07634 (2020)
[31] Shatte, A.B., Hutchinson, D.M., Teague, S.J.: Machine learning in mental health: a scoping review of methods and applications. Psychological medicine 49(9), 1426–1448 (2019)
[32] Siddiqui, M.H.F., Ameer, I., Gelbukh, A.F., Sidorov, G.: Bots and gender profiling on twitter. In: CLEF (Working Notes) (2019)
[33] Sittar, A., Ameer, I.: Multi-lingual author profiling using stylistic features. In: FIRE (Working Notes). pp. 240–246 (2018)
[34] Zirikly, A., Resnik, P., Uzuner, O., Hollingshead, K.: Clpsych 2019 shared task: Predicting the degree of suicide risk in reddit posts. In: Proceedings of the sixth workshop on computational linguistics and clinical psychology. pp. 24–33 (2019)