Depression prognosis using natural language processing and machine learning from social media status

Md. Tazmim Hossain, Md. Arafat Rahman Talukder, Nusrat Jahan

Abstract


Depression is an acute problem throughout the world. Due to worst and prolong depression many people dies in every year. The problem is that most of the people are not concern of the fact that they are suffering from depression. In this research, our aim was to find out whether an individual is depressed or not by analyzing social media status. Therefore, we focused on real data. Our dataset consists of 2000 sentences, which was collected from different social media platforms Facebook, Twitter, and Instagram. Then, we have performed five data pre-processing approaches for natural language processing (NLP) such as tokenization, removal of stop words, removing empty string, removing punctuations, stemming and lemmatization. For our selected model, we considered that processed data as an input. Finally, we applied six machine learning (ML) classifiers multinomial Naive Bayes (NB), logistic regression, liner support vector classifier, random forest, K-nearest neighbour, and decision tree to achieve better accuracy over our dataset. Among six algorithms, multinomial NB and logistic regression performed well on our dataset and obtained 98% accuracy.


Keywords


depression; logistic regression; machine learning; multinomial NB; NLP; social media;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v12i3.pp2847-2855

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).