Academia.eduAcademia.edu

Outline

Depression Detection from Speech

2020, International Journal of Engineering Research and

https://doi.org/10.17577/IJERTV9IS030611

Abstract

Depression (Major Depressive Disorder) has become one of the most prevalent mental illness across the globe. It can be described as a state in which a person feels sad or loses interest in everyday activities which are normally considered enjoyable. It affects the thoughts and the way one feels about the people that he/she comes across. More than 260 million people throughout the world suffer from depression. This comprises of people belonging to all age groups. Depression can lead a person to hopelessness and subsequently result into suicide. It is also estimated that depression will account for the most common of diseases by 2030. Nearly 20% of people with untreated depressive disorder commit suicide. Therefore ways to detect depression are a major concern. The current diagnoses are highly inconsistent and expensive. The most common technique used to detect depression is comparing the words in a person's response to a questionnaire, with a database comprising of terms commonly used by depressed patients. This method has found to be inefficient in terms of its accuracy in prediction as well as the time consumed in diagnosis. In this project, a new method for early detection of depression is implemented. The features of speech which are associated with signs of depression can be traced by neural networks. Convolutional Neural Networks, in particular, can be applied to identify depression indicators in speech. Spectrograms, which mark the intensities of frequencies of speech can be given as input to Convolutional Neural Networks designed to learn similar patterns in depressed audio.

References (4)

  1. Gratch, Artstein, Lucas, Stratou, Scherer, Nazarian, Wood, Boberg, DeVault, Marsella, Traum. The Distress Analysis Interview Corpus of human and computer interviews. InLREC 2014 May (pp. 3123-3128).
  2. Girard, Cohn. Automated Depression Analysis. Curr Opin Psychol. 2015 August; 4: 75-79.
  3. Ma, Yang, Chen, Huang, and Wang. DepAudioNet: An Efficient Deep Model for Audio based Depression Classification. ACM International Conference on Multimedia (ACM-MM) Workshop: Audio/Visual Emo- tion Challenge (AVEC), 2016. IJERTV9IS030611 (This work is licensed under a Creative Commons Attribution 4.0 International License.) www.ijert.org
  4. Giannakopoulos, Aggelos. Introduction to audio analysis: a MATLAB approach. Oxford: Academic Press, 2014.