Bibliographic Details
Main Authors: Binu, Sona; Jose, Jismi; K V, Fathima Shimna; Hans, Alino Luke; Cherian, Reni K.; Alex, Starlet Ben; Srivastava, Priyanka; Yarra, Chiranjeevi
Format: Preprint
Published: 2024
Subjects: Computation and Language
Online Access: https://arxiv.org/abs/2409.14769
author Binu, Sona
Jose, Jismi
K V, Fathima Shimna
Hans, Alino Luke
Cherian, Reni K.
Alex, Starlet Ben
Srivastava, Priyanka
Yarra, Chiranjeevi
contents People with Major Depressive Disorder (MDD) exhibit tonal variations in their speech compared to their healthy counterparts. However, these tonal variations depend not only on the state of MDD but also on the language, since each language has its own tonal patterns. This work analyzes automatic speech-based depression detection across two languages, English and Malayalam, which exhibit distinctive prosodic and phonemic characteristics. We propose an approach that uses speech data, collected along with self-reported labels, from participants reading sentences from the IViE corpus in both English and Malayalam. The IViE corpus consists of five sets of sentences (simple sentences, WH-questions, questions without morphosyntactic markers, inversion questions, and coordinations) that can naturally prompt speakers to use different tonal patterns. Convolutional Neural Networks (CNNs) are employed to detect depression from speech. The CNN model is trained to identify acoustic features associated with depression in both languages. The model's performance is evaluated on the collected dataset, which contains recordings from both depressed and non-depressed speakers, to analyze its effectiveness in detecting depression across the two languages. Our findings and collected data could contribute to the development of language-agnostic speech-based depression detection systems, thereby enhancing accessibility for diverse populations.
format Preprint
id arxiv_https___arxiv_org_abs_2409_14769
institution arXiv
publishDate 2024
record_format arxiv
title Language-Agnostic Analysis of Speech Depression Detection
topic Computation and Language
url https://arxiv.org/abs/2409.14769