Saved in:
| Main Authors: | Tao, Fuxiang, Li, Dongwei, Tang, Shuning, Ge, Xuri, Ma, Wei, Esposito, Anna, Vinciarelli, Alessandro |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.01533 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection
by: Yeh, Hsiang-Chen, et al.
Published: (2026)
by: Yeh, Hsiang-Chen, et al.
Published: (2026)
Distinctive Feature Codec: An Adaptive Efficient Speech Representation for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2025)
by: Zhang, Xiangyu, et al.
Published: (2025)
Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
by: Vázquez-Romero, Adrián, et al.
Published: (2024)
by: Vázquez-Romero, Adrián, et al.
Published: (2024)
SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information
by: Zhang, Xiangyu, et al.
Published: (2025)
by: Zhang, Xiangyu, et al.
Published: (2025)
Speech-based Clinical Depression Screening: An Empirical Study
by: Chen, Yangbin, et al.
Published: (2024)
by: Chen, Yangbin, et al.
Published: (2024)
Optimizing Speech-Input Length for Speaker-Independent Depression Classification
by: Rutowski, Tomasz, et al.
Published: (2024)
by: Rutowski, Tomasz, et al.
Published: (2024)
Robust Speech and Natural Language Processing Models for Depression Screening
by: Lu, Y., et al.
Published: (2024)
by: Lu, Y., et al.
Published: (2024)
ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
by: Wang, Xin, et al.
Published: (2025)
by: Wang, Xin, et al.
Published: (2025)
Efficient Long Speech Sequence Modelling for Time-Domain Depression Level Estimation
by: Li, Shuanglin, et al.
Published: (2025)
by: Li, Shuanglin, et al.
Published: (2025)
Speech-preserving active noise control: a deep learning approach in reverberant environments
by: Dai, Shuning
Published: (2026)
by: Dai, Shuning
Published: (2026)
Personality-Enhanced Multimodal Depression Detection in the Elderly
by: Wang, Honghong, et al.
Published: (2025)
by: Wang, Honghong, et al.
Published: (2025)
Test-Time Training for Depression Detection
by: Dumpala, Sri Harsha, et al.
Published: (2024)
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)
by: Li, Yuxin, et al.
Published: (2025)
Recurrence-Based Nonlinear Vocal Dynamics as Digital Biomarkers for Depression Detection from Conversational Speech
by: Samanta, Himadri S
Published: (2026)
by: Samanta, Himadri S
Published: (2026)
Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2025)
by: Zhang, Xiangyu, et al.
Published: (2025)
ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection
by: Phukan, Orchid Chetia, et al.
Published: (2024)
by: Phukan, Orchid Chetia, et al.
Published: (2024)
Multimodal Magic Elevating Depression Detection with a Fusion of Text and Audio Intelligence
by: Gan, Lindy, et al.
Published: (2025)
by: Gan, Lindy, et al.
Published: (2025)
Leveraging Cross-Attention Transformer and Multi-Feature Fusion for Cross-Linguistic Speech Emotion Recognition
by: Zhao, Ruoyu, et al.
Published: (2025)
by: Zhao, Ruoyu, et al.
Published: (2025)
Classification of Autistic and Non-Autistic Children's Speech: A Cross-Linguistic Study in Finnish, French, and Slovak
by: Kakouros, Sofoklis, et al.
Published: (2026)
by: Kakouros, Sofoklis, et al.
Published: (2026)
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2024)
by: Zhang, Xiangyu, et al.
Published: (2024)
Self-Supervised Embeddings for Detecting Individual Symptoms of Depression
by: Dumpala, Sri Harsha, et al.
Published: (2024)
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Post-training for Deepfake Speech Detection
by: Ge, Wanying, et al.
Published: (2025)
by: Ge, Wanying, et al.
Published: (2025)
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
by: Violeta, Lester Phillip, et al.
Published: (2023)
by: Violeta, Lester Phillip, et al.
Published: (2023)
Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features
by: Talpur, Unzela, et al.
Published: (2025)
by: Talpur, Unzela, et al.
Published: (2025)
Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus
by: Harati, Amir, et al.
Published: (2024)
by: Harati, Amir, et al.
Published: (2024)
Performance of Objective Speech Quality Metrics on Languages Beyond Validation Data: A Study of Turkish and Korean
by: Perez, Javier, et al.
Published: (2025)
by: Perez, Javier, et al.
Published: (2025)
Linguistic Knowledge Transfer Learning for Speech Enhancement
by: Hung, Kuo-Hsuan, et al.
Published: (2025)
by: Hung, Kuo-Hsuan, et al.
Published: (2025)
Towards Data Drift Monitoring for Speech Deepfake Detection in the context of MLOps
by: Wang, Xin, et al.
Published: (2025)
by: Wang, Xin, et al.
Published: (2025)
A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks
by: Ishikawa, Takehiro, et al.
Published: (2026)
by: Ishikawa, Takehiro, et al.
Published: (2026)
Predicting Individual Depression Symptoms from Acoustic Features During Speech
by: Rodriguez, Sebastian, et al.
Published: (2024)
by: Rodriguez, Sebastian, et al.
Published: (2024)
Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?
by: Wang, Xin, et al.
Published: (2026)
by: Wang, Xin, et al.
Published: (2026)
Towards Cross-Task Suicide Risk Detection via Speech LLM
by: Li, Jialun, et al.
Published: (2025)
by: Li, Jialun, et al.
Published: (2025)
Towards a Generalizable Speech Marker for Parkinson's Disease Diagnosis
by: Siniukov, Maksim, et al.
Published: (2025)
by: Siniukov, Maksim, et al.
Published: (2025)
Emotional Vietnamese Speech-Based Depression Diagnosis Using Dynamic Attention Mechanism
by: D., Quang-Anh N., et al.
Published: (2024)
by: D., Quang-Anh N., et al.
Published: (2024)
Linguistic Changes in Spontaneous Speech for Detecting Parkinsons Disease Using Large Language Models
by: Crawford, Jonathan
Published: (2024)
by: Crawford, Jonathan
Published: (2024)
Toward Objective and Interpretable Prosody Evaluation in Text-to-Speech: A Linguistically Motivated Approach
by: Chan, Cedric, et al.
Published: (2025)
by: Chan, Cedric, et al.
Published: (2025)
TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation
by: Liu, Wei, et al.
Published: (2025)
by: Liu, Wei, et al.
Published: (2025)
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
by: Su, Rongfeng, et al.
Published: (2024)
by: Su, Rongfeng, et al.
Published: (2024)
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
by: Chen, Xinhui, et al.
Published: (2021)
by: Chen, Xinhui, et al.
Published: (2021)
Reducing Linguistic Hallucination in LM-Based Speech Enhancement via Noise-Invariant Acoustic-Semantic Distillation
by: Wang, Zheng, et al.
Published: (2026)
by: Wang, Zheng, et al.
Published: (2026)
Similar Items
-
Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection
by: Yeh, Hsiang-Chen, et al.
Published: (2026) -
Distinctive Feature Codec: An Adaptive Efficient Speech Representation for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2025) -
Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
by: Vázquez-Romero, Adrián, et al.
Published: (2024) -
SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information
by: Zhang, Xiangyu, et al.
Published: (2025) -
Speech-based Clinical Depression Screening: An Empirical Study
by: Chen, Yangbin, et al.
Published: (2024)