:: Library Catalog

Bibliographic Details
Main Authors:	Tao, Fuxiang; Li, Dongwei; Tang, Shuning; Ge, Xuri; Ma, Wei; Esposito, Anna; Vinciarelli, Alessandro
Format:	Preprint
Published:	2026
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2604.01533

Similar Items

Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection
by: Yeh, Hsiang-Chen, et al.
Published: (2026)

Distinctive Feature Codec: An Adaptive Efficient Speech Representation for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2025)

Automatic Detection of Depression in Speech Using Ensemble Convolutional Neural Networks
by: Vázquez-Romero, Adrián, et al.
Published: (2024)

SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information
by: Zhang, Xiangyu, et al.
Published: (2025)

Speech-based Clinical Depression Screening: An Empirical Study
by: Chen, Yangbin, et al.
Published: (2024)

Optimizing Speech-Input Length for Speaker-Independent Depression Classification
by: Rutowski, Tomasz, et al.
Published: (2024)

Robust Speech and Natural Language Processing Models for Depression Screening
by: Lu, Y., et al.
Published: (2024)

ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
by: Wang, Xin, et al.
Published: (2025)

Efficient Long Speech Sequence Modelling for Time-Domain Depression Level Estimation
by: Li, Shuanglin, et al.
Published: (2025)

Speech-preserving active noise control: a deep learning approach in reverberant environments
by: Dai, Shuning
Published: (2026)

Personality-Enhanced Multimodal Depression Detection in the Elderly
by: Wang, Honghong, et al.
Published: (2025)

Test-Time Training for Depression Detection
by: Dumpala, Sri Harsha, et al.
Published: (2024)

Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)

Recurrence-Based Nonlinear Vocal Dynamics as Digital Biomarkers for Depression Detection from Conversational Speech
by: Samanta, Himadri S
Published: (2026)

Why Pre-trained Models Fail: Feature Entanglement in Multi-modal Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2025)

ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection
by: Phukan, Orchid Chetia, et al.
Published: (2024)

Multimodal Magic Elevating Depression Detection with a Fusion of Text and Audio Intelligence
by: Gan, Lindy, et al.
Published: (2025)

Leveraging Cross-Attention Transformer and Multi-Feature Fusion for Cross-Linguistic Speech Emotion Recognition
by: Zhao, Ruoyu, et al.
Published: (2025)

Classification of Autistic and Non-Autistic Children's Speech: A Cross-Linguistic Study in Finnish, French, and Slovak
by: Kakouros, Sofoklis, et al.
Published: (2026)

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
by: Zhang, Xiangyu, et al.
Published: (2024)

Self-Supervised Embeddings for Detecting Individual Symptoms of Depression
by: Dumpala, Sri Harsha, et al.
Published: (2024)

Post-training for Deepfake Speech Detection
by: Ge, Wanying, et al.
Published: (2025)

Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
by: Violeta, Lester Phillip, et al.
Published: (2023)

Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features
by: Talpur, Unzela, et al.
Published: (2025)

Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus
by: Harati, Amir, et al.
Published: (2024)

Performance of Objective Speech Quality Metrics on Languages Beyond Validation Data: A Study of Turkish and Korean
by: Perez, Javier, et al.
Published: (2025)

Linguistic Knowledge Transfer Learning for Speech Enhancement
by: Hung, Kuo-Hsuan, et al.
Published: (2025)

Towards Data Drift Monitoring for Speech Deepfake Detection in the context of MLOps
by: Wang, Xin, et al.
Published: (2025)

A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks
by: Ishikawa, Takehiro, et al.
Published: (2026)

Predicting Individual Depression Symptoms from Acoustic Features During Speech
by: Rodriguez, Sebastian, et al.
Published: (2024)

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?
by: Wang, Xin, et al.
Published: (2026)

Towards Cross-Task Suicide Risk Detection via Speech LLM
by: Li, Jialun, et al.
Published: (2025)

Towards a Generalizable Speech Marker for Parkinson's Disease Diagnosis
by: Siniukov, Maksim, et al.
Published: (2025)

Emotional Vietnamese Speech-Based Depression Diagnosis Using Dynamic Attention Mechanism
by: D., Quang-Anh N., et al.
Published: (2024)

Linguistic Changes in Spontaneous Speech for Detecting Parkinsons Disease Using Large Language Models
by: Crawford, Jonathan
Published: (2024)

Toward Objective and Interpretable Prosody Evaluation in Text-to-Speech: A Linguistically Motivated Approach
by: Chan, Cedric, et al.
Published: (2025)

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation
by: Liu, Wei, et al.
Published: (2025)

Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
by: Su, Rongfeng, et al.
Published: (2024)

UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
by: Chen, Xinhui, et al.
Published: (2021)

Reducing Linguistic Hallucination in LM-Based Speech Enhancement via Noise-Invariant Acoustic-Semantic Distillation
by: Wang, Zheng, et al.
Published: (2026)