Saved in:
| Main Authors: | Vats, Guneesh, Srivastava, Priyanka, Yarra, Chiranjeevi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.20213 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Clinically Inspired Symptom-Guided Depression Detection from Emotion-Aware Speech Representations
by: Nerella, Chaithra, et al.
Published: (2026)
by: Nerella, Chaithra, et al.
Published: (2026)
Language-Agnostic Analysis of Speech Depression Detection
by: Binu, Sona, et al.
Published: (2024)
by: Binu, Sona, et al.
Published: (2024)
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
by: Mondal, Anindita, et al.
Published: (2024)
by: Mondal, Anindita, et al.
Published: (2024)
NeuSpeech: Decode Neural signal as Speech
by: Yang, Yiqian, et al.
Published: (2024)
by: Yang, Yiqian, et al.
Published: (2024)
Unmask It! AI-Generated Product Review Detection in Dravidian Languages
by: De, Somsubhra, et al.
Published: (2025)
by: De, Somsubhra, et al.
Published: (2025)
Measuring the Redundancy of Decoder Layers in SpeechLLMs
by: Moumen, Adel, et al.
Published: (2026)
by: Moumen, Adel, et al.
Published: (2026)
Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)
by: Li, Yuxin, et al.
Published: (2025)
Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles
by: Guo, Rongchen, et al.
Published: (2025)
by: Guo, Rongchen, et al.
Published: (2025)
Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation
by: Wang, Sirui, et al.
Published: (2025)
by: Wang, Sirui, et al.
Published: (2025)
Empowering Dysarthric Speech: Leveraging Advanced LLMs for Accurate Speech Correction and Multimodal Emotion Analysis
by: Attaluri, Kaushal, et al.
Published: (2024)
by: Attaluri, Kaushal, et al.
Published: (2024)
DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection
by: Li, Yuxin, et al.
Published: (2026)
by: Li, Yuxin, et al.
Published: (2026)
Understanding Emotion in Discourse: Recognition Insights and Linguistic Patterns for Generation
by: Jeong, Cheonkam, et al.
Published: (2026)
by: Jeong, Cheonkam, et al.
Published: (2026)
UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech
by: Liu, Jiaxuan, et al.
Published: (2025)
by: Liu, Jiaxuan, et al.
Published: (2025)
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition
by: Osman, Mohamed, et al.
Published: (2024)
by: Osman, Mohamed, et al.
Published: (2024)
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
by: Ma, Ziyang, et al.
Published: (2023)
by: Ma, Ziyang, et al.
Published: (2023)
Enhancing Depression Detection with Chain-of-Thought Prompting: From Emotion to Reasoning Using Large Language Models
by: Teng, Shiyu, et al.
Published: (2025)
by: Teng, Shiyu, et al.
Published: (2025)
Multilingual State Space Models for Structured Question Answering in Indic Languages
by: Vats, Arpita, et al.
Published: (2025)
by: Vats, Arpita, et al.
Published: (2025)
Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)
SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification
by: Yoon, Kanghoon, et al.
Published: (2025)
by: Yoon, Kanghoon, et al.
Published: (2025)
Traversing Emotional Landscapes and Linguistic Patterns in Bernard-Marie Koltès' Plays: An NLP Perspective
by: Pourzarandi, Arezou Zahiri, et al.
Published: (2024)
by: Pourzarandi, Arezou Zahiri, et al.
Published: (2024)
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
by: Su, Rongfeng, et al.
Published: (2024)
by: Su, Rongfeng, et al.
Published: (2024)
An Exploration of Mamba for Speech Self-Supervised Models
by: Lin, Tzu-Quan, et al.
Published: (2025)
by: Lin, Tzu-Quan, et al.
Published: (2025)
LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection
by: Yarra, Rajesh
Published: (2025)
by: Yarra, Rajesh
Published: (2025)
Exploring Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
by: Sun, Yujia, et al.
Published: (2024)
by: Sun, Yujia, et al.
Published: (2024)
IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR
by: Sharma, Karun, et al.
Published: (2026)
by: Sharma, Karun, et al.
Published: (2026)
Bridging the Semantic Gap: Contrastive Rewards for Multilingual Text-to-SQL with GRPO
by: Kattamuri, Ashish, et al.
Published: (2025)
by: Kattamuri, Ashish, et al.
Published: (2025)
EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection
by: Schuhmann, Christoph, et al.
Published: (2025)
by: Schuhmann, Christoph, et al.
Published: (2025)
Cross-Lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
by: Han, Zhichen, et al.
Published: (2024)
by: Han, Zhichen, et al.
Published: (2024)
Decoding Memories: An Efficient Pipeline for Self-Consistency Hallucination Detection
by: Gao, Weizhi, et al.
Published: (2025)
by: Gao, Weizhi, et al.
Published: (2025)
Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
by: Shome, Debaditya, et al.
Published: (2023)
by: Shome, Debaditya, et al.
Published: (2023)
Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations
by: Khaertdinov, Bulat, et al.
Published: (2024)
by: Khaertdinov, Bulat, et al.
Published: (2024)
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation
by: Chaudhury, Rohan, et al.
Published: (2024)
by: Chaudhury, Rohan, et al.
Published: (2024)
Equilibrium Dynamics and Mitigation of Gender Bias in Synthetically Generated Data
by: Kattamuri, Ashish, et al.
Published: (2025)
by: Kattamuri, Ashish, et al.
Published: (2025)
ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations
by: Zhu, Jie, et al.
Published: (2026)
by: Zhu, Jie, et al.
Published: (2026)
PruneCD: Contrasting Pruned Self Model to Improve Decoding Factuality
by: Yu, Byeongho, et al.
Published: (2025)
by: Yu, Byeongho, et al.
Published: (2025)
Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization
by: Shi, Jiacheng, et al.
Published: (2025)
by: Shi, Jiacheng, et al.
Published: (2025)
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
by: Siriwardhana, Shamane, et al.
Published: (2020)
by: Siriwardhana, Shamane, et al.
Published: (2020)
R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
by: Guo, Jiaxin, et al.
Published: (2024)
by: Guo, Jiaxin, et al.
Published: (2024)
Depression Diagnosis Dialogue Simulation: Self-improving Psychiatrist with Tertiary Memory
by: Lan, Kunyao, et al.
Published: (2024)
by: Lan, Kunyao, et al.
Published: (2024)
VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs
by: Zhang, Hezhao, et al.
Published: (2026)
by: Zhang, Hezhao, et al.
Published: (2026)
Similar Items
-
Clinically Inspired Symptom-Guided Depression Detection from Emotion-Aware Speech Representations
by: Nerella, Chaithra, et al.
Published: (2026) -
Language-Agnostic Analysis of Speech Depression Detection
by: Binu, Sona, et al.
Published: (2024) -
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
by: Mondal, Anindita, et al.
Published: (2024) -
NeuSpeech: Decode Neural signal as Speech
by: Yang, Yiqian, et al.
Published: (2024) -
Unmask It! AI-Generated Product Review Detection in Dravidian Languages
by: De, Somsubhra, et al.
Published: (2025)