:: Library Catalog

Bibliographic Details
Main Authors:	Vats, Guneesh; Srivastava, Priyanka; Yarra, Chiranjeevi
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.20213

Similar Items

Clinically Inspired Symptom-Guided Depression Detection from Emotion-Aware Speech Representations
by: Nerella, Chaithra, et al.
Published: (2026)

Language-Agnostic Analysis of Speech Depression Detection
by: Binu, Sona, et al.
Published: (2024)

A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
by: Mondal, Anindita, et al.
Published: (2024)

NeuSpeech: Decode Neural signal as Speech
by: Yang, Yiqian, et al.
Published: (2024)

Unmask It! AI-Generated Product Review Detection in Dravidian Languages
by: De, Somsubhra, et al.
Published: (2025)

Measuring the Redundancy of Decoder Layers in SpeechLLMs
by: Moumen, Adel, et al.
Published: (2026)

Hierarchical Self-Supervised Representation Learning for Depression Detection from Speech
by: Li, Yuxin, et al.
Published: (2025)

Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles
by: Guo, Rongchen, et al.
Published: (2025)

Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation
by: Wang, Sirui, et al.
Published: (2025)

Empowering Dysarthric Speech: Leveraging Advanced LLMs for Accurate Speech Correction and Multimodal Emotion Analysis
by: Attaluri, Kaushal, et al.
Published: (2024)

DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection
by: Li, Yuxin, et al.
Published: (2026)

Understanding Emotion in Discourse: Recognition Insights and Linguistic Patterns for Generation
by: Jeong, Cheonkam, et al.
Published: (2026)

UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech
by: Liu, Jiaxuan, et al.
Published: (2025)

SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition
by: Osman, Mohamed, et al.
Published: (2024)

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
by: Ma, Ziyang, et al.
Published: (2023)

Enhancing Depression Detection with Chain-of-Thought Prompting: From Emotion to Reasoning Using Large Language Models
by: Teng, Shiyu, et al.
Published: (2025)

Multilingual State Space Models for Structured Question Answering in Indic Languages
by: Vats, Arpita, et al.
Published: (2025)

Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition
by: Gheffari, Youcef Soufiane, et al.
Published: (2026)

SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification
by: Yoon, Kanghoon, et al.
Published: (2025)

Traversing Emotional Landscapes and Linguistic Patterns in Bernard-Marie Koltès' Plays: An NLP Perspective
by: Pourzarandi, Arezou Zahiri, et al.
Published: (2024)

Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
by: Su, Rongfeng, et al.
Published: (2024)

An Exploration of Mamba for Speech Self-Supervised Models
by: Lin, Tzu-Quan, et al.
Published: (2025)

LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection
by: Yarra, Rajesh
Published: (2025)

Exploring Acoustic Similarity in Emotional Speech and Music via Self-Supervised Representations
by: Sun, Yujia, et al.
Published: (2024)

IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR
by: Sharma, Karun, et al.
Published: (2026)

Bridging the Semantic Gap: Contrastive Rewards for Multilingual Text-to-SQL with GRPO
by: Kattamuri, Ashish, et al.
Published: (2025)

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection
by: Schuhmann, Christoph, et al.
Published: (2025)

Cross-Lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models
by: Han, Zhichen, et al.
Published: (2024)

Decoding Memories: An Efficient Pipeline for Self-Consistency Hallucination Detection
by: Gao, Weizhi, et al.
Published: (2025)

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
by: Shome, Debaditya, et al.
Published: (2023)

Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotations
by: Khaertdinov, Bulat, et al.
Published: (2024)

Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation
by: Chaudhury, Rohan, et al.
Published: (2024)

Equilibrium Dynamics and Mitigation of Gender Bias in Synthetically Generated Data
by: Kattamuri, Ashish, et al.
Published: (2025)

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations
by: Zhu, Jie, et al.
Published: (2026)

PruneCD: Contrasting Pruned Self Model to Improve Decoding Factuality
by: Yu, Byeongho, et al.
Published: (2025)

Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization
by: Shi, Jiacheng, et al.
Published: (2025)

Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
by: Siriwardhana, Shamane, et al.
Published: (2020)

R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
by: Guo, Jiaxin, et al.
Published: (2024)

Depression Diagnosis Dialogue Simulation: Self-improving Psychiatrist with Tertiary Memory
by: Lan, Kunyao, et al.
Published: (2024)

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs
by: Zhang, Hezhao, et al.
Published: (2026)