:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Shi, Jiacheng, Du, Hongfei, Hong, Y. Alicia, Gao, Ye
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2509.25458
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

EMO-TTA: Improving Test-Time Adaptation of Audio-Language Models for Speech Emotion Recognition
di: Shi, Jiacheng, et al.
Pubblicazione: (2025)

Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization
di: Shi, Jiacheng, et al.
Pubblicazione: (2025)

AffectCodec: Emotion-Preserving Neural Speech Codec for Expressive Speech Modeling
di: Shi, Jiacheng, et al.
Pubblicazione: (2026)

Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions
di: Gao, Xiaoxue, et al.
Pubblicazione: (2025)

MPE-TTS: Customized Emotion Zero-Shot Text-To-Speech Using Multi-Modal Prompt
di: Wu, Zhichao, et al.
Pubblicazione: (2025)

Sparse Autoencoders for Interpretable Emotion Control in Text-to-Speech
di: Du, Hongfei, et al.
Pubblicazione: (2026)

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
di: Cho, Deok-Hyeon, et al.
Pubblicazione: (2024)

Color-based Emotion Representation for Speech Emotion Recognition
di: Nagase, Ryotaro, et al.
Pubblicazione: (2026)

Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval
di: Huang, Le, et al.
Pubblicazione: (2024)

Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation
di: Chaudhury, Rohan, et al.
Pubblicazione: (2024)

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
di: Ma, Ziyang, et al.
Pubblicazione: (2023)

Expressive Prompting: Improving Emotion Intensity and Speaker Consistency in Zero-Shot TTS
di: Wang, Haoyu, et al.
Pubblicazione: (2024)

Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages
di: Zou, Heqing, et al.
Pubblicazione: (2025)

Can We Estimate Purchase Intention Based on Zero-shot Speech Emotion Recognition?
di: Nagase, Ryotaro, et al.
Pubblicazione: (2024)

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting
di: Yang, Guanrou, et al.
Pubblicazione: (2025)

PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
di: Lee, Suhyeon, et al.
Pubblicazione: (2025)

Speech Emotion Recognition via Entropy-Aware Score Selection
di: Chua, ChenYi, et al.
Pubblicazione: (2025)

PTS-SNN: A Prompt-Tuned Temporal Shift Spiking Neural Networks for Efficient Speech Emotion Recognition
di: Su, Xun, et al.
Pubblicazione: (2026)

Persian Speech Emotion Recognition by Fine-Tuning Transformers
di: Shayaninasab, Minoo, et al.
Pubblicazione: (2024)

MATER: Multi-level Acoustic and Textual Emotion Representation for Interpretable Speech Emotion Recognition
di: Jon, Hyo Jin, et al.
Pubblicazione: (2025)

Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition
di: Vu, Tai
Pubblicazione: (2025)

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs
di: Zhang, Hezhao, et al.
Pubblicazione: (2026)

Qieemo: Speech Is All You Need in the Emotion Recognition in Conversations
di: Chen, Jinming, et al.
Pubblicazione: (2025)

Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition
di: Gheffari, Youcef Soufiane, et al.
Pubblicazione: (2026)

UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech
di: Liu, Jiaxuan, et al.
Pubblicazione: (2025)

Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles
di: Guo, Rongchen, et al.
Pubblicazione: (2025)

Multi-Modal Emotion Recognition by Text, Speech and Video Using Pretrained Transformers
di: Shayaninasab, Minoo, et al.
Pubblicazione: (2024)

Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
di: Li, Xinran, et al.
Pubblicazione: (2025)

Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies
di: Koneru, Sai, et al.
Pubblicazione: (2024)

Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition
di: Wu, Wenhan, et al.
Pubblicazione: (2025)

SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition
di: Osman, Mohamed, et al.
Pubblicazione: (2024)

Efficient Finetuning for Dimensional Speech Emotion Recognition in the Age of Transformers
di: Sampath, Aneesha, et al.
Pubblicazione: (2025)

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
di: Shome, Debaditya, et al.
Pubblicazione: (2023)

Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
di: Sinhamahapatra, Poulami, et al.
Pubblicazione: (2024)

Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
di: QI, Anbin, et al.
Pubblicazione: (2024)

Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning
di: Zhang, Tao, et al.
Pubblicazione: (2024)

Bimodal Connection Attention Fusion for Speech Emotion Recognition
di: Luo, Jiachen, et al.
Pubblicazione: (2025)

Plug and Play with Prompts: A Prompt Tuning Approach for Controlling Text Generation
di: Ajwani, Rohan Deepak, et al.
Pubblicazione: (2024)

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition
di: Pan, Yu, et al.
Pubblicazione: (2023)

Emotion-Disentangled Embedding Alignment for Noise-Robust and Cross-Corpus Speech Emotion Recognition
di: Tiwari, Upasana, et al.
Pubblicazione: (2025)