Saved in:
| Main Authors: | Chen, Jingyi, Guo, Zhimeng, Chun, Jiyun, Wang, Pichao, Perrault, Andrew, Elsner, Micha |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10444 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback
by: Chen, Jingyi, et al.
Published: (2025)
by: Chen, Jingyi, et al.
Published: (2025)
Shortcomings of LLMs for Low-Resource Translation: Retrieval and Understanding are Both the Problem
by: Court, Sara, et al.
Published: (2024)
by: Court, Sara, et al.
Published: (2024)
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models
by: Chen, Jingyi, et al.
Published: (2024)
by: Chen, Jingyi, et al.
Published: (2024)
Beyond Length: Context-Aware Expansion and Independence as Developmentally Sensitive Evaluation in Child Utterances
by: Chun, Jiyun, et al.
Published: (2026)
by: Chun, Jiyun, et al.
Published: (2026)
Prompt and circumstance: A word-by-word LLM prompting approach to interlinear glossing for low-resource languages
by: Elsner, Micha, et al.
Published: (2025)
by: Elsner, Micha, et al.
Published: (2025)
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
by: Byun, Ju-Seung, et al.
Published: (2024)
by: Byun, Ju-Seung, et al.
Published: (2024)
Why is "Chicago" Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues
by: Qu, Jiaming, et al.
Published: (2025)
by: Qu, Jiaming, et al.
Published: (2025)
Speech LLMs are Contextual Reasoning Transcribers
by: Deng, Keqi, et al.
Published: (2026)
by: Deng, Keqi, et al.
Published: (2026)
Do LLMs Really Memorize Personally Identifiable Information? Revisiting PII Leakage with a Cue-Controlled Memorization Framework
by: Luo, Xiaoyu, et al.
Published: (2026)
by: Luo, Xiaoyu, et al.
Published: (2026)
LISTEN to Your Preferences: An LLM Framework for Multi-Objective Selection
by: Jovine, Adam S., et al.
Published: (2025)
by: Jovine, Adam S., et al.
Published: (2025)
Do LLMs Really Adapt to Domains? An Ontology Learning Perspective
by: Mai, Huu Tan, et al.
Published: (2024)
by: Mai, Huu Tan, et al.
Published: (2024)
Do LLMs Really Think Step-by-step In Implicit Reasoning?
by: Yu, Yijiong
Published: (2024)
by: Yu, Yijiong
Published: (2024)
The Role of Prosodic and Lexical Cues in Turn-Taking with Self-Supervised Speech Representations
by: Russell, Sam OConnor, et al.
Published: (2026)
by: Russell, Sam OConnor, et al.
Published: (2026)
All That Glitters Is Not Audio: Rethinking Text Priors and Audio Reliance in Audio-Language Evaluation
by: Foo, Leonardo Haw-Yang, et al.
Published: (2026)
by: Foo, Leonardo Haw-Yang, et al.
Published: (2026)
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning
by: Sun, Siqi, et al.
Published: (2024)
by: Sun, Siqi, et al.
Published: (2024)
Do Emotions Really Affect Argument Convincingness? A Dynamic Approach with LLM-based Manipulation Checks
by: Chen, Yanran, et al.
Published: (2025)
by: Chen, Yanran, et al.
Published: (2025)
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness
by: Wei, Rongzhe, et al.
Published: (2025)
by: Wei, Rongzhe, et al.
Published: (2025)
Dispersion Measures as Predictors of Lexical Decision Time, Word Familiarity, and Lexical Complexity
by: Nohejl, Adam, et al.
Published: (2025)
by: Nohejl, Adam, et al.
Published: (2025)
Do MLLMs Really Understand the Charts?
by: Zhang, Xiao, et al.
Published: (2025)
by: Zhang, Xiao, et al.
Published: (2025)
On the Contribution of Lexical Features to Speech Emotion Recognition
by: Combei, David
Published: (2025)
by: Combei, David
Published: (2025)
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
by: Chen, Guoguo, et al.
Published: (2021)
by: Chen, Guoguo, et al.
Published: (2021)
Counterfactual Cultural Cues Reduce Medical QA Accuracy in LLMs: Identifier vs Context Effects
by: Rezaei, Amirhossein Haji Mohammad, et al.
Published: (2026)
by: Rezaei, Amirhossein Haji Mohammad, et al.
Published: (2026)
Do LLMs Know What Luxembourgish Borrows? Probing Lexical Neology in Low-Resource Multilingual Models
by: Hosseini-Kivanani, Nina
Published: (2026)
by: Hosseini-Kivanani, Nina
Published: (2026)
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models
by: Bi, Baolong, et al.
Published: (2025)
by: Bi, Baolong, et al.
Published: (2025)
Rubato: Transcribing Piano Music with Timestamps
by: Tamer, Nazif Can, et al.
Published: (2026)
by: Tamer, Nazif Can, et al.
Published: (2026)
Do the Right Thing, Just Debias! Multi-Category Bias Mitigation Using LLMs
by: Roy, Amartya, et al.
Published: (2024)
by: Roy, Amartya, et al.
Published: (2024)
Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding
by: Guo, Ruohao, et al.
Published: (2023)
by: Guo, Ruohao, et al.
Published: (2023)
MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
by: Li, Song, et al.
Published: (2024)
by: Li, Song, et al.
Published: (2024)
Are LLMs Really Not Knowledgeable? Mining the Submerged Knowledge in LLMs' Memory
by: Tao, Xingjian, et al.
Published: (2024)
by: Tao, Xingjian, et al.
Published: (2024)
Do LLMs "Feel"? Emotion Circuits Discovery and Control
by: Wang, Chenxi, et al.
Published: (2025)
by: Wang, Chenxi, et al.
Published: (2025)
Self-Train Before You Transcribe
by: Flynn, Robert, et al.
Published: (2024)
by: Flynn, Robert, et al.
Published: (2024)
Do Efficient Transformers Really Save Computation?
by: Yang, Kai, et al.
Published: (2024)
by: Yang, Kai, et al.
Published: (2024)
HearSay Benchmark: Do Audio LLMs Leak What They Hear?
by: Wang, Jin, et al.
Published: (2026)
by: Wang, Jin, et al.
Published: (2026)
Offloading Score: Measuring AI Reliance Through Counterfactual Workflows
by: Padmakumar, Vishakh, et al.
Published: (2026)
by: Padmakumar, Vishakh, et al.
Published: (2026)
TOGGL: Transcribing Overlapping Speech with Staggered Labeling
by: Li, Chak-Fai, et al.
Published: (2024)
by: Li, Chak-Fai, et al.
Published: (2024)
Resolving Transcription Ambiguity in Spanish: A Hybrid Acoustic-Lexical System for Punctuation Restoration
by: Zhu, Xiliang, et al.
Published: (2024)
by: Zhu, Xiliang, et al.
Published: (2024)
AG-LSEC: Audio Grounded Lexical Speaker Error Correction
by: Paturi, Rohit, et al.
Published: (2024)
by: Paturi, Rohit, et al.
Published: (2024)
Fake Alignment: Are LLMs Really Aligned Well?
by: Wang, Yixu, et al.
Published: (2023)
by: Wang, Yixu, et al.
Published: (2023)
Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing
by: Reichman, Benjamin, et al.
Published: (2026)
by: Reichman, Benjamin, et al.
Published: (2026)
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
by: Gong, Kaixiong, et al.
Published: (2024)
by: Gong, Kaixiong, et al.
Published: (2024)
Similar Items
-
Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback
by: Chen, Jingyi, et al.
Published: (2025) -
Shortcomings of LLMs for Low-Resource Translation: Retrieval and Understanding are Both the Problem
by: Court, Sara, et al.
Published: (2024) -
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models
by: Chen, Jingyi, et al.
Published: (2024) -
Beyond Length: Context-Aware Expansion and Independence as Developmentally Sensitive Evaluation in Child Utterances
by: Chun, Jiyun, et al.
Published: (2026) -
Prompt and circumstance: A word-by-word LLM prompting approach to interlinear glossing for low-resource languages
by: Elsner, Micha, et al.
Published: (2025)