Saved in:
| Main Authors: | Chi, Jie, de Seyssel, Maureen, Schluter, Natalie |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.05389 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks
by: de Seyssel, Maureen, et al.
Published: (2025)
by: de Seyssel, Maureen, et al.
Published: (2025)
Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models
by: Blandón, María Andrea Cruz, et al.
Published: (2025)
by: Blandón, María Andrea Cruz, et al.
Published: (2025)
Assessing the Role of Data Quality in Training Bilingual Language Models
by: Seto, Skyler, et al.
Published: (2025)
by: Seto, Skyler, et al.
Published: (2025)
Which Evaluation for Which Model? A Taxonomy for Speech Model Assessment
by: de Seyssel, Maureen, et al.
Published: (2025)
by: de Seyssel, Maureen, et al.
Published: (2025)
Toward Machine Interpreting: Lessons from Human Interpreting Studies
by: Sperber, Matthias, et al.
Published: (2025)
by: Sperber, Matthias, et al.
Published: (2025)
GSQA: An End-to-End Model for Generative Spoken Question Answering
by: Shih, Min-Han, et al.
Published: (2023)
by: Shih, Min-Han, et al.
Published: (2023)
Enhancing Speech Instruction Understanding and Disambiguation in Robotics via Speech Prosody
by: Sasu, David, et al.
Published: (2025)
by: Sasu, David, et al.
Published: (2025)
Attention-guided Evidence Grounding for Spoken Question Answering
by: Yang, Ke, et al.
Published: (2026)
by: Yang, Ke, et al.
Published: (2026)
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
by: de Seyssel, Maureen, et al.
Published: (2023)
by: de Seyssel, Maureen, et al.
Published: (2023)
HeySQuAD: A Spoken Question Answering Dataset
by: Wu, Yijing, et al.
Published: (2023)
by: Wu, Yijing, et al.
Published: (2023)
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
by: Lin, Chyi-Jiunn, et al.
Published: (2024)
by: Lin, Chyi-Jiunn, et al.
Published: (2024)
LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models
by: Zhao, Zihan, et al.
Published: (2023)
by: Zhao, Zihan, et al.
Published: (2023)
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
by: Nachmani, Eliya, et al.
Published: (2023)
by: Nachmani, Eliya, et al.
Published: (2023)
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
by: Labrak, Yanis, et al.
Published: (2024)
by: Labrak, Yanis, et al.
Published: (2024)
End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering
by: Hu, Jiliang, et al.
Published: (2025)
by: Hu, Jiliang, et al.
Published: (2025)
The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
by: Cheng, Yi-Jie, et al.
Published: (2025)
by: Cheng, Yi-Jie, et al.
Published: (2025)
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
by: He, Jie, et al.
Published: (2023)
by: He, Jie, et al.
Published: (2023)
Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
by: Xie, Jingran, et al.
Published: (2025)
by: Xie, Jingran, et al.
Published: (2025)
The Prosody of Emojis
by: Zhou, Giulio, et al.
Published: (2025)
by: Zhou, Giulio, et al.
Published: (2025)
Pitch Accent Detection improves Pretrained Automatic Speech Recognition
by: Sasu, David, et al.
Published: (2025)
by: Sasu, David, et al.
Published: (2025)
Closing the Gap Between Text and Speech Understanding in LLMs
by: Cuervo, Santiago, et al.
Published: (2025)
by: Cuervo, Santiago, et al.
Published: (2025)
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
by: Oh, Hyung-Seok, et al.
Published: (2023)
by: Oh, Hyung-Seok, et al.
Published: (2023)
Inferential Question Answering
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models
by: Qian, Kaizhi, et al.
Published: (2025)
by: Qian, Kaizhi, et al.
Published: (2025)
Credible Plan-Driven RAG Method for Multi-Hop Question Answering
by: Zhang, Ningning, et al.
Published: (2025)
by: Zhang, Ningning, et al.
Published: (2025)
Beyond Static: Related Questions Retrieval Through Conversations in Community Question Answering
by: Ao, Xiao, et al.
Published: (2026)
by: Ao, Xiao, et al.
Published: (2026)
Table Question Answering for Low-resourced Indic Languages
by: Pal, Vaishali, et al.
Published: (2024)
by: Pal, Vaishali, et al.
Published: (2024)
LiGT: Layout-infused Generative Transformer for Visual Question Answering on Vietnamese Receipts
by: Le, Thanh-Phong, et al.
Published: (2025)
by: Le, Thanh-Phong, et al.
Published: (2025)
Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering
by: Zhang, Baiyan, et al.
Published: (2024)
by: Zhang, Baiyan, et al.
Published: (2024)
Consistency Training by Synthetic Question Generation for Conversational Question Answering
by: Hemati, Hamed Hematian, et al.
Published: (2024)
by: Hemati, Hamed Hematian, et al.
Published: (2024)
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024)
by: Xue, Chao, et al.
Published: (2024)
Selectively Answering Visual Questions
by: Eisenschlos, Julian Martin, et al.
Published: (2024)
by: Eisenschlos, Julian Martin, et al.
Published: (2024)
An Entity Linking Agent for Question Answering
by: Luo, Yajie, et al.
Published: (2025)
by: Luo, Yajie, et al.
Published: (2025)
PDF Retrieval Augmented Question Answering
by: Hoang, Thi Thu Uyen, et al.
Published: (2025)
by: Hoang, Thi Thu Uyen, et al.
Published: (2025)
Coal Mining Question Answering with LLMs
by: Rivera, Antonio Carlos, et al.
Published: (2024)
by: Rivera, Antonio Carlos, et al.
Published: (2024)
Structured List-Grounded Question Answering
by: Sung, Mujeen, et al.
Published: (2024)
by: Sung, Mujeen, et al.
Published: (2024)
AccurateRAG: A Framework for Building Accurate Retrieval-Augmented Question-Answering Applications
by: Nguyen, Linh The, et al.
Published: (2025)
by: Nguyen, Linh The, et al.
Published: (2025)
A Role-Aware Multi-Agent Framework for Financial Education Question Answering with LLMs
by: Zhu, Andy, et al.
Published: (2025)
by: Zhu, Andy, et al.
Published: (2025)
Medical Spoken Named Entity Recognition
by: Le-Duc, Khai, et al.
Published: (2024)
by: Le-Duc, Khai, et al.
Published: (2024)
No Verifiable Reward for Prosody: Toward Preference-Guided Prosody Learning in TTS
by: Shin, Seungyoun, et al.
Published: (2025)
by: Shin, Seungyoun, et al.
Published: (2025)
Similar Items
-
Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks
by: de Seyssel, Maureen, et al.
Published: (2025) -
Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models
by: Blandón, María Andrea Cruz, et al.
Published: (2025) -
Assessing the Role of Data Quality in Training Bilingual Language Models
by: Seto, Skyler, et al.
Published: (2025) -
Which Evaluation for Which Model? A Taxonomy for Speech Model Assessment
by: de Seyssel, Maureen, et al.
Published: (2025) -
Toward Machine Interpreting: Lessons from Human Interpreting Studies
by: Sperber, Matthias, et al.
Published: (2025)