:: Library Catalog

Bibliographic Details
Main Authors:	Chi, Jie; de Seyssel, Maureen; Schluter, Natalie
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.05389

Similar Items

Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks
by: de Seyssel, Maureen, et al.
Published: (2025)

Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models
by: Blandón, María Andrea Cruz, et al.
Published: (2025)

Assessing the Role of Data Quality in Training Bilingual Language Models
by: Seto, Skyler, et al.
Published: (2025)

Which Evaluation for Which Model? A Taxonomy for Speech Model Assessment
by: de Seyssel, Maureen, et al.
Published: (2025)

Toward Machine Interpreting: Lessons from Human Interpreting Studies
by: Sperber, Matthias, et al.
Published: (2025)

GSQA: An End-to-End Model for Generative Spoken Question Answering
by: Shih, Min-Han, et al.
Published: (2023)

Enhancing Speech Instruction Understanding and Disambiguation in Robotics via Speech Prosody
by: Sasu, David, et al.
Published: (2025)

Attention-guided Evidence Grounding for Spoken Question Answering
by: Yang, Ke, et al.
Published: (2026)

EmphAssess: A Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models
by: de Seyssel, Maureen, et al.
Published: (2023)

HeySQuAD: A Spoken Question Answering Dataset
by: Wu, Yijing, et al.
Published: (2023)

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
by: Lin, Chyi-Jiunn, et al.
Published: (2024)

LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models
by: Zhao, Zihan, et al.
Published: (2023)

Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
by: Nachmani, Eliya, et al.
Published: (2023)

Zero-Shot End-To-End Spoken Question Answering In Medical Domain
by: Labrak, Yanis, et al.
Published: (2024)

End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering
by: Hu, Jiliang, et al.
Published: (2025)

The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
by: Cheng, Yi-Jie, et al.
Published: (2025)

BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
by: He, Jie, et al.
Published: (2023)

Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
by: Xie, Jingran, et al.
Published: (2025)

The Prosody of Emojis
by: Zhou, Giulio, et al.
Published: (2025)

Pitch Accent Detection improves Pretrained Automatic Speech Recognition
by: Sasu, David, et al.
Published: (2025)

Closing the Gap Between Text and Speech Understanding in LLMs
by: Cuervo, Santiago, et al.
Published: (2025)

DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
by: Oh, Hyung-Seok, et al.
Published: (2023)

Inferential Question Answering
by: Mozafari, Jamshid, et al.
Published: (2026)

ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models
by: Qian, Kaizhi, et al.
Published: (2025)

Credible Plan-Driven RAG Method for Multi-Hop Question Answering
by: Zhang, Ningning, et al.
Published: (2025)

Beyond Static: Related Questions Retrieval Through Conversations in Community Question Answering
by: Ao, Xiao, et al.
Published: (2026)

Table Question Answering for Low-resourced Indic Languages
by: Pal, Vaishali, et al.
Published: (2024)

LiGT: Layout-infused Generative Transformer for Visual Question Answering on Vietnamese Receipts
by: Le, Thanh-Phong, et al.
Published: (2025)

Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering
by: Zhang, Baiyan, et al.
Published: (2024)

Consistency Training by Synthetic Question Generation for Conversational Question Answering
by: Hemati, Hamed Hematian, et al.
Published: (2024)

Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024)

Selectively Answering Visual Questions
by: Eisenschlos, Julian Martin, et al.
Published: (2024)

An Entity Linking Agent for Question Answering
by: Luo, Yajie, et al.
Published: (2025)

PDF Retrieval Augmented Question Answering
by: Hoang, Thi Thu Uyen, et al.
Published: (2025)

Coal Mining Question Answering with LLMs
by: Rivera, Antonio Carlos, et al.
Published: (2024)

Structured List-Grounded Question Answering
by: Sung, Mujeen, et al.
Published: (2024)

AccurateRAG: A Framework for Building Accurate Retrieval-Augmented Question-Answering Applications
by: Nguyen, Linh The, et al.
Published: (2025)

A Role-Aware Multi-Agent Framework for Financial Education Question Answering with LLMs
by: Zhu, Andy, et al.
Published: (2025)

Medical Spoken Named Entity Recognition
by: Le-Duc, Khai, et al.
Published: (2024)

No Verifiable Reward for Prosody: Toward Preference-Guided Prosody Learning in TTS
by: Shin, Seungyoun, et al.
Published: (2025)