Saved in:
| Main Authors: | Wang, Tsai-Ning, Chen, Lin-Lin, Zeghidour, Neil, Saeed, Aaqib |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.01199 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding
by: Wang, Tsai-Ning, et al.
Published: (2025)
by: Wang, Tsai-Ning, et al.
Published: (2025)
Adaptive Test-Time Scaling for Zero-Shot Respiratory Audio Classification
by: Wang, Tsai-Ning, et al.
Published: (2026)
by: Wang, Tsai-Ning, et al.
Published: (2026)
StethoLM: Audio Language Model for Cardiopulmonary Analysis Across Clinical Tasks
by: Wang, Yishan, et al.
Published: (2026)
by: Wang, Yishan, et al.
Published: (2026)
Electrocardiogram-Language Model for Few-Shot Question Answering with Meta Learning
by: Tang, Jialu, et al.
Published: (2024)
by: Tang, Jialu, et al.
Published: (2024)
Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling
by: Tang, Jialu, et al.
Published: (2024)
by: Tang, Jialu, et al.
Published: (2024)
UniPACT: A Multimodal Framework for Prognostic Question Answering on Raw ECG and Structured EHR
by: Tang, Jialu, et al.
Published: (2026)
by: Tang, Jialu, et al.
Published: (2026)
RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction
by: Zhang, Yuwei, et al.
Published: (2024)
by: Zhang, Yuwei, et al.
Published: (2024)
A Semantic-Sampling Framework for Evaluating Calibration in Open-Ended Question Answering
by: Wang, Zhanliang, et al.
Published: (2026)
by: Wang, Zhanliang, et al.
Published: (2026)
Unified Acoustic Representations for Screening Neurological and Respiratory Pathologies from Voice
by: Piao, Ran, et al.
Published: (2025)
by: Piao, Ran, et al.
Published: (2025)
AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability
by: Yang, Siwei, et al.
Published: (2024)
by: Yang, Siwei, et al.
Published: (2024)
PedSleepMAE: Generative Model for Multimodal Pediatric Sleep Signals
by: Pandey, Saurav R., et al.
Published: (2024)
by: Pandey, Saurav R., et al.
Published: (2024)
Aligning Spoken Dialogue Models from User Interactions
by: Wu, Anne, et al.
Published: (2025)
by: Wu, Anne, et al.
Published: (2025)
RA-QA: A Benchmarking System for Respiratory Audio Question Answering Under Real-World Heterogeneity
by: Bertolino, Gaia A., et al.
Published: (2026)
by: Bertolino, Gaia A., et al.
Published: (2026)
Scaling Open-Ended Reasoning to Predict the Future
by: Chandak, Nikhil, et al.
Published: (2025)
by: Chandak, Nikhil, et al.
Published: (2025)
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
by: Min, Juhong, et al.
Published: (2024)
by: Min, Juhong, et al.
Published: (2024)
Collaboratively Learning Federated Models from Noisy Decentralized Data
by: Li, Haoyuan, et al.
Published: (2024)
by: Li, Haoyuan, et al.
Published: (2024)
Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners
by: Pham, Hung Manh, et al.
Published: (2024)
by: Pham, Hung Manh, et al.
Published: (2024)
AQA-TTRL: Self-Adaptation in Audio Question Answering with Test-Time Reinforcement Learning
by: Zhang, Haoyu, et al.
Published: (2025)
by: Zhang, Haoyu, et al.
Published: (2025)
Beyond Prompting: An Efficient Embedding Framework for Open-Domain Question Answering
by: Hu, Zhanghao, et al.
Published: (2025)
by: Hu, Zhanghao, et al.
Published: (2025)
Federated Learning with a Single Shared Image
by: Soni, Sunny, et al.
Published: (2024)
by: Soni, Sunny, et al.
Published: (2024)
Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning
by: Ye, Zhiling, et al.
Published: (2025)
by: Ye, Zhiling, et al.
Published: (2025)
FAST: Federated Active Learning with Foundation Models for Communication-efficient Sampling and Training
by: Li, Haoyuan, et al.
Published: (2025)
by: Li, Haoyuan, et al.
Published: (2025)
MCU: An Evaluation Framework for Open-Ended Game Agents
by: Zheng, Xinyue, et al.
Published: (2023)
by: Zheng, Xinyue, et al.
Published: (2023)
Robust Agents in Open-Ended Worlds
by: Samvelyan, Mikayel
Published: (2025)
by: Samvelyan, Mikayel
Published: (2025)
OWLViz: An Open-World Benchmark for Visual Question Answering
by: Nguyen, Thuy, et al.
Published: (2025)
by: Nguyen, Thuy, et al.
Published: (2025)
COREVQA: A Crowd Observation and Reasoning Entailment Visual Question Answering Benchmark
by: Chintapatla, Ishant, et al.
Published: (2025)
by: Chintapatla, Ishant, et al.
Published: (2025)
Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation
by: Tsouvalas, Vasileios, et al.
Published: (2024)
by: Tsouvalas, Vasileios, et al.
Published: (2024)
AQA: a multilingual Anaphora annotation scheme for Question Answering
by: E. Boldrini
Published: (2009)
by: E. Boldrini
Published: (2009)
Towards Multilingual Audio-Visual Question Answering
by: Phukan, Orchid Chetia, et al.
Published: (2024)
by: Phukan, Orchid Chetia, et al.
Published: (2024)
A Curriculum Learning Approach to Reinforcement Learning: Leveraging RAG for Multimodal Question Answering
by: Zhang, Chenliang, et al.
Published: (2025)
by: Zhang, Chenliang, et al.
Published: (2025)
Knowing When to Answer: Adaptive Confidence Refinement for Reliable Audio-Visual Question Answering
by: Tran, Dinh Phu, et al.
Published: (2026)
by: Tran, Dinh Phu, et al.
Published: (2026)
Audio Question Answering with GRPO-Based Fine-Tuning and Calibrated Segment-Level Predictions
by: Gibier, Marcel, et al.
Published: (2025)
by: Gibier, Marcel, et al.
Published: (2025)
Biomedical Entity Linking as Multiple Choice Question Answering
by: Lin, Zhenxi, et al.
Published: (2024)
by: Lin, Zhenxi, et al.
Published: (2024)
MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
by: Saeed, Nadia
Published: (2024)
by: Saeed, Nadia
Published: (2024)
LongAudio-RAG: Event-Grounded Question Answering over Multi-Hour Long Audio
by: Vakada, Naveen, et al.
Published: (2026)
by: Vakada, Naveen, et al.
Published: (2026)
Learning under Label Noise through Few-Shot Human-in-the-Loop Refinement
by: Saeed, Aaqib, et al.
Published: (2024)
by: Saeed, Aaqib, et al.
Published: (2024)
CausalEvolve: Towards Open-Ended Discovery with Causal Scratchpad
by: Chen, Yongqiang, et al.
Published: (2026)
by: Chen, Yongqiang, et al.
Published: (2026)
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
by: Li, Linyi, et al.
Published: (2024)
by: Li, Linyi, et al.
Published: (2024)
Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning
by: Zhang, Zheng, et al.
Published: (2026)
by: Zhang, Zheng, et al.
Published: (2026)
AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering
by: Kuan, Chun-Yi, et al.
Published: (2026)
by: Kuan, Chun-Yi, et al.
Published: (2026)
Similar Items
-
Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding
by: Wang, Tsai-Ning, et al.
Published: (2025) -
Adaptive Test-Time Scaling for Zero-Shot Respiratory Audio Classification
by: Wang, Tsai-Ning, et al.
Published: (2026) -
StethoLM: Audio Language Model for Cardiopulmonary Analysis Across Clinical Tasks
by: Wang, Yishan, et al.
Published: (2026) -
Electrocardiogram-Language Model for Few-Shot Question Answering with Meta Learning
by: Tang, Jialu, et al.
Published: (2024) -
Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling
by: Tang, Jialu, et al.
Published: (2024)