| Main Authors: | Srinivasan, Adarsh, Dineen, Jacob, Afzal, Muhammad Umar, Sarfraz, Muhammad Uzair, Riaz, Irbaz B., Zhou, Ben |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.10746 |
Similar Items
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
by: Ye, Xiao, et al.
Published: (2025)
EviSearch: A Human in the Loop System for Extracting and Auditing Clinical Evidence for Systematic Reviews
by: Ahuja, Naman, et al.
Published: (2026)
Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution
by: Dineen, Jacob, et al.
Published: (2026)
BOW: Reinforcement Learning for Bottlenecked Next Word Prediction
by: Shen, Ming, et al.
Published: (2025)
UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking
by: Ahmad, Sarfraz, et al.
Published: (2025)
LLMs as Data Annotators: How Close Are We to Human Performance
by: Haq, Muhammad Uzair Ul, et al.
Published: (2025)
SCOPE: Planning for Hybrid Querying over Clinical Trial Data
by: Chowdhury, Suparno Roy, et al.
Published: (2026)
FD-NL2SQL: Feedback-Driven Clinical NL2SQL that Improves with Use
by: Chowdhury, Suparno Roy, et al.
Published: (2026)
CTC-DID: CTC-Based Arabic dialect identification for streaming applications
by: Farooq, Muhammad Umar, et al.
Published: (2026)
ESCoT: Towards Interpretable Emotional Support Dialogue Systems
by: Zhang, Tenggan, et al.
Published: (2024)
ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
by: Liu, Qin, et al.
Published: (2025)
Promptception: How Sensitive Are Large Multimodal Models to Prompts?
by: Ismithdeen, Mohamed Insaf, et al.
Published: (2025)
SAGE: Spliced-Audio Generated Data for Enhancing Foundational Models in Low-Resource Arabic-English Code-Switched Speech Recognition
by: Farooq, Muhammad Umar, et al.
Published: (2025)
From Text to Talent: A Pipeline for Extracting Insights from Candidate Profiles
by: Frazzetto, Paolo, et al.
Published: (2025)
ThinkTuning: Instilling Cognitive Reflections without Distillation
by: RRV, Aswin, et al.
Published: (2025)
ToW: Thoughts of Words Improve Reasoning in Large Language Models
by: Xu, Zhikun, et al.
Published: (2024)
CC-LEARN: Cohort-based Consistency Learning
by: Ye, Xiao, et al.
Published: (2025)
Building Trust in Clinical LLMs: Bias Analysis and Dataset Transparency
by: Maslenkova, Svetlana, et al.
Published: (2025)
Exploring LLM-based Data Annotation Strategies for Medical Dialogue Preference Alignment
by: Dou, Chengfeng, et al.
Published: (2024)
CoDial: Interpretable Task-Oriented Dialogue Systems Through Dialogue Flow Alignment
by: Shayanfar, Radin, et al.
Published: (2025)
EthicMind: A Risk-Aware Framework for Ethical-Emotional Alignment in Multi-Turn Dialogue
by: Deng, Jiawen, et al.
Published: (2026)
Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment
by: Xu, Kaishuai, et al.
Published: (2024)
Think out Loud: Emotion Deducing Explanation in Dialogues
by: Li, Jiangnan, et al.
Published: (2024)
Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation
by: Feng, Shutong, et al.
Published: (2025)
Infusing Emotions into Task-oriented Dialogue Systems: Understanding, Management, and Generation
by: Feng, Shutong, et al.
Published: (2024)
EmplifAI: a Fine-grained Dataset for Japanese Empathetic Medical Dialogues in 28 Emotion Labels
by: She, Wan Jou, et al.
Published: (2026)
Artificial intelligence across the cancer care continuum
by: Irbaz Bin Riaz, et al.
Published: (2025)
Inference Acceleration for Large Language Models on CPUs
by: PS, Ditto, et al.
Published: (2024)
QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
by: Dineen, Jacob, et al.
Published: (2025)
Enhancing Long-Range Dependency with State Space Model and Kolmogorov-Arnold Networks for Aspect-Based Sentiment Analysis
by: Lawan, Adamu, et al.
Published: (2024)
Efficient Hybrid Inference for LLMs: Reward-Based Token Modelling with Selective Cloud Assistance
by: MS, Adarsh, et al.
Published: (2024)
Dialogue Systems for Emotional Support via Value Reinforcement
by: Kim, Juhee, et al.
Published: (2025)
Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems
by: Chen, Yinzhu, et al.
Published: (2026)
A Methodology for Identifying Evaluation Items for Practical Dialogue Systems Based on Business-Dialogue System Alignment Models
by: Nakano, Mikio, et al.
Published: (2026)
Detecting Propaganda Techniques in Code-Switched Social Media Text
by: Salman, Muhammad Umar, et al.
Published: (2023)
FActBench: A Benchmark for Fine-grained Automatic Evaluation of LLM-Generated Text in the Medical Domain
by: Afzal, Anum, et al.
Published: (2025)
Under Pressure: Emotional Framing Induces Measurable Behavioral Shifts and Structured Internal Geometry in Small Language Models
by: Usman, Rana Muhammad
Published: (2026)
EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems
by: Liu, Jingwen, et al.
Published: (2025)
Inference Time Alignment with Reward-Guided Tree Search
by: Hung, Chia-Yu, et al.
Published: (2024)
IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems
by: Zhang, Xinjie, et al.
Published: (2025)