Saved in:
| Main Authors: | Devanathan, Rishikesh, Nathan, Varun, Kumar, Ayush |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.18210 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Inverse Scaling: When Bigger Isn't Better
by: McKenzie, Ian R., et al.
Published: (2023)
by: McKenzie, Ian R., et al.
Published: (2023)
Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?
by: Zhang, Yue, et al.
Published: (2026)
by: Zhang, Yue, et al.
Published: (2026)
Reasoning Isn't Enough: Examining Truth-Bias and Sycophancy in LLMs
by: Barkett, Emilio, et al.
Published: (2025)
by: Barkett, Emilio, et al.
Published: (2025)
Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
by: Xu, Xiaoyu, et al.
Published: (2025)
by: Xu, Xiaoyu, et al.
Published: (2025)
Spot the BlindSpots: Systematic Identification and Quantification of Fine-Grained LLM Biases in Contact Center Summaries
by: Mayilvaghanan, Kawin, et al.
Published: (2025)
by: Mayilvaghanan, Kawin, et al.
Published: (2025)
Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate
by: Wynn, Andrea, et al.
Published: (2025)
by: Wynn, Andrea, et al.
Published: (2025)
Seeing Isn't Believing: Mitigating Belief Inertia via Active Intervention in Embodied Agents
by: Wang, Hanlin, et al.
Published: (2026)
by: Wang, Hanlin, et al.
Published: (2026)
When Correct Isn't Usable: Improving Structured Output Reliability in Small Language Models
by: Galeone, Cosimo, et al.
Published: (2026)
by: Galeone, Cosimo, et al.
Published: (2026)
Recall Isn't Enough: Bounding Commitments in Personalized Language Systems
by: Tang, Rui, et al.
Published: (2026)
by: Tang, Rui, et al.
Published: (2026)
Tool-Aware Planning in Contact Center AI: Evaluating LLMs through Lineage-Guided Query Decomposition
by: Nathan, Varun, et al.
Published: (2026)
by: Nathan, Varun, et al.
Published: (2026)
Why Isn't Relational Learning Taking Over the World?
by: Poole, David
Published: (2025)
by: Poole, David
Published: (2025)
Synthetic Dialogue Dataset Generation using LLM Agents
by: Abdullin, Yelaman, et al.
Published: (2024)
by: Abdullin, Yelaman, et al.
Published: (2024)
Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)
by: Ryu, Moonkyung, et al.
Published: (2025)
When Privacy Isn't Synthetic: Hidden Data Leakage in Generative AI Models
by: Mustaqim, S. M., et al.
Published: (2025)
by: Mustaqim, S. M., et al.
Published: (2025)
When Pretty Isn't Useful: Investigating Why Modern Text-to-Image Models Fail as Reliable Training Data Generators
by: Adamkiewicz, Krzysztof, et al.
Published: (2026)
by: Adamkiewicz, Krzysztof, et al.
Published: (2026)
IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems
by: Zhang, Xinjie, et al.
Published: (2025)
by: Zhang, Xinjie, et al.
Published: (2025)
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
by: Zhang, Dongxu, et al.
Published: (2024)
by: Zhang, Dongxu, et al.
Published: (2024)
A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts
by: Bedrick, Steven, et al.
Published: (2025)
by: Bedrick, Steven, et al.
Published: (2025)
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues
by: Binici, Kuluhan, et al.
Published: (2024)
by: Binici, Kuluhan, et al.
Published: (2024)
Persona-Aware Alignment Framework for Personalized Dialogue Generation
by: Li, Guanrong, et al.
Published: (2025)
by: Li, Guanrong, et al.
Published: (2025)
Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests
by: Mannekote, Amogh, et al.
Published: (2024)
by: Mannekote, Amogh, et al.
Published: (2024)
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
by: Song, Haoyu, et al.
Published: (2024)
by: Song, Haoyu, et al.
Published: (2024)
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation
by: Zhang, Wei-Nan, et al.
Published: (2024)
by: Zhang, Wei-Nan, et al.
Published: (2024)
MedSynth: Realistic, Synthetic Medical Dialogue-Note Pairs
by: Mianroodi, Ahmad Rezaie, et al.
Published: (2025)
by: Mianroodi, Ahmad Rezaie, et al.
Published: (2025)
MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation
by: He, Junqing, et al.
Published: (2024)
by: He, Junqing, et al.
Published: (2024)
Language Models' Factuality Depends on the Language of Inquiry
by: Aggarwal, Tushar, et al.
Published: (2025)
by: Aggarwal, Tushar, et al.
Published: (2025)
Real-Time Textless Dialogue Generation
by: Mai, Long, et al.
Published: (2025)
by: Mai, Long, et al.
Published: (2025)
LLM-Based Section Identifiers Excel on Open Source but Stumble in Real World Applications
by: Krishnamoorthy, Saranya, et al.
Published: (2024)
by: Krishnamoorthy, Saranya, et al.
Published: (2024)
Synthetic Patient-Physician Dialogue Generation from Clinical Notes Using LLM
by: Das, Trisha, et al.
Published: (2024)
by: Das, Trisha, et al.
Published: (2024)
Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet
by: Chen, Weizhe, et al.
Published: (2024)
by: Chen, Weizhe, et al.
Published: (2024)
Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues
by: Bayrak, Ahmet Tuğrul, et al.
Published: (2026)
by: Bayrak, Ahmet Tuğrul, et al.
Published: (2026)
From Medical Records to Diagnostic Dialogues: A Clinical-Grounded Approach and Dataset for Psychiatric Comorbidity
by: Wan, Tianxi, et al.
Published: (2025)
by: Wan, Tianxi, et al.
Published: (2025)
Synthetic Image Verification in the Era of Generative AI: What Works and What Isn't There Yet
by: Tariang, Diangarti, et al.
Published: (2024)
by: Tariang, Diangarti, et al.
Published: (2024)
DIAL-SUMMER: A Structured Evaluation Framework of Hierarchical Errors in Dialogue Summaries
by: Ramnath, Sahana, et al.
Published: (2026)
by: Ramnath, Sahana, et al.
Published: (2026)
GPT-4V Cannot Generate Radiology Reports Yet
by: Jiang, Yuyang, et al.
Published: (2024)
by: Jiang, Yuyang, et al.
Published: (2024)
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
by: Hui, Zheng, et al.
Published: (2024)
by: Hui, Zheng, et al.
Published: (2024)
Semantic Flow Regularization: Teaching LLMs to Generate Diverse Yet Coherent Responses
by: Peng, Kerui, et al.
Published: (2026)
by: Peng, Kerui, et al.
Published: (2026)
Tell Me Why: Designing an Explainable LLM-based Dialogue System for Student Problem Behavior Diagnosis
by: Fan, Zhilin, et al.
Published: (2026)
by: Fan, Zhilin, et al.
Published: (2026)
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
by: Medjad, Maya, et al.
Published: (2025)
by: Medjad, Maya, et al.
Published: (2025)
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
by: Cao, Zhuo, et al.
Published: (2025)
by: Cao, Zhuo, et al.
Published: (2025)
Similar Items
-
Inverse Scaling: When Bigger Isn't Better
by: McKenzie, Ian R., et al.
Published: (2023) -
Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?
by: Zhang, Yue, et al.
Published: (2026) -
Reasoning Isn't Enough: Examining Truth-Bias and Sycophancy in LLMs
by: Barkett, Emilio, et al.
Published: (2025) -
Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
by: Xu, Xiaoyu, et al.
Published: (2025) -
Spot the BlindSpots: Systematic Identification and Quantification of Fine-Grained LLM Biases in Contact Center Summaries
by: Mayilvaghanan, Kawin, et al.
Published: (2025)