| Main Authors: | Mayr, Roman; Schimpf, Michel; Bohné, Thomas |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.16792 |
Similar Items
AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability
by: Schimpf, Michel, et al.
Published: (2026)
Can AI Help You Get Over Your Breakup? One Session with a Belief-Reframing Chatbot Shows Sustained Distress Reduction
by: Menzel, Thomas, et al.
Published: (2026)
Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
by: Zhao, Yue, et al.
Published: (2025)
GCAgent: Enhancing Group Chat Communication through Dialogue Agents System
by: Meng, Zijie, et al.
Published: (2026)
An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems
by: Inoue, Koji, et al.
Published: (2024)
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems
by: Kazi, Taaha, et al.
Published: (2024)
Development and Validation of Engagement and Rapport Scales for Evaluating User Experience in Multimodal Dialogue Systems
by: Kurata, Fuma, et al.
Published: (2025)
Chatting with Papers: A Hybrid Approach Using LLMs and Knowledge Graphs
by: Tykhonov, Vyacheslav, et al.
Published: (2025)
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
by: Wang, Jian, et al.
Published: (2024)
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)
ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
by: Li, Zhigen, et al.
Published: (2024)
Stakeholder Participation for Responsible AI Development: Disconnects Between Guidance and Current Practice
by: Kallina, Emma, et al.
Published: (2025)
User Review Writing via Interview with Dialogue Systems
by: Tanaka, Yoshiki, et al.
Published: (2026)
Enhancing User Performance and Human Factors through Visual Guidance in AR Assembly Tasks
by: Pietschmann, Leon, et al.
Published: (2025)
User Simulation for Evaluating Information Access Systems
by: Balog, Krisztian, et al.
Published: (2023)
Vibe Checker: Aligning Code Evaluation with Human Preference
by: Zhong, Ming, et al.
Published: (2025)
WebChecker: A Versatile EVL Plugin for Validating HTML Pages with Bootstrap Frameworks
by: Cherukuri, Milind
Published: (2025)
RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems
by: Chen, Luyu, et al.
Published: (2025)
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
by: Feng, Xiaoning, et al.
Published: (2022)
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
by: Niu, Cheng, et al.
Published: (2024)
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
by: Kong, Chuyi, et al.
Published: (2023)
DialSim: A Dialogue Simulator for Evaluating Long-Term Multi-Party Dialogue Understanding of Conversational Agents
by: Kim, Jiho, et al.
Published: (2024)
The StudyChat Dataset: Analyzing Student Dialogues With ChatGPT in an Artificial Intelligence Course
by: McNichols, Hunter, et al.
Published: (2025)
ATOD: An Evaluation Framework and Benchmark for Agentic Task-Oriented Dialogue Systems
by: Zhang, Yifei, et al.
Published: (2026)
Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups
by: Qi, Zhiyang, et al.
Published: (2024)
Decision-aware User Simulation Agent for Evaluating Conversational Recommender Systems
by: Li, Yuan-Chi, et al.
Published: (2026)
ChatCLIDS: Simulating Persuasive AI Dialogues to Promote Closed-Loop Insulin Adoption in Type 1 Diabetes Care
by: Yao, Zonghai, et al.
Published: (2025)
DialogueForge: LLM Simulation of Human-Chatbot Dialogue
by: Zhu, Ruizhe, et al.
Published: (2025)
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
by: Luo, Xiang, et al.
Published: (2024)
Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models
by: Song, Sangmin, et al.
Published: (2025)
Knowledge-Augmented Explainable and Interpretable Learning for Anomaly Detection and Diagnosis
by: Atzmueller, Martin, et al.
Published: (2024)
Prosa: Rubric-Based Evaluation of LLMs on Real User Chats in Brazilian Portuguese
by: Junior, Roseval Malaquias, et al.
Published: (2026)
AICCE: AI Driven Compliance Checker Engine
by: Rahman, Mohammad Wali Ur, et al.
Published: (2026)
Saliency Map-Guided Knowledge Discovery for Subclass Identification with LLM-Based Symbolic Approximations
by: Bohne, Tim, et al.
Published: (2025)
Evaluating Explanations Through LLMs: Beyond Traditional User Studies
by: De Bona, Francesco Bombassei, et al.
Published: (2024)
Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems
by: Chen, Yinzhu, et al.
Published: (2026)
Safety Alignment of LMs via Non-cooperative Games
by: Paulus, Anselm, et al.
Published: (2025)
Can Community Notes Replace Professional Fact-Checkers?
by: Borenstein, Nadav, et al.
Published: (2025)
Efficient Agent Evaluation via Diversity-Guided User Simulation
by: Nakash, Itay, et al.
Published: (2026)
SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation
by: Bougie, Nicolas, et al.
Published: (2025)