:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mayr, Roman, Schimpf, Michel, Bohné, Thomas
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.16792
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability
by: Schimpf, Michel, et al.
Published: (2026)

Can AI Help You Get Over Your Breakup? One Session with a Belief-Reframing Chatbot Shows Sustained Distress Reduction
by: Menzel, Thomas, et al.
Published: (2026)

Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
by: Zhao, Yue, et al.
Published: (2025)

GCAgent: Enhancing Group Chat Communication through Dialogue Agents System
by: Meng, Zijie, et al.
Published: (2026)

An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems
by: Inoue, Koji, et al.
Published: (2024)

Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems
by: Kazi, Taaha, et al.
Published: (2024)

Development and Validation of Engagement and Rapport Scales for Evaluating User Experience in Multimodal Dialogue Systems
by: Kurata, Fuma, et al.
Published: (2025)

Chatting with Papers: A Hybrid Approach Using LLMs and Knowledge Graphs
by: Tykhonov, Vyacheslav, et al.
Published: (2025)

Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
by: Wang, Jian, et al.
Published: (2024)

ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
by: Hong, Zhaochen, et al.
Published: (2025)

ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
by: Li, Zhigen, et al.
Published: (2024)

Stakeholder Participation for Responsible AI Development: Disconnects Between Guidance and Current Practice
by: Kallina, Emma, et al.
Published: (2025)

User Review Writing via Interview with Dialogue Systems
by: Tanaka, Yoshiki, et al.
Published: (2026)

Enhancing User Performance and Human Factors through Visual Guidance in AR Assembly Tasks
by: Pietschmann, Leon, et al.
Published: (2025)

User Simulation for Evaluating Information Access Systems
by: Balog, Krisztian, et al.
Published: (2023)

Vibe Checker: Aligning Code Evaluation with Human Preference
by: Zhong, Ming, et al.
Published: (2025)

WebChecker: A Versatile EVL Plugin for Validating HTML Pages with Bootstrap Frameworks
by: Cherukuri, Milind
Published: (2025)

RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems
by: Chen, Luyu, et al.
Published: (2025)

LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
by: Feng, Xiaoning, et al.
Published: (2022)

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
by: Niu, Cheng, et al.
Published: (2024)

PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
by: Kong, Chuyi, et al.
Published: (2023)

DialSim: A Dialogue Simulator for Evaluating Long-Term Multi-Party Dialogue Understanding of Conversational Agents
by: Kim, Jiho, et al.
Published: (2024)

The StudyChat Dataset: Analyzing Student Dialogues With ChatGPT in an Artificial Intelligence Course
by: McNichols, Hunter, et al.
Published: (2025)

ATOD: An Evaluation Framework and Benchmark for Agentic Task-Oriented Dialogue Systems
by: Zhang, Yifei, et al.
Published: (2026)

Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups
by: Qi, Zhiyang, et al.
Published: (2024)

Decision-aware User Simulation Agent for Evaluating Conversational Recommender Systems
by: Li, Yuan-Chi, et al.
Published: (2026)

ChatCLIDS: Simulating Persuasive AI Dialogues to Promote Closed-Loop Insulin Adoption in Type 1 Diabetes Care
by: Yao, Zonghai, et al.
Published: (2025)

DialogueForge: LLM Simulation of Human-Chatbot Dialogue
by: Zhu, Ruizhe, et al.
Published: (2025)

DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
by: Luo, Xiang, et al.
Published: (2024)

Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models
by: Song, Sangmin, et al.
Published: (2025)

Knowledge-Augmented Explainable and Interpretable Learning for Anomaly Detection and Diagnosis
by: Atzmueller, Martin, et al.
Published: (2024)

Prosa: Rubric-Based Evaluation of LLMs on Real User Chats in Brazilian Portuguese
by: Junior, Roseval Malaquias, et al.
Published: (2026)

AICCE: AI Driven Compliance Checker Engine
by: Rahman, Mohammad Wali Ur, et al.
Published: (2026)

Saliency Map-Guided Knowledge Discovery for Subclass Identification with LLM-Based Symbolic Approximations
by: Bohne, Tim, et al.
Published: (2025)

Evaluating Explanations Through LLMs: Beyond Traditional User Studies
by: De Bona, Francesco Bombassei, et al.
Published: (2024)

Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems
by: Chen, Yinzhu, et al.
Published: (2026)

Safety Alignment of LMs via Non-cooperative Games
by: Paulus, Anselm, et al.
Published: (2025)

Can Community Notes Replace Professional Fact-Checkers?
by: Borenstein, Nadav, et al.
Published: (2025)

Efficient Agent Evaluation via Diversity-Guided User Simulation
by: Nakash, Itay, et al.
Published: (2026)

SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation
by: Bougie, Nicolas, et al.
Published: (2025)