:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zhang, Hongbin, Gao, Ning, Dai, Yuqin, Wu, Ruiyuan, Wang, Jinpeng, Gao, Rena Wei, Tan, Bingdong, Gao, Shuzheng, Li, Zongjie, Wang, Chaozheng
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2605.22240
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue
von: Gao, Ning, et al.
Veröffentlicht: (2026)

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue
von: Dai, Yuqin, et al.
Veröffentlicht: (2026)

SAGE: A Service Agent Graph-guided Evaluation Benchmark
von: Shi, Ling, et al.
Veröffentlicht: (2026)

Empirical Study of Code Large Language Models for Binary Security Patch Detection
von: Li, Qingyuan, et al.
Veröffentlicht: (2025)

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents
von: Si, Shuzheng, et al.
Veröffentlicht: (2023)

Split and Merge: Aligning Position Biases in LLM-based Evaluators
von: Li, Zongjie, et al.
Veröffentlicht: (2023)

The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation
von: Gao, Shuzheng, et al.
Veröffentlicht: (2025)

Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance
von: Wang, Yufeng, et al.
Veröffentlicht: (2025)

WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making
von: Li, Zongjie, et al.
Veröffentlicht: (2026)

Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents
von: Yoon, Yejin, et al.
Veröffentlicht: (2025)

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
von: Ulmer, Dennis, et al.
Veröffentlicht: (2024)

'No' Matters: Out-of-Distribution Detection in Multimodality Long Dialogue
von: Gao, Rena, et al.
Veröffentlicht: (2024)

TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
von: Gao, Shuzheng, et al.
Veröffentlicht: (2025)

Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests
von: Mannekote, Amogh, et al.
Veröffentlicht: (2024)

Pseudo-Siamese Network for Planning in Target-Oriented Proactive Dialogues
von: Kang, Xinyue, et al.
Veröffentlicht: (2026)

Search-Based LLMs for Code Optimization
von: Gao, Shuzheng, et al.
Veröffentlicht: (2024)

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
von: Du, Huifang, et al.
Veröffentlicht: (2024)

Self-Instructed Derived Prompt Generation Meets In-Context Learning: Unlocking New Potential of Black-Box LLMs
von: Li, Zhuo, et al.
Veröffentlicht: (2024)

Non-Cross Diffusion for Semantic Consistency
von: Zheng, Ziyang, et al.
Veröffentlicht: (2023)

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode
von: Ji, Zimo, et al.
Veröffentlicht: (2026)

Proactive Memory for Ad-Hoc Recall over Streaming Dialogues
von: Wang, Bingbing, et al.
Veröffentlicht: (2026)

SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning
von: Gao, Shuzheng, et al.
Veröffentlicht: (2025)

Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models
von: Li, Zongjie, et al.
Veröffentlicht: (2025)

Long-term Task-oriented Agent: Proactive Long-term Intent Maintenance in Dynamic Environments
von: Shi, Qinglong, et al.
Veröffentlicht: (2026)

VidAudio-Bench: Benchmarking V2A and VT2A Generation across Four Audio Categories
von: Zhang, Qian, et al.
Veröffentlicht: (2026)

Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
von: Wang, Jian, et al.
Veröffentlicht: (2024)

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
von: Si, Shuzheng, et al.
Veröffentlicht: (2025)

DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
von: Luo, Xiang, et al.
Veröffentlicht: (2024)

SID: Benchmarking Guided Instruction Capabilities in STEM Education with a Socratic Interdisciplinary Dialogues Dataset
von: Jiang, Mei, et al.
Veröffentlicht: (2025)

Towards Automatic Evaluation of Task-Oriented Dialogue Flows
von: Mirtaheri, Mehrnoosh, et al.
Veröffentlicht: (2024)

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks
von: Ji, Zimo, et al.
Veröffentlicht: (2025)

Redefining Proactivity for Information Seeking Dialogue
von: Lee, Jing Yang, et al.
Veröffentlicht: (2024)

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
von: Zhang, Ming, et al.
Veröffentlicht: (2024)

Differentiation-Based Extraction of Proprietary Data from Fine-Tuned LLMs
von: Li, Zongjie, et al.
Veröffentlicht: (2025)

API-guided Dataset Synthesis to Finetune Large Code Models
von: Li, Zongjie, et al.
Veröffentlicht: (2024)

UMoE: Unifying Attention and FFN with Shared Experts
von: Yang, Yuanhang, et al.
Veröffentlicht: (2025)

HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals
von: Mo, Lingbo, et al.
Veröffentlicht: (2024)

HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues
von: Zhong, Yijie, et al.
Veröffentlicht: (2026)

Using Medical Algorithms for Task-Oriented Dialogue in LLM-Based Medical Interviews
von: Reis, Rui, et al.
Veröffentlicht: (2025)

TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
von: Wu, Yebo, et al.
Veröffentlicht: (2026)