:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lee, Seungkyu, Kim, Nalim, Jo, Yohan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.01560
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents
by: Seo, Gyuhyeon, et al.
Published: (2025)

KMI: A Dataset of Korean Motivational Interviewing Dialogues for Psychotherapy
by: Kim, Hyunjong, et al.
Published: (2025)

Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction
by: Lee, Yooseop, et al.
Published: (2025)

PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings
by: Kim, Junseo, et al.
Published: (2025)

Thinking Like a Doctor: Conversational Diagnosis through the Exploration of Diagnostic Knowledge Graphs
by: Won, Jeongmoon, et al.
Published: (2026)

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement
by: Kong, Injin, et al.
Published: (2026)

PKG API: A Tool for Personal Knowledge Graph Management
by: Bernard, Nolwenn, et al.
Published: (2024)

Infant Agent: A Tool-Integrated, Logic-Driven Agent with Cost-Effective API Usage
by: Lei, Bin, et al.
Published: (2024)

Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models
by: Kong, Injin, et al.
Published: (2026)

Context-Robust Knowledge Editing for Language Models
by: Park, Haewon, et al.
Published: (2025)

Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information
by: Chae, Kyubyung, et al.
Published: (2024)

ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
by: Song, Sangjun, et al.
Published: (2025)

Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
by: Lim, Sungjib, et al.
Published: (2025)

Model-based Preference Optimization in Abstractive Summarization without Human Feedback
by: Choi, Jaepill, et al.
Published: (2024)

Ever-Evolving Memory by Blending and Refining the Past
by: Kim, Seo Hyun, et al.
Published: (2024)

DialSim: A Dialogue Simulator for Evaluating Long-Term Multi-Party Dialogue Understanding of Conversational Agents
by: Kim, Jiho, et al.
Published: (2024)

API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
by: Guo, Zhen, et al.
Published: (2024)

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction
by: Park, Sejun, et al.
Published: (2026)

Improving Dialogue State Tracking through Combinatorial Search for In-Context Examples
by: Pyun, Haesung, et al.
Published: (2025)

R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs
by: Jo, Sumin, et al.
Published: (2025)

Pre-Storage Reasoning for Episodic Memory: Shifting Inference Burden to Memory for Personalized Dialogue
by: Kim, Sangyeop, et al.
Published: (2025)

SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
by: Ding, Keyan, et al.
Published: (2025)

Dialogue Systems for Emotional Support via Value Reinforcement
by: Kim, Juhee, et al.
Published: (2025)

Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents
by: Kim, Wonjoong, et al.
Published: (2025)

Human Psychometric Questionnaires Mischaracterize LLM Behavior
by: Song, Woojung, et al.
Published: (2025)

StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs
by: Guo, Zhicheng, et al.
Published: (2025)

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
by: Basu, Kinjal, et al.
Published: (2024)

Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity
by: Kim, Doyoung, et al.
Published: (2026)

Deterministic Legal Agents: A Canonical Primitive API for Auditable Reasoning over Temporal Knowledge Graphs
by: de Martim, Hudson
Published: (2025)

The Amazing Agent Race: Strong Tool Users, Weak Navigators
by: Kim, Zae Myung, et al.
Published: (2026)

Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items
by: Han, Jongwook, et al.
Published: (2025)

MIST: Multimodal Interactive Speech-based Tool-calling Conversational Assistants for Smart Homes
by: Chen, Maximillian, et al.
Published: (2026)

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
by: Zhao, Lirui, et al.
Published: (2024)

ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
by: Ni, Xinyi, et al.
Published: (2025)

KL for a KL: On-Policy Distillation with Control Variate Baseline
by: Oh, Minjae, et al.
Published: (2026)

Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation
by: Qiao, Yuxuan, et al.
Published: (2025)

GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning
by: Wu, Jiaqi, et al.
Published: (2025)

Towards Human-like Multimodal Conversational Agent by Generating Engaging Speech
by: Kim, Taesoo, et al.
Published: (2025)

Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model
by: Kim, Daehee, et al.
Published: (2024)

LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation
by: Park, Junyeong, et al.
Published: (2025)