Saved in:
| Main Authors: | Chae, Hyungjoo, Kim, Namyoung, Ong, Kai Tzu-iunn, Gwak, Minju, Song, Gwanwoo, Kim, Jihoon, Kim, Sunghwan, Lee, Dongha, Yeo, Jinyoung |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.13232 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Lifelong Dialogue Agents via Timeline-based Memory Management
by: Ong, Kai Tzu-iunn, et al.
Published: (2024)
by: Ong, Kai Tzu-iunn, et al.
Published: (2024)
PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents
by: Kim, Namyoung, et al.
Published: (2025)
by: Kim, Namyoung, et al.
Published: (2025)
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement
by: Kim, Hana, et al.
Published: (2024)
by: Kim, Hana, et al.
Published: (2024)
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
by: Chae, Hyungjoo, et al.
Published: (2025)
by: Chae, Hyungjoo, et al.
Published: (2025)
Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization
by: Kim, Sunghwan, et al.
Published: (2025)
by: Kim, Sunghwan, et al.
Published: (2025)
ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
by: Kwak, Beong-woo, et al.
Published: (2025)
by: Kwak, Beong-woo, et al.
Published: (2025)
AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping
by: Kim, Sunghwan, et al.
Published: (2026)
by: Kim, Sunghwan, et al.
Published: (2026)
Evaluating Robustness of Reward Models for Mathematical Reasoning
by: Kim, Sunghwan, et al.
Published: (2024)
by: Kim, Sunghwan, et al.
Published: (2024)
VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models
by: Kim, Seoyeon, et al.
Published: (2024)
by: Kim, Seoyeon, et al.
Published: (2024)
Large Language Models Are Self-Taught Reasoners: Enhancing LLM Applications via Tailored Problem-Solving Demonstrations
by: Ong, Kai Tzu-iunn, et al.
Published: (2024)
by: Ong, Kai Tzu-iunn, et al.
Published: (2024)
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
by: Chae, Hyungjoo, et al.
Published: (2024)
by: Chae, Hyungjoo, et al.
Published: (2024)
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback
by: Moon, Seungjun, et al.
Published: (2023)
by: Moon, Seungjun, et al.
Published: (2023)
Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation
by: Kim, Minju, et al.
Published: (2025)
by: Kim, Minju, et al.
Published: (2025)
Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems
by: Kim, Sunghwan, et al.
Published: (2024)
by: Kim, Sunghwan, et al.
Published: (2024)
Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering
by: Ko, Sungho, et al.
Published: (2024)
by: Ko, Sungho, et al.
Published: (2024)
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
by: Chae, Hyungjoo, et al.
Published: (2024)
by: Chae, Hyungjoo, et al.
Published: (2024)
YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion
by: Yang, Dongil, et al.
Published: (2024)
by: Yang, Dongil, et al.
Published: (2024)
CONDESION-BENCH: Conditional Decision-Making of Large Language Models in Compositional Action Space
by: Hwang, Yeonjun, et al.
Published: (2026)
by: Hwang, Yeonjun, et al.
Published: (2026)
Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History
by: Kim, Serin, et al.
Published: (2026)
by: Kim, Serin, et al.
Published: (2026)
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics
by: Lee, Seungbeen, et al.
Published: (2024)
by: Lee, Seungbeen, et al.
Published: (2024)
Towards Direct Evaluation of Harness Optimizers via Priority Ranking
by: Ong, Kai Tzu-iunn, et al.
Published: (2026)
by: Ong, Kai Tzu-iunn, et al.
Published: (2026)
Safe and Scalable Web Agent Learning via Recreated Websites
by: Chae, Hyungjoo, et al.
Published: (2026)
by: Chae, Hyungjoo, et al.
Published: (2026)
Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning
by: Seo, Yeongbin, et al.
Published: (2025)
by: Seo, Yeongbin, et al.
Published: (2025)
Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales
by: Kwon, Taeyoon, et al.
Published: (2023)
by: Kwon, Taeyoon, et al.
Published: (2023)
Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching
by: Kim, Seoyeon, et al.
Published: (2024)
by: Kim, Seoyeon, et al.
Published: (2024)
SAGEO Arena: A Realistic Environment for Evaluating Search-Augmented Generative Engine Optimization
by: Kim, Sunghwan, et al.
Published: (2026)
by: Kim, Sunghwan, et al.
Published: (2026)
COCOA: CBT-based Conversational Counseling Agent using Memory Specialized in Cognitive Distortions and Dynamic Prompt
by: Lee, Suyeon, et al.
Published: (2024)
by: Lee, Suyeon, et al.
Published: (2024)
Towards Personalized Conversational Sales Agents: Contextual User Profiling for Strategic Action
by: Kim, Tongyoung, et al.
Published: (2025)
by: Kim, Tongyoung, et al.
Published: (2025)
LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation
by: Hwangbo, Gyeom, et al.
Published: (2025)
by: Hwangbo, Gyeom, et al.
Published: (2025)
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation
by: Kang, Dongjin, et al.
Published: (2024)
by: Kang, Dongjin, et al.
Published: (2024)
Polymerization‐Induced Direct Photolithography of Quantum Dots
by: Taehyung Kim, et al.
Published: (2025)
by: Taehyung Kim, et al.
Published: (2025)
Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory
by: Lee, Suyeon, et al.
Published: (2024)
by: Lee, Suyeon, et al.
Published: (2024)
Region4Web: Rethinking Observation Space Granularity for Web Agents
by: Kwon, Donguk, et al.
Published: (2026)
by: Kwon, Donguk, et al.
Published: (2026)
Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset
by: Kim, Minjin, et al.
Published: (2024)
by: Kim, Minjin, et al.
Published: (2024)
Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning
by: Seo, Yeongbin, et al.
Published: (2024)
by: Seo, Yeongbin, et al.
Published: (2024)
Quantifying Genuine Awareness in Hallucination Prediction Beyond Question-Side Shortcuts
by: Seo, Yeongbin, et al.
Published: (2025)
by: Seo, Yeongbin, et al.
Published: (2025)
Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization
by: Seo, Kwangwook, et al.
Published: (2024)
by: Seo, Kwangwook, et al.
Published: (2024)
MVIGER: Multi-View Variational Integration of Complementary Knowledge for Generative Recommender
by: Kim, Tongyoung, et al.
Published: (2024)
by: Kim, Tongyoung, et al.
Published: (2024)
Revisiting the Uniform Information Density Hypothesis in LLM Reasoning
by: Gwak, Minju, et al.
Published: (2025)
by: Gwak, Minju, et al.
Published: (2025)
Revisiting the UID Hypothesis in LLM Reasoning Traces
by: Gwak, Minju, et al.
Published: (2025)
by: Gwak, Minju, et al.
Published: (2025)
Similar Items
-
Towards Lifelong Dialogue Agents via Timeline-based Memory Management
by: Ong, Kai Tzu-iunn, et al.
Published: (2024) -
PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents
by: Kim, Namyoung, et al.
Published: (2025) -
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement
by: Kim, Hana, et al.
Published: (2024) -
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
by: Chae, Hyungjoo, et al.
Published: (2025) -
Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization
by: Kim, Sunghwan, et al.
Published: (2025)