:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Choi, Junhyuk, Park, Sohhyung, Cho, Chanhee, Park, Hyeonchu, Kim, Bugeun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.00521
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Pay What LLM Wants: Can LLM Simulate Economics Experiment with 522 Real-human Persona?
by: Choi, Junhyuk, et al.
Published: (2025)

DART: An AIGT Detector using AMR of Rephrased Text
by: Park, Hyeonchu, et al.
Published: (2024)

People will agree what I think: Investigating LLM's False Consensus Effect
by: Choi, Junhyuk, et al.
Published: (2024)

A Stereotype Content Analysis on Color-related Social Bias in Large Vision Language Models
by: Choi, Junhyuk, et al.
Published: (2025)

Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework
by: Choi, Junhyuk, et al.
Published: (2026)

PHISH in MESH: Korean Adversarial Phonetic Substitution and Phonetic-Semantic Feature Integration Defense
by: Kim, Byungjun, et al.
Published: (2025)

Acoustic-based Gender Differentiation in Speech-aware Language Models
by: Choi, Junhyuk, et al.
Published: (2025)

KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models
by: Kim, Dongjun, et al.
Published: (2025)

SPAM: Style Prompt Adherence Metric for Prompt-based TTS
by: Cho, Chanhee, et al.
Published: (2026)

MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation
by: Park, Chanhee, et al.
Published: (2025)

Examining Identity Drift in Conversations of LLM Agents
by: Choi, Junhyuk, et al.
Published: (2024)

Diagnosing LLM Judge Reliability: Conformal Prediction Sets and Transitivity Violations
by: Gupta, Manan, et al.
Published: (2026)

Learning Compact Representations of LLM Abilities via Item Response Theory
by: Chen, Jianhao, et al.
Published: (2025)

Judge Reliability Harness: Stress Testing the Reliability of LLM Judges
by: Dev, Sunishchal, et al.
Published: (2026)

Token-Efficient Item Representation via Images for LLM Recommender Systems
by: Kim, Kibum, et al.
Published: (2025)

Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents
by: Yoon, Yejin, et al.
Published: (2025)

IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory
by: Song, Wei, et al.
Published: (2025)

Analysis of Utterance Embeddings and Clustering Methods Related to Intent Induction for Task-Oriented Dialogue
by: Park, Jeiyoon, et al.
Published: (2022)

ARGORA: Orchestrated Argumentation for Causally Grounded LLM Reasoning and Decision Making
by: Jin, Youngjin, et al.
Published: (2026)

FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
by: So, Junhyuk, et al.
Published: (2023)

LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
by: Cho, Jaehong, et al.
Published: (2025)

Fine-Grained and Thematic Evaluation of LLMs in Social Deduction Game
by: Kim, Byungjun, et al.
Published: (2024)

Leveraging Large Language Models for Active Merchant Non-player Characters
by: Kim, Byungjun, et al.
Published: (2024)

Judgment-of-Thought Prompting: A Courtroom-Inspired Framework for Binary Logical Reasoning with Large Language Models
by: Park, Sungjune, et al.
Published: (2024)

JudgeFlow: Agentic Workflow Optimization via Block Judge
by: Ma, Zihan, et al.
Published: (2026)

Can LLMs and humans be friends? Uncovering factors affecting human-AI intimacy formation
by: Hong, Yeseon, et al.
Published: (2025)

Towards Trustworthy LLM-Based Recommendation via Rationale Integration
by: Park, Chung, et al.
Published: (2025)

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation
by: Yoon, Jeongho, et al.
Published: (2026)

Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation
by: Chun, Yongchan, et al.
Published: (2026)

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
by: Cho, Jaehong, et al.
Published: (2024)

LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
by: Kim, Sangyeop, et al.
Published: (2025)

VorTEX: Various overlap ratio for Target speech EXtraction
by: Oh, Ro-hoon, et al.
Published: (2026)

Self-HarmLLM: Can Large Language Model Harm Itself?
by: Kim, Heehwan, et al.
Published: (2025)

AIS-LLM: A Unified Framework for Maritime Trajectory Prediction, Anomaly Detection, and Collision Risk Assessment with Explainable Forecasting
by: Park, Hyobin, et al.
Published: (2025)

VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension
by: Park, Hyejin, et al.
Published: (2026)

Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
by: Lee, Gisang, et al.
Published: (2024)

LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
by: Cho, Jaehong, et al.
Published: (2026)

MMTB: Evaluating Terminal Agents on Multimedia-File Tasks
by: Heo, Chiyeong, et al.
Published: (2026)

TRUEBench: Can LLM Response Meet Real-world Constraints as Productivity Assistant?
by: Park, Jiho, et al.
Published: (2025)

PsyProbe: Proactive and Interpretable Dialogue through User State Modeling for Exploratory Counseling
by: Park, Sohhyung, et al.
Published: (2026)