Saved in:
| Main Authors: | Choi, Junhyuk, Park, Sohhyung, Cho, Chanhee, Park, Hyeonchu, Kim, Bugeun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00521 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pay What LLM Wants: Can LLM Simulate Economics Experiment with 522 Real-human Persona?
by: Choi, Junhyuk, et al.
Published: (2025)
by: Choi, Junhyuk, et al.
Published: (2025)
DART: An AIGT Detector using AMR of Rephrased Text
by: Park, Hyeonchu, et al.
Published: (2024)
by: Park, Hyeonchu, et al.
Published: (2024)
People will agree what I think: Investigating LLM's False Consensus Effect
by: Choi, Junhyuk, et al.
Published: (2024)
by: Choi, Junhyuk, et al.
Published: (2024)
A Stereotype Content Analysis on Color-related Social Bias in Large Vision Language Models
by: Choi, Junhyuk, et al.
Published: (2025)
by: Choi, Junhyuk, et al.
Published: (2025)
Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework
by: Choi, Junhyuk, et al.
Published: (2026)
by: Choi, Junhyuk, et al.
Published: (2026)
PHISH in MESH: Korean Adversarial Phonetic Substitution and Phonetic-Semantic Feature Integration Defense
by: Kim, Byungjun, et al.
Published: (2025)
by: Kim, Byungjun, et al.
Published: (2025)
Acoustic-based Gender Differentiation in Speech-aware Language Models
by: Choi, Junhyuk, et al.
Published: (2025)
by: Choi, Junhyuk, et al.
Published: (2025)
KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models
by: Kim, Dongjun, et al.
Published: (2025)
by: Kim, Dongjun, et al.
Published: (2025)
SPAM: Style Prompt Adherence Metric for Prompt-based TTS
by: Cho, Chanhee, et al.
Published: (2026)
by: Cho, Chanhee, et al.
Published: (2026)
MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation
by: Park, Chanhee, et al.
Published: (2025)
by: Park, Chanhee, et al.
Published: (2025)
Examining Identity Drift in Conversations of LLM Agents
by: Choi, Junhyuk, et al.
Published: (2024)
by: Choi, Junhyuk, et al.
Published: (2024)
Diagnosing LLM Judge Reliability: Conformal Prediction Sets and Transitivity Violations
by: Gupta, Manan, et al.
Published: (2026)
by: Gupta, Manan, et al.
Published: (2026)
Learning Compact Representations of LLM Abilities via Item Response Theory
by: Chen, Jianhao, et al.
Published: (2025)
by: Chen, Jianhao, et al.
Published: (2025)
Judge Reliability Harness: Stress Testing the Reliability of LLM Judges
by: Dev, Sunishchal, et al.
Published: (2026)
by: Dev, Sunishchal, et al.
Published: (2026)
Token-Efficient Item Representation via Images for LLM Recommender Systems
by: Kim, Kibum, et al.
Published: (2025)
by: Kim, Kibum, et al.
Published: (2025)
Beyond Task-Oriented and Chitchat Dialogues: Proactive and Transition-Aware Conversational Agents
by: Yoon, Yejin, et al.
Published: (2025)
by: Yoon, Yejin, et al.
Published: (2025)
IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory
by: Song, Wei, et al.
Published: (2025)
by: Song, Wei, et al.
Published: (2025)
Analysis of Utterance Embeddings and Clustering Methods Related to Intent Induction for Task-Oriented Dialogue
by: Park, Jeiyoon, et al.
Published: (2022)
by: Park, Jeiyoon, et al.
Published: (2022)
ARGORA: Orchestrated Argumentation for Causally Grounded LLM Reasoning and Decision Making
by: Jin, Youngjin, et al.
Published: (2026)
by: Jin, Youngjin, et al.
Published: (2026)
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
by: So, Junhyuk, et al.
Published: (2023)
by: So, Junhyuk, et al.
Published: (2023)
LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
by: Cho, Jaehong, et al.
Published: (2025)
by: Cho, Jaehong, et al.
Published: (2025)
Fine-Grained and Thematic Evaluation of LLMs in Social Deduction Game
by: Kim, Byungjun, et al.
Published: (2024)
by: Kim, Byungjun, et al.
Published: (2024)
Leveraging Large Language Models for Active Merchant Non-player Characters
by: Kim, Byungjun, et al.
Published: (2024)
by: Kim, Byungjun, et al.
Published: (2024)
Judgment-of-Thought Prompting: A Courtroom-Inspired Framework for Binary Logical Reasoning with Large Language Models
by: Park, Sungjune, et al.
Published: (2024)
by: Park, Sungjune, et al.
Published: (2024)
JudgeFlow: Agentic Workflow Optimization via Block Judge
by: Ma, Zihan, et al.
Published: (2026)
by: Ma, Zihan, et al.
Published: (2026)
Can LLMs and humans be friends? Uncovering factors affecting human-AI intimacy formation
by: Hong, Yeseon, et al.
Published: (2025)
by: Hong, Yeseon, et al.
Published: (2025)
Towards Trustworthy LLM-Based Recommendation via Rationale Integration
by: Park, Chung, et al.
Published: (2025)
by: Park, Chung, et al.
Published: (2025)
Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation
by: Yoon, Jeongho, et al.
Published: (2026)
by: Yoon, Jeongho, et al.
Published: (2026)
Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation
by: Chun, Yongchan, et al.
Published: (2026)
by: Chun, Yongchan, et al.
Published: (2026)
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
by: Cho, Jaehong, et al.
Published: (2024)
by: Cho, Jaehong, et al.
Published: (2024)
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
by: Kim, Sangyeop, et al.
Published: (2025)
by: Kim, Sangyeop, et al.
Published: (2025)
VorTEX: Various overlap ratio for Target speech EXtraction
by: Oh, Ro-hoon, et al.
Published: (2026)
by: Oh, Ro-hoon, et al.
Published: (2026)
Self-HarmLLM: Can Large Language Model Harm Itself?
by: Kim, Heehwan, et al.
Published: (2025)
by: Kim, Heehwan, et al.
Published: (2025)
AIS-LLM: A Unified Framework for Maritime Trajectory Prediction, Anomaly Detection, and Collision Risk Assessment with Explainable Forecasting
by: Park, Hyobin, et al.
Published: (2025)
by: Park, Hyobin, et al.
Published: (2025)
VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension
by: Park, Hyejin, et al.
Published: (2026)
by: Park, Hyejin, et al.
Published: (2026)
Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
by: Lee, Gisang, et al.
Published: (2024)
by: Lee, Gisang, et al.
Published: (2024)
LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
by: Cho, Jaehong, et al.
Published: (2026)
by: Cho, Jaehong, et al.
Published: (2026)
MMTB: Evaluating Terminal Agents on Multimedia-File Tasks
by: Heo, Chiyeong, et al.
Published: (2026)
by: Heo, Chiyeong, et al.
Published: (2026)
TRUEBench: Can LLM Response Meet Real-world Constraints as Productivity Assistant?
by: Park, Jiho, et al.
Published: (2025)
by: Park, Jiho, et al.
Published: (2025)
PsyProbe: Proactive and Interpretable Dialogue through User State Modeling for Exploratory Counseling
by: Park, Sohhyung, et al.
Published: (2026)
by: Park, Sohhyung, et al.
Published: (2026)
Similar Items
-
Pay What LLM Wants: Can LLM Simulate Economics Experiment with 522 Real-human Persona?
by: Choi, Junhyuk, et al.
Published: (2025) -
DART: An AIGT Detector using AMR of Rephrased Text
by: Park, Hyeonchu, et al.
Published: (2024) -
People will agree what I think: Investigating LLM's False Consensus Effect
by: Choi, Junhyuk, et al.
Published: (2024) -
A Stereotype Content Analysis on Color-related Social Bias in Large Vision Language Models
by: Choi, Junhyuk, et al.
Published: (2025) -
Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework
by: Choi, Junhyuk, et al.
Published: (2026)