:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Choi, Minseok, Kim, Dongjin, Yang, Seungbin, Kim, Subin, Kwak, Youngjun, Oh, Juyoung, Choo, Jaegul, Son, Jungmin
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2603.02588
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios
by: Lee, Yunseung, et al.
Published: (2026)

LiveWeb-IE: A Benchmark For Online Web Information Extraction
by: Yang, Seungbin, et al.
Published: (2026)

Retrieve Only Relevant Tables Whether Few or Many: Adaptive Table Retrieval Method
by: Kim, Taehee, et al.
Published: (2026)

Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
by: Yang, Seungbin, et al.
Published: (2024)

Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
by: Choi, Minseok, et al.
Published: (2024)

Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport
by: Choi, Minseok, et al.
Published: (2024)

Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
by: Lee, Dohyun, et al.
Published: (2024)

Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
by: Choi, Minseok, et al.
Published: (2024)

PairEval: Open-domain Dialogue Evaluation with Pairwise Comparison
by: Park, ChaeHun, et al.
Published: (2024)

FENCE: A Financial and Multimodal Jailbreak Detection Dataset
by: Kim, Mirae, et al.
Published: (2026)

SEDD: Scalable and Efficient Dataset Deduplication with GPUs
by: Son, Youngjun, et al.
Published: (2025)

BingoGuard: LLM Content Moderation Tools with Risk Levels
by: Yin, Fan, et al.
Published: (2025)

TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation
by: Kim, Min-Jung, et al.
Published: (2025)

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
by: Kim, Eunsu, et al.
Published: (2024)

On Calibration of LLM-based Guard Models for Reliable Content Moderation
by: Liu, Hongfu, et al.
Published: (2024)

When Confidence Misleads: Suffix Anchoring and Anchor-Proximity Confidence Modulation for Diffusion Language Models
by: Park, Jungwon, et al.
Published: (2026)

Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation
by: Choi, Juhwan, et al.
Published: (2024)

MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
by: Jha, Prince, et al.
Published: (2024)

Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
by: Cho, Hojun, et al.
Published: (2025)

Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation
by: Wróbel, Krzysztof, et al.
Published: (2026)

The Comparative Trap: Pairwise Comparisons Amplifies Biased Preferences of LLM Evaluators
by: Jeong, Hawon, et al.
Published: (2024)

STAND-Guard: A Small Task-Adaptive Content Moderation Model
by: Wang, Minjia, et al.
Published: (2024)

Aligning Extraction and Generation for Robust Retrieval-Augmented Generation
by: Song, Hwanjun, et al.
Published: (2025)

Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair
by: Park, Jeonghoon, et al.
Published: (2024)

Federated Learning for Face Recognition via Intra-subject Self-supervised Learning
by: Kim, Hansol, et al.
Published: (2024)

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
by: Kim, Eunsu, et al.
Published: (2024)

Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
by: Yun, Jungmin, et al.
Published: (2024)

Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)

Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models
by: Park, Cheonbok, et al.
Published: (2025)

Scaling Up LLM Reviews for Google Ads Content Moderation
by: Qiao, Wei, et al.
Published: (2024)

ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
by: Kwak, Beong-woo, et al.
Published: (2025)

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion
by: Jin, Hoiyeong, et al.
Published: (2025)

Exploring In-context Example Generation for Machine Translation
by: Lee, Dohyun, et al.
Published: (2025)

LaDiMo: Layer-wise Distillation Inspired MoEfier
by: Kim, Sungyoon, et al.
Published: (2024)

Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
by: Yu, Seunguk, et al.
Published: (2025)

Efficient Terminology Integration for LLM-based Translation in Specialized Domains
by: Kim, Sejoon, et al.
Published: (2024)

Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
by: Kim, Jinhee, et al.
Published: (2024)

VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model
by: Choi, Junhyuk, et al.
Published: (2025)

Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning
by: Cho, Wonwoo, et al.
Published: (2024)

Single Ground Truth Is Not Enough: Adding Flexibility to Aspect-Based Sentiment Analysis Evaluation
by: Yang, Soyoung, et al.
Published: (2024)