Saved in:
| Main Authors: | Choi, Minseok, Kim, Dongjin, Yang, Seungbin, Kim, Subin, Kwak, Youngjun, Oh, Juyoung, Choo, Jaegul, Son, Jungmin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.02588 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios
by: Lee, Yunseung, et al.
Published: (2026)
by: Lee, Yunseung, et al.
Published: (2026)
LiveWeb-IE: A Benchmark For Online Web Information Extraction
by: Yang, Seungbin, et al.
Published: (2026)
by: Yang, Seungbin, et al.
Published: (2026)
Retrieve Only Relevant Tables Whether Few or Many: Adaptive Table Retrieval Method
by: Kim, Taehee, et al.
Published: (2026)
by: Kim, Taehee, et al.
Published: (2026)
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
by: Yang, Seungbin, et al.
Published: (2024)
by: Yang, Seungbin, et al.
Published: (2024)
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
by: Choi, Minseok, et al.
Published: (2024)
by: Choi, Minseok, et al.
Published: (2024)
Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport
by: Choi, Minseok, et al.
Published: (2024)
by: Choi, Minseok, et al.
Published: (2024)
Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
by: Lee, Dohyun, et al.
Published: (2024)
by: Lee, Dohyun, et al.
Published: (2024)
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
by: Choi, Minseok, et al.
Published: (2024)
by: Choi, Minseok, et al.
Published: (2024)
PairEval: Open-domain Dialogue Evaluation with Pairwise Comparison
by: Park, ChaeHun, et al.
Published: (2024)
by: Park, ChaeHun, et al.
Published: (2024)
FENCE: A Financial and Multimodal Jailbreak Detection Dataset
by: Kim, Mirae, et al.
Published: (2026)
by: Kim, Mirae, et al.
Published: (2026)
SEDD: Scalable and Efficient Dataset Deduplication with GPUs
by: Son, Youngjun, et al.
Published: (2025)
by: Son, Youngjun, et al.
Published: (2025)
BingoGuard: LLM Content Moderation Tools with Risk Levels
by: Yin, Fan, et al.
Published: (2025)
by: Yin, Fan, et al.
Published: (2025)
TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation
by: Kim, Min-Jung, et al.
Published: (2025)
by: Kim, Min-Jung, et al.
Published: (2025)
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
by: Kim, Eunsu, et al.
Published: (2024)
by: Kim, Eunsu, et al.
Published: (2024)
On Calibration of LLM-based Guard Models for Reliable Content Moderation
by: Liu, Hongfu, et al.
Published: (2024)
by: Liu, Hongfu, et al.
Published: (2024)
When Confidence Misleads: Suffix Anchoring and Anchor-Proximity Confidence Modulation for Diffusion Language Models
by: Park, Jungwon, et al.
Published: (2026)
by: Park, Jungwon, et al.
Published: (2026)
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation
by: Choi, Juhwan, et al.
Published: (2024)
by: Choi, Juhwan, et al.
Published: (2024)
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
by: Jha, Prince, et al.
Published: (2024)
by: Jha, Prince, et al.
Published: (2024)
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
by: Cho, Hojun, et al.
Published: (2025)
by: Cho, Hojun, et al.
Published: (2025)
Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation
by: Wróbel, Krzysztof, et al.
Published: (2026)
by: Wróbel, Krzysztof, et al.
Published: (2026)
The Comparative Trap: Pairwise Comparisons Amplifies Biased Preferences of LLM Evaluators
by: Jeong, Hawon, et al.
Published: (2024)
by: Jeong, Hawon, et al.
Published: (2024)
STAND-Guard: A Small Task-Adaptive Content Moderation Model
by: Wang, Minjia, et al.
Published: (2024)
by: Wang, Minjia, et al.
Published: (2024)
Aligning Extraction and Generation for Robust Retrieval-Augmented Generation
by: Song, Hwanjun, et al.
Published: (2025)
by: Song, Hwanjun, et al.
Published: (2025)
Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair
by: Park, Jeonghoon, et al.
Published: (2024)
by: Park, Jeonghoon, et al.
Published: (2024)
Federated Learning for Face Recognition via Intra-subject Self-supervised Learning
by: Kim, Hansol, et al.
Published: (2024)
by: Kim, Hansol, et al.
Published: (2024)
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
by: Kim, Eunsu, et al.
Published: (2024)
by: Kim, Eunsu, et al.
Published: (2024)
Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
by: Yun, Jungmin, et al.
Published: (2024)
by: Yun, Jungmin, et al.
Published: (2024)
Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)
by: Park, ChaeHun, et al.
Published: (2024)
Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models
by: Park, Cheonbok, et al.
Published: (2025)
by: Park, Cheonbok, et al.
Published: (2025)
Scaling Up LLM Reviews for Google Ads Content Moderation
by: Qiao, Wei, et al.
Published: (2024)
by: Qiao, Wei, et al.
Published: (2024)
ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
by: Kwak, Beong-woo, et al.
Published: (2025)
by: Kwak, Beong-woo, et al.
Published: (2025)
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion
by: Jin, Hoiyeong, et al.
Published: (2025)
by: Jin, Hoiyeong, et al.
Published: (2025)
Exploring In-context Example Generation for Machine Translation
by: Lee, Dohyun, et al.
Published: (2025)
by: Lee, Dohyun, et al.
Published: (2025)
LaDiMo: Layer-wise Distillation Inspired MoEfier
by: Kim, Sungyoon, et al.
Published: (2024)
by: Kim, Sungyoon, et al.
Published: (2024)
Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
by: Yu, Seunguk, et al.
Published: (2025)
by: Yu, Seunguk, et al.
Published: (2025)
Efficient Terminology Integration for LLM-based Translation in Specialized Domains
by: Kim, Sejoon, et al.
Published: (2024)
by: Kim, Sejoon, et al.
Published: (2024)
Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
by: Kim, Jinhee, et al.
Published: (2024)
by: Kim, Jinhee, et al.
Published: (2024)
VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model
by: Choi, Junhyuk, et al.
Published: (2025)
by: Choi, Junhyuk, et al.
Published: (2025)
Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning
by: Cho, Wonwoo, et al.
Published: (2024)
by: Cho, Wonwoo, et al.
Published: (2024)
Single Ground Truth Is Not Enough: Adding Flexibility to Aspect-Based Sentiment Analysis Evaluation
by: Yang, Soyoung, et al.
Published: (2024)
by: Yang, Soyoung, et al.
Published: (2024)
Similar Items
-
BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios
by: Lee, Yunseung, et al.
Published: (2026) -
LiveWeb-IE: A Benchmark For Online Web Information Extraction
by: Yang, Seungbin, et al.
Published: (2026) -
Retrieve Only Relevant Tables Whether Few or Many: Adaptive Table Retrieval Method
by: Kim, Taehee, et al.
Published: (2026) -
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
by: Yang, Seungbin, et al.
Published: (2024) -
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
by: Choi, Minseok, et al.
Published: (2024)