Saved in:
| Main Authors: | Kim, Doyoung, Lee, Youngjun, Kim, Joeun, Bang, Jihwan, Song, Hwanjun, Yoon, Susik, Lee, Jae-Gil |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.06552 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning
by: Kim, Doyoung, et al.
Published: (2023)
by: Kim, Doyoung, et al.
Published: (2023)
SimPO: Simple Preference Optimization with a Reference-Free Reward
by: Meng, Yu, et al.
Published: (2024)
by: Meng, Yu, et al.
Published: (2024)
Back to the Future: Look-ahead Augmentation and Parallel Self-Refinement for Time Series Forecasting
by: Kim, Sunho, et al.
Published: (2026)
by: Kim, Sunho, et al.
Published: (2026)
Understanding Reference Policies in Direct Preference Optimization
by: Liu, Yixin, et al.
Published: (2024)
by: Liu, Yixin, et al.
Published: (2024)
Multi-Reference Preference Optimization for Large Language Models
by: Le, Hung, et al.
Published: (2024)
by: Le, Hung, et al.
Published: (2024)
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
by: Yun, Taewon, et al.
Published: (2025)
by: Yun, Taewon, et al.
Published: (2025)
Towards Verifiable Text Generation with Symbolic References
by: Hennigen, Lucas Torroba, et al.
Published: (2023)
by: Hennigen, Lucas Torroba, et al.
Published: (2023)
VPO: Leveraging the Number of Votes in Preference Optimization
by: Cho, Jae Hyeon, et al.
Published: (2024)
by: Cho, Jae Hyeon, et al.
Published: (2024)
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models
by: Jung, Sunghee, et al.
Published: (2025)
by: Jung, Sunghee, et al.
Published: (2025)
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
by: Kim, Dongyoung, et al.
Published: (2024)
by: Kim, Dongyoung, et al.
Published: (2024)
Let Multimodal Embedders Learn When to Augment Query via Adaptive Query Augmentation
by: Kim, Wongyu, et al.
Published: (2025)
by: Kim, Wongyu, et al.
Published: (2025)
Text Change Detection in Multilingual Documents Using Image Comparison
by: Park, Doyoung, et al.
Published: (2024)
by: Park, Doyoung, et al.
Published: (2024)
Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones
by: Zhmoginov, Andrey, et al.
Published: (2025)
by: Zhmoginov, Andrey, et al.
Published: (2025)
Reference-based Metrics Disprove Themselves in Question Generation
by: Nguyen, Bang, et al.
Published: (2024)
by: Nguyen, Bang, et al.
Published: (2024)
Segment-driven Structural Induction and Semantic Alignment for Heterogeneous Tabular Representation
by: Jung, Woojun, et al.
Published: (2026)
by: Jung, Woojun, et al.
Published: (2026)
Conversational Query Reformulation with the Guidance of Retrieved Documents
by: Park, Jeonghyun, et al.
Published: (2024)
by: Park, Jeonghyun, et al.
Published: (2024)
ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization
by: Yoon, Hee Suk, et al.
Published: (2025)
by: Yoon, Hee Suk, et al.
Published: (2025)
Preference Alignment with Flow Matching
by: Kim, Minu, et al.
Published: (2024)
by: Kim, Minu, et al.
Published: (2024)
Universal Time-Series Representation Learning: A Survey
by: Trirat, Patara, et al.
Published: (2024)
by: Trirat, Patara, et al.
Published: (2024)
Multi-level Diagnosis and Evaluation for Robust Tabular Feature Engineering with Large Language Models
by: Lim, Yebin, et al.
Published: (2025)
by: Lim, Yebin, et al.
Published: (2025)
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization
by: Lee, Gihun, et al.
Published: (2024)
by: Lee, Gihun, et al.
Published: (2024)
FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients
by: Yu, Hongyeon, et al.
Published: (2026)
by: Yu, Hongyeon, et al.
Published: (2026)
LLM-based User Profile Management for Recommender System
by: Bang, Seunghwan, et al.
Published: (2025)
by: Bang, Seunghwan, et al.
Published: (2025)
Contextually Guided Transformers via Low-Rank Adaptation
by: Zhmoginov, Andrey, et al.
Published: (2025)
by: Zhmoginov, Andrey, et al.
Published: (2025)
Case-Based Reasoning Approach for Solving Financial Question Answering
by: Kim, Yikyung, et al.
Published: (2024)
by: Kim, Yikyung, et al.
Published: (2024)
MAGE: All-[MASK] Block Already Knows Where to Look in Diffusion LLM
by: Kwon, Omin, et al.
Published: (2026)
by: Kwon, Omin, et al.
Published: (2026)
LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning
by: Park, Juneyoung, et al.
Published: (2026)
by: Park, Juneyoung, et al.
Published: (2026)
Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study
by: Lim, Junghwan, et al.
Published: (2025)
by: Lim, Junghwan, et al.
Published: (2025)
EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models
by: Lee, Che Hyun, et al.
Published: (2025)
by: Lee, Che Hyun, et al.
Published: (2025)
FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration
by: Jo, Dongwon, et al.
Published: (2025)
by: Jo, Dongwon, et al.
Published: (2025)
REFA: Reference Free Alignment for multi-preference optimization
by: Gupta, Taneesh, et al.
Published: (2024)
by: Gupta, Taneesh, et al.
Published: (2024)
From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG
by: Lee, Changmin, et al.
Published: (2026)
by: Lee, Changmin, et al.
Published: (2026)
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
by: Song, Jiwon, et al.
Published: (2024)
by: Song, Jiwon, et al.
Published: (2024)
Continuous-Time Linear Positional Embedding for Irregular Time Series Forecasting
by: Kim, Byunghyun, et al.
Published: (2024)
by: Kim, Byunghyun, et al.
Published: (2024)
Query-Conditioned Test-Time Self-Training for Large Language Models
by: Song, Chaehee, et al.
Published: (2026)
by: Song, Chaehee, et al.
Published: (2026)
Reference-Free Rating of LLM Responses via Latent Information
by: Girrbach, Leander, et al.
Published: (2025)
by: Girrbach, Leander, et al.
Published: (2025)
MEC: Machine-Learning-Assisted Generalized Entropy Calibration for Semi-Supervised Mean Estimation
by: Lee, Se Yoon, et al.
Published: (2026)
by: Lee, Se Yoon, et al.
Published: (2026)
Multi-Response Preference Optimization with Augmented Ranking Dataset
by: Gwon, Hansle, et al.
Published: (2024)
by: Gwon, Hansle, et al.
Published: (2024)
Detecting Token-Level Hallucinations Using Variance Signals: A Reference-Free Approach
by: Kumar, Keshav
Published: (2025)
by: Kumar, Keshav
Published: (2025)
DP-Muon: Differentially Private Optimization via Matrix-Orthogonalized Momentum
by: Kim, Jihwan, et al.
Published: (2026)
by: Kim, Jihwan, et al.
Published: (2026)
Similar Items
-
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning
by: Kim, Doyoung, et al.
Published: (2023) -
SimPO: Simple Preference Optimization with a Reference-Free Reward
by: Meng, Yu, et al.
Published: (2024) -
Back to the Future: Look-ahead Augmentation and Parallel Self-Refinement for Time Series Forecasting
by: Kim, Sunho, et al.
Published: (2026) -
Understanding Reference Policies in Direct Preference Optimization
by: Liu, Yixin, et al.
Published: (2024) -
Multi-Reference Preference Optimization for Large Language Models
by: Le, Hung, et al.
Published: (2024)