:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Oh, Minhyeon, Lee, Seungjoon, Ok, Jungseul
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2411.00524
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Experience-based Knowledge Correction for Robust Planning in Minecraft
by: Lee, Seungjoon, et al.
Published: (2025)

Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences
by: Koo, Jabin, et al.
Published: (2026)

CoPL: Collaborative Preference Learning for Personalizing LLMs
by: Choi, Youngbin, et al.
Published: (2025)

Activity-Guided Industrial Anomalous Sound Detection against Interferences
by: Lee, Yunjoo, et al.
Published: (2024)

Enhancing Cost Efficiency in Active Learning with Candidate Set Query
by: Gwon, Yeho, et al.
Published: (2025)

Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
by: So, Junhyuk, et al.
Published: (2025)

EPIC: Efficient Predicate-Guided Inference-Time Control for Compositional Text-to-Image Generation
by: Mun, Sunung, et al.
Published: (2026)

Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients
by: Koo, Jabin, et al.
Published: (2024)

Self-Training Large Language Models with Confident Reasoning
by: Jang, Hyosoon, et al.
Published: (2025)

Rising Multi-Armed Bandits with Known Horizons
by: Song, Seockbean, et al.
Published: (2026)

When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models
by: Yoon, Youngsik, et al.
Published: (2026)

Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization
by: Kim, Nayeong, et al.
Published: (2024)

Optimal Clustering from Noisy Binary Feedback
by: Ariu, Kaito, et al.
Published: (2019)

Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options
by: Lee, Joongkyu, et al.
Published: (2025)

Combinatorial Rising Bandits
by: Song, Seockbean, et al.
Published: (2024)

ProCompNav: Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries
by: Kwon, Junhyuk, et al.
Published: (2026)

PaT: Planning-after-Trial for Efficient Test-Time Code Generation
by: Yoon, Youngsik, et al.
Published: (2026)

Revisiting Early Detection of Sexual Predators via Turn-level Optimization
by: An, Jinmyeong, et al.
Published: (2025)

Retrieval-Augmented Generation with Estimation of Source Reliability
by: Hwang, Jeongyeon, et al.
Published: (2024)

Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation
by: Kim, Jaechang, et al.
Published: (2024)

Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark
by: Kim, Suyeon, et al.
Published: (2025)

Combinatorial Reinforcement Learning with Preference Feedback
by: Lee, Joongkyu, et al.
Published: (2025)

MedBN: Robust Test-Time Adaptation against Malicious Test Samples
by: Park, Hyejin, et al.
Published: (2024)

Interaction-Aware Influence Functions for Group Attribution
by: Heo, Jaeseung, et al.
Published: (2026)

Influence Functions for Edge Edits in Non-Convex Graph Neural Networks
by: Heo, Jaeseung, et al.
Published: (2025)

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2025)

Active Learning for Direct Preference Optimization
by: Kveton, Branislav, et al.
Published: (2025)

Active Query Synthesis for Preference Learning
by: Nadagouda, Namrata, et al.
Published: (2026)

Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)

Preference-based Multi-Objective Reinforcement Learning
by: Mu, Ni, et al.
Published: (2025)

BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization
by: Lee, Gihun, et al.
Published: (2024)

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)

CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
by: Lee, Hyeongmin, et al.
Published: (2024)

Metric Learning from Limited Pairwise Preference Comparisons
by: Wang, Zhi, et al.
Published: (2024)

Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
by: Tang, Zilu, et al.
Published: (2025)

PersonalizedRouter: Personalized LLM Routing via Graph-based User Preference Modeling
by: Dai, Zhongjie, et al.
Published: (2025)

Active Preference Learning for Ordering Items In- and Out-of-sample
by: Bergström, Herman, et al.
Published: (2024)

Offline Clustering of Preference Learning with Active-data Augmentation
by: Liu, Jingyuan, et al.
Published: (2025)

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval
by: Chang, Hangeol, et al.
Published: (2026)

Preference Alignment with Flow Matching
by: Kim, Minu, et al.
Published: (2024)