| Main Authors: | Oh, Minhyeon, Lee, Seungjoon, Ok, Jungseul |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.00524 |
Similar Items
Experience-based Knowledge Correction for Robust Planning in Minecraft
by: Lee, Seungjoon, et al.
Published: (2025)
Federated Variational Preference Alignment with Gumbel-Softmax Prior for Personalized User Preferences
by: Koo, Jabin, et al.
Published: (2026)
CoPL: Collaborative Preference Learning for Personalizing LLMs
by: Choi, Youngbin, et al.
Published: (2025)
Activity-Guided Industrial Anomalous Sound Detection against Interferences
by: Lee, Yunjoo, et al.
Published: (2024)
Enhancing Cost Efficiency in Active Learning with Candidate Set Query
by: Gwon, Yeho, et al.
Published: (2025)
Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
by: So, Junhyuk, et al.
Published: (2025)
EPIC: Efficient Predicate-Guided Inference-Time Control for Compositional Text-to-Image Generation
by: Mun, Sunung, et al.
Published: (2026)
Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients
by: Koo, Jabin, et al.
Published: (2024)
Self-Training Large Language Models with Confident Reasoning
by: Jang, Hyosoon, et al.
Published: (2025)
Rising Multi-Armed Bandits with Known Horizons
by: Song, Seockbean, et al.
Published: (2026)
When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models
by: Yoon, Youngsik, et al.
Published: (2026)
Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization
by: Kim, Nayeong, et al.
Published: (2024)
Optimal Clustering from Noisy Binary Feedback
by: Ariu, Kaito, et al.
Published: (2019)
Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options
by: Lee, Joongkyu, et al.
Published: (2025)
Combinatorial Rising Bandits
by: Song, Seockbean, et al.
Published: (2024)
ProCompNav: Proactive Instance Navigation with Comparative Judgment for Ambiguous User Queries
by: Kwon, Junhyuk, et al.
Published: (2026)
PaT: Planning-after-Trial for Efficient Test-Time Code Generation
by: Yoon, Youngsik, et al.
Published: (2026)
Revisiting Early Detection of Sexual Predators via Turn-level Optimization
by: An, Jinmyeong, et al.
Published: (2025)
Retrieval-Augmented Generation with Estimation of Source Reliability
by: Hwang, Jeongyeon, et al.
Published: (2024)
Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation
by: Kim, Jaechang, et al.
Published: (2024)
Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark
by: Kim, Suyeon, et al.
Published: (2025)
Combinatorial Reinforcement Learning with Preference Feedback
by: Lee, Joongkyu, et al.
Published: (2025)
MedBN: Robust Test-Time Adaptation against Malicious Test Samples
by: Park, Hyejin, et al.
Published: (2024)
Interaction-Aware Influence Functions for Group Attribution
by: Heo, Jaeseung, et al.
Published: (2026)
Influence Functions for Edge Edits in Non-Convex Graph Neural Networks
by: Heo, Jaeseung, et al.
Published: (2025)
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2025)
Active Learning for Direct Preference Optimization
by: Kveton, Branislav, et al.
Published: (2025)
Active Query Synthesis for Preference Learning
by: Nadagouda, Namrata, et al.
Published: (2026)
Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)
Preference-based Multi-Objective Reinforcement Learning
by: Mu, Ni, et al.
Published: (2025)
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization
by: Lee, Gihun, et al.
Published: (2024)
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
by: Lee, Hyeongmin, et al.
Published: (2024)
Metric Learning from Limited Pairwise Preference Comparisons
by: Wang, Zhi, et al.
Published: (2024)
Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
by: Tang, Zilu, et al.
Published: (2025)
PersonalizedRouter: Personalized LLM Routing via Graph-based User Preference Modeling
by: Dai, Zhongjie, et al.
Published: (2025)
Active Preference Learning for Ordering Items In- and Out-of-sample
by: Bergström, Herman, et al.
Published: (2024)
Offline Clustering of Preference Learning with Active-data Augmentation
by: Liu, Jingyuan, et al.
Published: (2025)
Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval
by: Chang, Hangeol, et al.
Published: (2026)
Preference Alignment with Flow Matching
by: Kim, Minu, et al.
Published: (2024)