| Main Authors: | Pang, Jinlong, Zhu, Zhaowei, Di, Na, Zhang, Yichi, Wang, Yaxuan, Qian, Chen, Liu, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00954 |
Similar Items
Evaluating LLM-Contaminated Crowdsourcing Data Without Ground Truth
by: Zhang, Yichi, et al.
Published: (2025)
Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
by: Pang, Jinlong, et al.
Published: (2025)
Improving Data Efficiency via Curating LLM-Driven Rating Systems
by: Pang, Jinlong, et al.
Published: (2024)
Fairness Without Harm: An Influence-Guided Active Sampling Approach
by: Pang, Jinlong, et al.
Published: (2024)
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
by: Devine, Peter
Published: (2024)
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin
by: Yi, Hao, et al.
Published: (2025)
Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models
by: Zhu, Zhaowei, et al.
Published: (2023)
RLPO: Residual Listwise Preference Optimization for Long-Context Review Ranking
by: Jiang, Hao, et al.
Published: (2026)
Incentivizing High-quality Participation From Federated Learning Agents
by: Pang, Jinlong, et al.
Published: (2025)
LLM Unlearning via Loss Adjustment with Only Forget Data
by: Wang, Yaxuan, et al.
Published: (2024)
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs
by: Khalifa, Muhammad, et al.
Published: (2024)
Conditioning Matters: Training Diffusion Policies is Faster Than You Think
by: Dong, Zibin, et al.
Published: (2025)
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
by: Zhang, Yichi, et al.
Published: (2023)
Transcendence: Generative Models Can Outperform The Experts That Train Them
by: Zhang, Edwin, et al.
Published: (2024)
Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization
by: Dong, Zhijin
Published: (2025)
Larger or Smaller Reward Margins to Select Preferences for Alignment?
by: Huang, Kexin, et al.
Published: (2025)
Towards Understanding the Influence of Reward Margin on Preference Model Performance
by: Qin, Bowen, et al.
Published: (2024)
Sponsored Questions and How to Auction Them
by: Bhawalkar, Kshipra, et al.
Published: (2025)
Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond
by: Liu, Minghao, et al.
Published: (2024)
Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
by: Feng, Duanyu, et al.
Published: (2024)
Adaptive Margin RLHF via Preference over Preferences
by: Chittepu, Yaswanth, et al.
Published: (2025)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimization
by: Xu, Ruijie, et al.
Published: (2024)
Training on the Benchmark Is Not All You Need
by: Ni, Shiwen, et al.
Published: (2024)
CHILL at SemEval-2025 Task 2: You Can't Just Throw Entities and Hope -- Make Your LLM to Get Them Right
by: Lee, Jaebok, et al.
Published: (2025)
Using Large Language Models to Assess Teachers' Pedagogical Content Knowledge
by: Yang, Yaxuan, et al.
Published: (2025)
DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning
by: Wang, Yaxuan, et al.
Published: (2025)
Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning
by: Li, Yian, et al.
Published: (2024)
Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop
by: Wang, Yaxuan, et al.
Published: (2026)
Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora
by: Lee, JoonHo, et al.
Published: (2024)
LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks
by: Kong, Fei, et al.
Published: (2025)
$ξ$-DPO: Direct Preference Optimization via Ratio Reward Margin
by: Fan, Zhengyuan, et al.
Published: (2026)
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning
by: Chen, Sirui, et al.
Published: (2023)
POST: Prior-Observation Adversarial Learning of Spatio-Temporal Associations for Multivariate Time Series Anomaly Detection
by: Zhang, Suofei, et al.
Published: (2026)
What LLMs Think When You Don't Tell Them What to Think About?
by: Kwon, Yongchan, et al.
Published: (2026)
Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries
by: Chen, Luoxin, et al.
Published: (2026)
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts
by: Cao, Xianwei, et al.
Published: (2026)
You Only Forward Once: An Efficient Compositional Judging Paradigm
by: Zhang, Tianlong, et al.
Published: (2025)
Large Language Model Unlearning via Embedding-Corrupted Prompts
by: Liu, Chris Yuhao, et al.
Published: (2024)
Search Still Matters: Information Retrieval in the Era of Generative AI
by: Hersh, William R.
Published: (2023)