| Main Authors: | Li, Peiming, Hu, Zhiyuan, Tang, Yang, Li, Shiyu, Chen, Xi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.11194 |
Similar Items
PuriDefense: Randomized Local Implicit Adversarial Purification for Defending Black-box Query-based Attacks
by: Guo, Ping, et al.
Published: (2024)
Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation
by: Lin, Dongding, et al.
Published: (2026)
AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation
by: Xu, Jinfeng, et al.
Published: (2024)
Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
by: Tang, Yang, et al.
Published: (2025)
Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models
by: Gu, Yanggan, et al.
Published: (2025)
Learning to Align Human Code Preferences
by: Yin, Xin, et al.
Published: (2025)
CROP: Expert-Aligned Image Cropping via Compositional Reasoning and Optimizing Preference
by: Dong, Zhitong, et al.
Published: (2026)
Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings
by: Li, Shiyu, et al.
Published: (2025)
MTRec: Learning to Align with User Preferences via Mental Reward Models
by: Zhao, Mengchen, et al.
Published: (2025)
Aligning Crowd Feedback via Distributional Preference Reward Modeling
by: Li, Dexun, et al.
Published: (2024)
ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment
by: Wang, Hao, et al.
Published: (2026)
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
by: Shen, Xiangwei, et al.
Published: (2025)
Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning
by: Li, Xuchen, et al.
Published: (2025)
From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models
by: Zhu, Mingkang, et al.
Published: (2025)
VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
by: Li, Xuzhao, et al.
Published: (2025)
Pragmatic Inference Chain (PIC) Improving LLMs' Reasoning of Authentic Implicit Toxic Language
by: Chen, Xi, et al.
Published: (2025)
Adversarial Preference Learning for Robust LLM Alignment
by: Wang, Yuanfu, et al.
Published: (2025)
STEMVerse: A Dual-Axis Diagnostic Framework for STEM Reasoning in Large Language Models
by: Li, Xuzhao, et al.
Published: (2026)
Test-Time Deep Thinking to Explore Implicit Rules
by: Chen, Wentong, et al.
Published: (2026)
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
by: Dong, Zibin, et al.
Published: (2023)
SIPO: Stabilized and Improved Preference Optimization for Aligning Diffusion Models
by: Yang, Xiaomeng, et al.
Published: (2025)
EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance
by: Yu, Song, et al.
Published: (2026)
Aligning Large Language Models with Searcher Preferences
by: Wu, Wei, et al.
Published: (2026)
RareAlert: Aligning heterogeneous large language model reasoning for early rare disease risk screening
by: Chen, Xi, et al.
Published: (2026)
Aligning Large Vision-Language Models by Deep Reinforcement Learning and Direct Preference Optimization
by: Nguyen, Thanh Thi, et al.
Published: (2025)
MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning
by: Chen, Siyong, et al.
Published: (2025)
CriterAlign: Criterion-Centric Rationale Alignment for Code Preference Judging
by: Li, Zhenyu, et al.
Published: (2026)
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
by: Yang, Wen, et al.
Published: (2025)
STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision
by: Li, Chen, et al.
Published: (2025)
Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions
by: Xu, Yuming, et al.
Published: (2026)
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
by: Pu, Rui, et al.
Published: (2025)
Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning
by: Li, Xuchen, et al.
Published: (2025)
Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria
by: Tian, Juanxi, et al.
Published: (2026)
SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning
by: Wu, Xiong Jun, et al.
Published: (2025)
Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense
by: Zhang, Jiawen, et al.
Published: (2025)
Lens Privacy Sealing: A New Benchmark and Method for Physical Privacy-Preserving Action Recognition
by: Liu, Mengyuan, et al.
Published: (2026)
Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
by: Huang, Hongzhe, et al.
Published: (2024)
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
by: Li, Shiyu, et al.
Published: (2025)
CausalStep: A Benchmark for Explicit Stepwise Causal Reasoning in Videos
by: Li, Xuchen, et al.
Published: (2025)
Implicit Safety Alignment from Crowd Preferences
by: Lin, Qian, et al.
Published: (2026)