Saved in:
| Main Authors: | Cao, Xianwei, Quan, Dou, Zhang, Zhenliang, Wang, Shuang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.22813 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer
by: Cao, Xianwei, et al.
Published: (2025)
by: Cao, Xianwei, et al.
Published: (2025)
Dynamics-Aligned Shared Hypernetworks for Contextual RL under Discontinuous Shifts
by: Benad, Jan, et al.
Published: (2026)
by: Benad, Jan, et al.
Published: (2026)
Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain
by: Panagopoulos, Dimitris, et al.
Published: (2025)
by: Panagopoulos, Dimitris, et al.
Published: (2025)
Exploring Human-Machine Coexistence in Symmetrical Reality
by: Zhang, Zhenliang
Published: (2026)
by: Zhang, Zhenliang
Published: (2026)
Time-Scaling Is What Agents Need Now
by: Liu, Zhi, et al.
Published: (2026)
by: Liu, Zhi, et al.
Published: (2026)
Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts
by: Zhang, Zeyang, et al.
Published: (2024)
by: Zhang, Zeyang, et al.
Published: (2024)
Contrastive Learning of Preferences with a Contextual InfoNCE Loss
by: Bertram, Timo, et al.
Published: (2024)
by: Bertram, Timo, et al.
Published: (2024)
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
by: Dong, Guanting, et al.
Published: (2024)
by: Dong, Guanting, et al.
Published: (2024)
Contextual Position Encoding: Learning to Count What's Important
by: Golovneva, Olga, et al.
Published: (2024)
by: Golovneva, Olga, et al.
Published: (2024)
Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment
by: Wang, Leizhen, et al.
Published: (2025)
by: Wang, Leizhen, et al.
Published: (2025)
Silencing the Guardrails: Inference-Time Jailbreaking via Dynamic Contextual Representation Ablation
by: Xing, Wenpeng, et al.
Published: (2026)
by: Xing, Wenpeng, et al.
Published: (2026)
Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning
by: Li, Sijia, et al.
Published: (2026)
by: Li, Sijia, et al.
Published: (2026)
Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation
by: Lin, Dongding, et al.
Published: (2026)
by: Lin, Dongding, et al.
Published: (2026)
Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
by: Chen, Yifei, et al.
Published: (2025)
by: Chen, Yifei, et al.
Published: (2025)
ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs
by: Zhang, Zhenliang, et al.
Published: (2025)
by: Zhang, Zhenliang, et al.
Published: (2025)
ATIR: Towards Audio-Text Interleaved Contextual Retrieval
by: Zhao, Tong, et al.
Published: (2026)
by: Zhao, Tong, et al.
Published: (2026)
From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics
by: Cao, Bowen, et al.
Published: (2026)
by: Cao, Bowen, et al.
Published: (2026)
Lightweight Adapter Learning for More Generalized Remote Sensing Change Detection
by: Quan, Dou, et al.
Published: (2025)
by: Quan, Dou, et al.
Published: (2025)
TeachAnything: A Multimodal Crowdsourcing Platform for Training Embodied AI Agents in Symmetrical Reality
by: Liu, Zidong, et al.
Published: (2026)
by: Liu, Zidong, et al.
Published: (2026)
Attention Basin: Why Contextual Position Matters in Large Language Models
by: Yi, Zihao, et al.
Published: (2025)
by: Yi, Zihao, et al.
Published: (2025)
Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
by: Kwon, Minjae, et al.
Published: (2025)
by: Kwon, Minjae, et al.
Published: (2025)
Graphs Generalization under Distribution Shifts
by: Tian, Qin, et al.
Published: (2024)
by: Tian, Qin, et al.
Published: (2024)
Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection
by: Bian, Yun, et al.
Published: (2026)
by: Bian, Yun, et al.
Published: (2026)
Small-Margin Preferences Still Matter-If You Train Them Right
by: Pang, Jinlong, et al.
Published: (2026)
by: Pang, Jinlong, et al.
Published: (2026)
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
by: Cheng, Quan
Published: (2026)
by: Cheng, Quan
Published: (2026)
$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)
by: Chen, Xianwei, et al.
Published: (2026)
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
by: Luo, Yu, et al.
Published: (2024)
by: Luo, Yu, et al.
Published: (2024)
Statistical Inference for Misspecified Contextual Bandits
by: Guo, Yongyi, et al.
Published: (2025)
by: Guo, Yongyi, et al.
Published: (2025)
DynamicPO: Dynamic Preference Optimization for Recommendation
by: Hu, Xingyu, et al.
Published: (2026)
by: Hu, Xingyu, et al.
Published: (2026)
Rule Learning for Knowledge Graph Reasoning under Agnostic Distribution Shift
by: Liu, Shixuan, et al.
Published: (2025)
by: Liu, Shixuan, et al.
Published: (2025)
Now It Sounds Like You: Learning Personalized Vocabulary On Device
by: Wang, Sid, et al.
Published: (2023)
by: Wang, Sid, et al.
Published: (2023)
An Enhanced Federated Prototype Learning Method under Domain Shift
by: Kuang, Liang, et al.
Published: (2024)
by: Kuang, Liang, et al.
Published: (2024)
What Matters in Data for DPO?
by: Pan, Yu, et al.
Published: (2025)
by: Pan, Yu, et al.
Published: (2025)
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
by: Du, Huifang, et al.
Published: (2024)
by: Du, Huifang, et al.
Published: (2024)
What Matters for Batch Online Reinforcement Learning in Robotics?
by: Dong, Perry, et al.
Published: (2025)
by: Dong, Perry, et al.
Published: (2025)
Domain-Contextualized Inference: A Computable Graph Architecture for Explicit-Domain Reasoning
by: Li, Chao, et al.
Published: (2026)
by: Li, Chao, et al.
Published: (2026)
Multi-Objective Planning with Contextual Lexicographic Reward Preferences
by: Rustagi, Pulkit, et al.
Published: (2025)
by: Rustagi, Pulkit, et al.
Published: (2025)
Graph Fairness Learning under Distribution Shifts
by: Li, Yibo, et al.
Published: (2024)
by: Li, Yibo, et al.
Published: (2024)
Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora
by: Lee, JoonHo, et al.
Published: (2024)
by: Lee, JoonHo, et al.
Published: (2024)
Contextual Preference Collaborative Measure Framework Based on Belief System
by: Yu, Hang, et al.
Published: (2025)
by: Yu, Hang, et al.
Published: (2025)
Similar Items
-
CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer
by: Cao, Xianwei, et al.
Published: (2025) -
Dynamics-Aligned Shared Hypernetworks for Contextual RL under Discontinuous Shifts
by: Benad, Jan, et al.
Published: (2026) -
Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain
by: Panagopoulos, Dimitris, et al.
Published: (2025) -
Exploring Human-Machine Coexistence in Symmetrical Reality
by: Zhang, Zhenliang
Published: (2026) -
Time-Scaling Is What Agents Need Now
by: Liu, Zhi, et al.
Published: (2026)