:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cao, Xianwei, Quan, Dou, Zhang, Zhenliang, Wang, Shuang
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.22813
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer
by: Cao, Xianwei, et al.
Published: (2025)

Dynamics-Aligned Shared Hypernetworks for Contextual RL under Discontinuous Shifts
by: Benad, Jan, et al.
Published: (2026)

Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain
by: Panagopoulos, Dimitris, et al.
Published: (2025)

Exploring Human-Machine Coexistence in Symmetrical Reality
by: Zhang, Zhenliang
Published: (2026)

Time-Scaling Is What Agents Need Now
by: Liu, Zhi, et al.
Published: (2026)

Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts
by: Zhang, Zeyang, et al.
Published: (2024)

Contrastive Learning of Preferences with a Contextual InfoNCE Loss
by: Bertram, Timo, et al.
Published: (2024)

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
by: Dong, Guanting, et al.
Published: (2024)

Contextual Position Encoding: Learning to Count What's Important
by: Golovneva, Olga, et al.
Published: (2024)

Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment
by: Wang, Leizhen, et al.
Published: (2025)

Silencing the Guardrails: Inference-Time Jailbreaking via Dynamic Contextual Representation Ablation
by: Xing, Wenpeng, et al.
Published: (2026)

Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning
by: Li, Sijia, et al.
Published: (2026)

Where and What: Reasoning Dynamic and Implicit Preferences in Situated Conversational Recommendation
by: Lin, Dongding, et al.
Published: (2026)

Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
by: Chen, Yifei, et al.
Published: (2025)

ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs
by: Zhang, Zhenliang, et al.
Published: (2025)

ATIR: Towards Audio-Text Interleaved Contextual Retrieval
by: Zhao, Tong, et al.
Published: (2026)

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics
by: Cao, Bowen, et al.
Published: (2026)

Lightweight Adapter Learning for More Generalized Remote Sensing Change Detection
by: Quan, Dou, et al.
Published: (2025)

TeachAnything: A Multimodal Crowdsourcing Platform for Training Embodied AI Agents in Symmetrical Reality
by: Liu, Zidong, et al.
Published: (2026)

Attention Basin: Why Contextual Position Matters in Large Language Models
by: Yi, Zihao, et al.
Published: (2025)

Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
by: Kwon, Minjae, et al.
Published: (2025)

Graphs Generalization under Distribution Shifts
by: Tian, Qin, et al.
Published: (2024)

Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection
by: Bian, Yun, et al.
Published: (2026)

Small-Margin Preferences Still Matter-If You Train Them Right
by: Pang, Jinlong, et al.
Published: (2026)

Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
by: Cheng, Quan
Published: (2026)

$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)

OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
by: Luo, Yu, et al.
Published: (2024)

Statistical Inference for Misspecified Contextual Bandits
by: Guo, Yongyi, et al.
Published: (2025)

DynamicPO: Dynamic Preference Optimization for Recommendation
by: Hu, Xingyu, et al.
Published: (2026)

Rule Learning for Knowledge Graph Reasoning under Agnostic Distribution Shift
by: Liu, Shixuan, et al.
Published: (2025)

Now It Sounds Like You: Learning Personalized Vocabulary On Device
by: Wang, Sid, et al.
Published: (2023)

An Enhanced Federated Prototype Learning Method under Domain Shift
by: Kuang, Liang, et al.
Published: (2024)

What Matters in Data for DPO?
by: Pan, Yu, et al.
Published: (2025)

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
by: Du, Huifang, et al.
Published: (2024)

What Matters for Batch Online Reinforcement Learning in Robotics?
by: Dong, Perry, et al.
Published: (2025)

Domain-Contextualized Inference: A Computable Graph Architecture for Explicit-Domain Reasoning
by: Li, Chao, et al.
Published: (2026)

Multi-Objective Planning with Contextual Lexicographic Reward Preferences
by: Rustagi, Pulkit, et al.
Published: (2025)

Graph Fairness Learning under Distribution Shifts
by: Li, Yibo, et al.
Published: (2024)

Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora
by: Lee, JoonHo, et al.
Published: (2024)

Contextual Preference Collaborative Measure Framework Based on Belief System
by: Yu, Hang, et al.
Published: (2025)