| Main Authors: | Liang, Dayang, Liu, Ruihan, Wan, Lipeng, Liu, Yunlong, An, Bo |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.18724 |
Similar Items
Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications
by: Liang, Dayang, et al.
Published: (2024)
Episodic Reinforcement Learning with Expanded State-reward Space
by: Liang, Dayang, et al.
Published: (2024)
InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills
by: Liang, Dayang, et al.
Published: (2026)
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
by: Liu, Zeyang, et al.
Published: (2024)
ThanoRA: Task Heterogeneity-Aware Multi-Task Low-Rank Adaptation
by: Liang, Jian, et al.
Published: (2025)
A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications
by: Tao, Zhenyu, et al.
Published: (2025)
LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation
by: Guo, Ke, et al.
Published: (2024)
Game Generation via Large Language Models
by: Hu, Chengpeng, et al.
Published: (2024)
Stable Offline Value Function Learning with Bisimulation-based Representations
by: Pavse, Brahma S., et al.
Published: (2024)
DeepThink3D: Enhancing Large Language Models with Programmatic Reasoning in Complex 3D Situated Reasoning Tasks
by: Song, Jiayi, et al.
Published: (2025)
Boosting Meta-Learning for Few-Shot Text Classification via Label-guided Distance Scaling
by: Gao, Yunlong, et al.
Published: (2026)
SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios
by: Huang, Ding-Tao, et al.
Published: (2024)
Toward Automated and Trustworthy Scientific Analysis and Visualization with LLM-Generated Code
by: Chakroborti, Apu Kumar, et al.
Published: (2025)
Scaling Synthetic Task Generation for Agents via Exploration
by: Ramrakhya, Ram, et al.
Published: (2025)
Selection, Reflection and Self-Refinement: Revisit Reasoning Tasks via a Causal Lens
by: Deng, Yunlong, et al.
Published: (2025)
RDEx-CSOP: Feasibility-Aware Reconstructed Differential Evolution with Adaptive epsilon-Constraint Ranking
by: Tao, Sichen, et al.
Published: (2026)
CALM: Consensus-Aware Localized Merging for Multi-Task Learning
by: Yan, Kunda, et al.
Published: (2025)
Measuring and Analyzing Intelligence via Contextual Uncertainty in Large Language Models using Information-Theoretic Metrics
by: Shim, Jae Wan
Published: (2025)
RDEx-CMOP: Feasibility-Aware Indicator-Guided Differential Evolution for Fixed-Budget Constrained Multiobjective Optimization
by: Tao, Sichen, et al.
Published: (2026)
Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification
by: Qin, Minghao, et al.
Published: (2025)
Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration
by: Zeng, Jing, et al.
Published: (2024)
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
by: Yue, Bo, et al.
Published: (2024)
DRT: Deep Reasoning Translation via Long Chain-of-Thought
by: Wang, Jiaan, et al.
Published: (2024)
Scene-Aware Explainable Multimodal Trajectory Prediction
by: Liu, Pei, et al.
Published: (2024)
On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
by: Hou, Yunlong, et al.
Published: (2026)
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
by: Liu, Zeyang, et al.
Published: (2024)
Transport-Hub-Aware Spatial-Temporal Adaptive Graph Transformer for Traffic Flow Prediction
by: Xu, Xiao, et al.
Published: (2023)
Fake News Detection and Manipulation Reasoning via Large Vision-Language Models
by: Jin, Ruihan, et al.
Published: (2024)
Towards Provably Unlearnable Examples via Bayes Error Optimisation
by: Zhang, Ruihan, et al.
Published: (2025)
SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing
by: Luo, Ruihan, et al.
Published: (2025)
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models
by: Shao, Hang, et al.
Published: (2023)
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
by: Barkan, Oren, et al.
Published: (2024)
Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs
by: Xing, Xing, et al.
Published: (2025)
How to Evaluate Semantic Communications for Images with ViTScore Metric?
by: Zhu, Tingting, et al.
Published: (2023)
Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
by: Chen, Lizhang, et al.
Published: (2023)
Joint Agent Memory and Exploration Learning via Novelty Signals
by: Tian, Shizuo, et al.
Published: (2026)
CrossLinear: Plug-and-Play Cross-Correlation Embedding for Time Series Forecasting with Exogenous Variables
by: Zhou, Pengfei, et al.
Published: (2025)
OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration
by: Yang, Yiqin, et al.
Published: (2026)
Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization
by: Hua, Xingyuan, et al.
Published: (2026)
Learning to Adapt: Self-Improving Web Agent via Cognitive-Aware Exploration
by: Chen, Weile, et al.
Published: (2026)