| Main Authors: | Chen, Canyu, Yang, Yuguang, Tan, Zhewen, Wang, Yizhi, Zhan, Ruiyi, Liu, Haiyan, Mao, Xuanyao, Bao, Jason, Tang, Xinyue, Yang, Linlin, Sun, Bingchuan, Wang, Yan, Zhang, Baochang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.06049 |
Similar Items
From Representational Complementarity to Dual Systems: Synergizing VLM and Vision-Only Backbones for End-to-End Driving
by: Ang, Sining, et al.
Published: (2026)
CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning
by: Ang, Sining, et al.
Published: (2026)
AR Forcing: Towards Long-Horizon Robot Navigation World Model
by: Yang, Yifei, et al.
Published: (2026)
SURGE: Surrogate Gradient Adaptation in Binary Neural Networks
by: Huang, Haoyu, et al.
Published: (2026)
SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries
by: Dang, Chenxu, et al.
Published: (2025)
SilLang: Improving Gait Recognition with Silhouette Language Encoding
by: Zhan, Ruiyi, et al.
Published: (2026)
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic-Spatial Fusion and Latent Predictive Representation
by: Fan, Zehua, et al.
Published: (2026)
Causally Guided Gaussian Perturbations for Out-Of-Distribution Generalization in Medical Imaging
by: Pei, Haoran, et al.
Published: (2025)
Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures
by: Luo, Yuechen, et al.
Published: (2026)
DecomCAM: Advancing Beyond Saliency Maps through Decomposition and Integration
by: Yang, Yuguang, et al.
Published: (2024)
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
by: Yang, Yuguang, et al.
Published: (2025)
EvoDriveVLA: Evolving Driving VLA Models via Collaborative Perception-Planning Distillation
by: Cao, Jiajun, et al.
Published: (2026)
VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization
by: Cong, Xiaoyan, et al.
Published: (2025)
Normalizing Batch Normalization for Long-Tailed Recognition
by: Bao, Yuxiang, et al.
Published: (2025)
Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization
by: Pei, Haoran, et al.
Published: (2026)
AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation
by: Chen, Tongfei, et al.
Published: (2026)
Bone Organ‐on‐a‐Chip Uncovers That TPD52L1 Enhances Osteogenic Differentiation of MSCs and Contributes to Osteoporosis Repair
by: Zhewen Liu, et al.
Published: (2026)
TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
by: Liu, Jiahang, et al.
Published: (2025)
DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving
by: Shang, Shuyao, et al.
Published: (2026)
Decom-CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map
by: Yang, Yuguang, et al.
Published: (2023)
AI Research Agents Narrow Scientific Exploration
by: Tang, Yixuan, et al.
Published: (2026)
Uncertainty-Aware Gradient Stabilization for Small Object Detection
by: Sun, Huixin, et al.
Published: (2023)
Teaching effects of the online and offline flipped classroom model (FCM) in the post‐epidemic era: Development and feasibility study
by: Shumin Wang, et al.
Published: (2024)
Squeeze10-LLM: Squeezing LLMs' Weights by 10 Times via a Staged Mixed-Precision Quantization Method
by: Zhu, Qingcheng, et al.
Published: (2025)
Using a Hybrid Collaborative Crisis Management Framework to Foster Long‐Term Growth in Post‐Disaster Reconstruction: Findings From the Chinese Paired Assistance Policy
by: Linlin Wang, et al.
Published: (2025)
RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models
by: Liufu, Weijia, et al.
Published: (2026)
Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization
by: Zhu, Haodong, et al.
Published: (2026)
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
by: Li, Yingyan, et al.
Published: (2025)
VirtuWander: Enhancing Multi-modal Interaction for Virtual Tour Guidance through Large Language Models
by: Wang, Zhan, et al.
Published: (2024)
Volatility in Carbon Futures Amid Uncertainties: Considering Geopolitical and Economic Policy Factors
by: Xiaoqing Wang, et al.
Published: (2025)
DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models
by: Hao, Yuhan, et al.
Published: (2025)
WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection
by: Zhu, Haodong, et al.
Published: (2025)
SHIELD: A Segmented Hierarchical Memory Architecture for Energy-Efficient LLM Inference on Edge NPUs
by: Zhang, Jintao, et al.
Published: (2026)
A Unified Evaluation Framework for Spiking Neural Network Hardware Accelerators Based on Emerging Non-Volatile Memory Devices
by: Das, Debasis, et al.
Published: (2024)
Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving
by: Zheng, Yinan, et al.
Published: (2026)
Efficiently Seeking Flat Minima for Better Generalization in Fine-Tuning Large Language Models and Beyond
by: Deng, Jiaxin, et al.
Published: (2025)
ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving
by: Sheng, Zihao, et al.
Published: (2026)
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation
by: Fan, Yiguo, et al.
Published: (2025)
Devil's Advocate: Anticipatory Reflection for LLM Agents
by: Wang, Haoyu, et al.
Published: (2024)
CoWorld-VLA: Thinking in a Multi-Expert World Model for Autonomous Driving
by: Huang, Minqing, et al.
Published: (2026)