| Main Authors: | Luo, Zhanpeng, Zhang, Ce, Yong, Silong, Dai, Cunxi, Wang, Qianwei, Ran, Haoxi, Shi, Guanya, Sycara, Katia, Xie, Yaqi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.00905 |
Similar Items
GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration
by: Yong, Silong, et al.
Published: (2024)
Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
by: Zhang, Ce, et al.
Published: (2024)
Instant4D: 4D Gaussian Splatting in Minutes
by: Luo, Zhanpeng, et al.
Published: (2025)
Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
by: Zhang, Ce, et al.
Published: (2024)
Unifying Deep Predicate Invention with Pre-trained Foundation Models
by: Wang, Qianwei, et al.
Published: (2025)
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
by: Wan, Zifu, et al.
Published: (2024)
Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation
by: Zhang, Ce, et al.
Published: (2025)
HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
by: Zhang, Ce, et al.
Published: (2024)
Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
by: Zhang, Ce, et al.
Published: (2026)
Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis
by: Rauniyar, Aditya, et al.
Published: (2025)
Jailbreaking Frontier Foundation Models Through Intention Deception
by: Wang, Xinhe, et al.
Published: (2026)
ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
by: Wan, Zifu, et al.
Published: (2025)
OMG: Opacity Matters in Material Modeling with Gaussian Splatting
by: Yong, Silong, et al.
Published: (2025)
ShapeGrasp: Zero-Shot Task-Oriented Grasping with Large Language Models through Geometric Decomposition
by: Li, Samuel, et al.
Published: (2024)
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning
by: Wan, Zifu, et al.
Published: (2025)
Generalizable Dense Reward for Long-Horizon Robotic Tasks
by: Yong, Silong, et al.
Published: (2026)
Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration
by: Li, Benjamin, et al.
Published: (2025)
Theory of Mind Guided Strategy Adaptation for Zero-Shot Coordination
by: Ni, Andrew, et al.
Published: (2026)
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
by: Kim, Woojun, et al.
Published: (2025)
Fair Cooperation in Mixed-Motive Games via Conflict-Aware Gradient Adjustment
by: Kim, Woojun, et al.
Published: (2025)
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
by: Wang, Zirui, et al.
Published: (2024)
SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning
by: Chunhachatrachai, Pawat, et al.
Published: (2026)
SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding
by: Jin, Zhao, et al.
Published: (2025)
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
by: Zhang, Ce, et al.
Published: (2025)
Spatial-VLN: Zero-Shot Vision-and-Language Navigation With Explicit Spatial Perception and Exploration
by: Yue, Lu, et al.
Published: (2026)
Multi-Robot Navigation in Social Mini-Games: Definitions, Taxonomy, and Algorithms
by: Chandra, Rohan, et al.
Published: (2025)
SpatialNav: Leveraging Spatial Scene Graphs for Zero-Shot Vision-and-Language Navigation
by: Zhang, Jiwen, et al.
Published: (2026)
Improved Visual-Spatial Reasoning via R1-Zero-Like Training
by: Liao, Zhenyi, et al.
Published: (2025)
BFM-Zero: A Promptable Behavioral Foundation Model for Humanoid Control Using Unsupervised Reinforcement Learning
by: Li, Yitang, et al.
Published: (2025)
Reconfigurable Robot Control Using Flexible Coupling Mechanisms
by: Yi, Sha, et al.
Published: (2023)
CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data
by: Sharma, Shreya, et al.
Published: (2024)
Aligning LLM+PDDL Symbolic Plans with Human Objective Specifications through Evolutionary Algorithm Guidance
by: Burns, Owen, et al.
Published: (2024)
SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models
by: Taguchi, Shun, et al.
Published: (2025)
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
by: Dagli, Rishit, et al.
Published: (2024)
SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation
by: Zhang, Jiwen, et al.
Published: (2026)
GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
by: Lu, Yiren, et al.
Published: (2026)
CARE: Enhancing Safety of Visual Navigation through Collision Avoidance via Repulsive Estimation
by: Kim, Joonkyung, et al.
Published: (2025)
SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning
by: Ma, Wufei, et al.
Published: (2025)
Adaptively Coordinating with Novel Partners via Learned Latent Strategies
by: Li, Benjamin, et al.
Published: (2025)
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning
by: Ran, Xingjian, et al.
Published: (2025)