:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Luo, Zhanpeng, Zhang, Ce, Yong, Silong, Dai, Cunxi, Wang, Qianwei, Ran, Haoxi, Shi, Guanya, Sycara, Katia, Xie, Yaqi
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.00905
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration
by: Yong, Silong, et al.
Published: (2024)

Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
by: Zhang, Ce, et al.
Published: (2024)

Instant4D: 4D Gaussian Splatting in Minutes
by: Luo, Zhanpeng, et al.
Published: (2025)

Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
by: Zhang, Ce, et al.
Published: (2024)

Unifying Deep Predicate Invention with Pre-trained Foundation Models
by: Wang, Qianwei, et al.
Published: (2025)

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
by: Wan, Zifu, et al.
Published: (2024)

Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation
by: Zhang, Ce, et al.
Published: (2025)

HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
by: Zhang, Ce, et al.
Published: (2024)

Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
by: Zhang, Ce, et al.
Published: (2026)

Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis
by: Rauniyar, Aditya, et al.
Published: (2025)

Jailbreaking Frontier Foundation Models Through Intention Deception
by: Wang, Xinhe, et al.
Published: (2026)

ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
by: Wan, Zifu, et al.
Published: (2025)

OMG: Opacity Matters in Material Modeling with Gaussian Splatting
by: Yong, Silong, et al.
Published: (2025)

ShapeGrasp: Zero-Shot Task-Oriented Grasping with Large Language Models through Geometric Decomposition
by: Li, Samuel, et al.
Published: (2024)

InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning
by: Wan, Zifu, et al.
Published: (2025)

Generalizable Dense Reward for Long-Horizon Robotic Tasks
by: Yong, Silong, et al.
Published: (2026)

Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration
by: Li, Benjamin, et al.
Published: (2025)

Theory of Mind Guided Strategy Adaptation for Zero-Shot Coordination
by: Ni, Andrew, et al.
Published: (2026)

B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
by: Kim, Woojun, et al.
Published: (2025)

Fair Cooperation in Mixed-Motive Games via Conflict-Aware Gradient Adjustment
by: Kim, Woojun, et al.
Published: (2025)

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
by: Wang, Zirui, et al.
Published: (2024)

SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning
by: Chunhachatrachai, Pawat, et al.
Published: (2026)

SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding
by: Jin, Zhao, et al.
Published: (2025)

Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
by: Zhang, Ce, et al.
Published: (2025)

Spatial-VLN: Zero-Shot Vision-and-Language Navigation With Explicit Spatial Perception and Exploration
by: Yue, Lu, et al.
Published: (2026)

Multi-Robot Navigation in Social Mini-Games: Definitions, Taxonomy, and Algorithms
by: Chandra, Rohan, et al.
Published: (2025)

SpatialNav: Leveraging Spatial Scene Graphs for Zero-Shot Vision-and-Language Navigation
by: Zhang, Jiwen, et al.
Published: (2026)

Improved Visual-Spatial Reasoning via R1-Zero-Like Training
by: Liao, Zhenyi, et al.
Published: (2025)

BFM-Zero: A Promptable Behavioral Foundation Model for Humanoid Control Using Unsupervised Reinforcement Learning
by: Li, Yitang, et al.
Published: (2025)

Reconfigurable Robot Control Using Flexible Coupling Mechanisms
by: Yi, Sha, et al.
Published: (2023)

CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data
by: Sharma, Shreya, et al.
Published: (2024)

Aligning LLM+PDDL Symbolic Plans with Human Objective Specifications through Evolutionary Algorithm Guidance
by: Burns, Owen, et al.
Published: (2024)

SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models
by: Taguchi, Shun, et al.
Published: (2025)

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
by: Dagli, Rishit, et al.
Published: (2024)

SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation
by: Zhang, Jiwen, et al.
Published: (2026)

GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
by: Lu, Yiren, et al.
Published: (2026)

CARE: Enhancing Safety of Visual Navigation through Collision Avoidance via Repulsive Estimation
by: Kim, Joonkyung, et al.
Published: (2025)

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning
by: Ma, Wufei, et al.
Published: (2025)

Adaptively Coordinating with Novel Partners via Learned Latent Strategies
by: Li, Benjamin, et al.
Published: (2025)

Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning
by: Ran, Xingjian, et al.
Published: (2025)