:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kong, Lingzhao, Lin, Jiacheng, Li, Siyu, Luo, Kai, Li, Zhiyong, Yang, Kailun
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Robotics Image and Video Processing
Online Access:	https://arxiv.org/abs/2509.17107
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model
by: Zeng, Kang, et al.
Published: (2024)

TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025)

DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction
by: Li, Siyu, et al.
Published: (2024)

PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation
by: Zhang, Xu, et al.
Published: (2023)

GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
by: Li, Siyu, et al.
Published: (2024)

NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models
by: Li, Siyu, et al.
Published: (2025)

HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors
by: Li, Siyu, et al.
Published: (2025)

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2025)

Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
by: Zhao, Jiayi, et al.
Published: (2025)

CFMW: Cross-modality Fusion Mamba for Robust Object Detection under Adverse Weather
by: Li, Haoyuan, et al.
Published: (2024)

Panoramic Out-of-Distribution Segmentation
by: Duan, Mengfei, et al.
Published: (2025)

Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
by: Qin, Yu, et al.
Published: (2026)

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection
by: He, Xuan, et al.
Published: (2023)

Out-of-Distribution Semantic Occupancy Prediction
by: Zhang, Yuheng, et al.
Published: (2025)

Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting
by: Teng, Fei, et al.
Published: (2025)

AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
by: Lin, Jiacheng, et al.
Published: (2023)

NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving
by: Luo, Kai, et al.
Published: (2026)

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
by: Lin, Jiacheng, et al.
Published: (2024)

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
by: Zhu, Guoliang, et al.
Published: (2026)

Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2024)

A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion
by: Fadili, Maryem, et al.
Published: (2025)

DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
by: Deng, Buyin, et al.
Published: (2025)

LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
by: Teng, Fei, et al.
Published: (2024)

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction
by: Duan, Mengfei, et al.
Published: (2026)

Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection
by: Qin, Hongda, et al.
Published: (2025)

Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction
by: Yang, Zhe, et al.
Published: (2026)

Towards Consistent Object Detection via LiDAR-Camera Synergy
by: Luo, Kai, et al.
Published: (2024)

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
by: Huang, Yizhou, et al.
Published: (2025)

OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation
by: Teng, Fei, et al.
Published: (2023)

Towards Source-free Domain Adaptive Semantic Segmentation via Importance-aware and Prototype-contrast Learning
by: Cao, Yihong, et al.
Published: (2023)

DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion
by: Jiao, Jianbin, et al.
Published: (2025)

One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes
by: Jia, Wanjun, et al.
Published: (2025)

OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback
by: Luo, Kai, et al.
Published: (2025)

OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras
by: Lin, Yongzhi, et al.
Published: (2026)

QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots
by: Wu, Sheng, et al.
Published: (2025)

Segment-to-Act: Label-Noise-Robust Action-Prompted Video Segmentation Towards Embodied Intelligence
by: Li, Wenxin, et al.
Published: (2025)

Panoramic Multimodal Semantic Occupancy Prediction for Quadruped Robots
by: Zhao, Guoqiang, et al.
Published: (2026)

Omnidirectional Multi-Object Tracking
by: Luo, Kai, et al.
Published: (2025)

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
by: Zhang, Jiaming, et al.
Published: (2022)

EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data
by: Yi, Zhonghua, et al.
Published: (2024)