:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Lantao, Yang, Kang, Song, Rui, Sun, Chen
Format:	Preprint
Published:	2025
Subjects:	Robotics Computer Vision and Pattern Recognition Image and Video Processing
Online Access:	https://arxiv.org/abs/2509.24903
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception
by: Li, Lantao, et al.
Published: (2025)

TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025)

Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
by: Qin, Yu, et al.
Published: (2026)

CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception
by: Kong, Lingzhao, et al.
Published: (2025)

On the Benefits of Visual Stabilization for Frame- and Event-based Perception
by: Rodriguez-Gomez, Juan Pablo, et al.
Published: (2024)

GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
by: Li, Siyu, et al.
Published: (2024)

Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting
by: Teng, Fei, et al.
Published: (2025)

NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving
by: Luo, Kai, et al.
Published: (2026)

Robust Roadside Perception: an Automated Data Synthesis Pipeline Minimizing Human Annotation
by: Zhang, Rusheng, et al.
Published: (2023)

DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction
by: Li, Siyu, et al.
Published: (2024)

WLTCL: Wide Field-of-View 3-D LiDAR Truck Compartment Automatic Localization System
by: Sun, Guodong, et al.
Published: (2025)

TIR-Diffusion: Diffusion-based Thermal Infrared Image Denoising via Latent and Wavelet Domain Optimization
by: Rhee, Tai Hyoung, et al.
Published: (2025)

Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation
by: Padusinski, Hubert, et al.
Published: (2025)

Can we Trust Unreliable Voxels? Exploring 3D Semantic Occupancy Prediction under Label Noise
by: Li, Wenxin, et al.
Published: (2026)

One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes
by: Jia, Wanjun, et al.
Published: (2025)

OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
by: Shi, Hao, et al.
Published: (2025)

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
by: Huang, Yizhou, et al.
Published: (2025)

UniFucGrasp: Human-Hand-Inspired Unified Functional Grasp Annotation Strategy and Dataset for Diverse Dexterous Hands
by: Lin, Haoran, et al.
Published: (2025)

Event-aided Semantic Scene Completion
by: Guo, Shangwei, et al.
Published: (2025)

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2025)

Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2024)

ViPE: Video Pose Engine for 3D Geometric Perception
by: Huang, Jiahui, et al.
Published: (2025)

Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving
by: Shi, Hao, et al.
Published: (2024)

MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model
by: Zeng, Kang, et al.
Published: (2024)

Towards Anytime Optical Flow Estimation with Event Cameras
by: Ye, Yaozu, et al.
Published: (2023)

NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models
by: Li, Siyu, et al.
Published: (2025)

PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation
by: Zhang, Xu, et al.
Published: (2023)

Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering
by: Li, Haoyuan, et al.
Published: (2024)

FishDetector-R1: Unified MLLM-Based Framework with Reinforcement Fine-Tuning for Weakly Supervised Fish Detection, Segmentation, and Counting
by: Liu, Yi, et al.
Published: (2025)

HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors
by: Li, Siyu, et al.
Published: (2025)

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection
by: He, Xuan, et al.
Published: (2023)

Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection
by: Qin, Hongda, et al.
Published: (2025)

$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs
by: Lin, Kaixin, et al.
Published: (2026)

Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers
by: Jiao, Jianbin, et al.
Published: (2024)

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
by: Zhu, Guoliang, et al.
Published: (2026)

Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation
by: Zheng, Yuanfan, et al.
Published: (2026)

Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
by: Zhao, Jiayi, et al.
Published: (2025)

LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
by: Teng, Fei, et al.
Published: (2024)

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction
by: Duan, Mengfei, et al.
Published: (2026)

InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing
by: Yang, Yebin, et al.
Published: (2026)