Saved in:
| Main Authors: | Li, Lantao, Yang, Kang, Song, Rui, Sun, Chen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.24903 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception
by: Li, Lantao, et al.
Published: (2025)
by: Li, Lantao, et al.
Published: (2025)
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025)
by: Hong, Xinying, et al.
Published: (2025)
Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
by: Qin, Yu, et al.
Published: (2026)
by: Qin, Yu, et al.
Published: (2026)
CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception
by: Kong, Lingzhao, et al.
Published: (2025)
by: Kong, Lingzhao, et al.
Published: (2025)
On the Benefits of Visual Stabilization for Frame- and Event-based Perception
by: Rodriguez-Gomez, Juan Pablo, et al.
Published: (2024)
by: Rodriguez-Gomez, Juan Pablo, et al.
Published: (2024)
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
by: Li, Siyu, et al.
Published: (2024)
by: Li, Siyu, et al.
Published: (2024)
Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting
by: Teng, Fei, et al.
Published: (2025)
by: Teng, Fei, et al.
Published: (2025)
NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving
by: Luo, Kai, et al.
Published: (2026)
by: Luo, Kai, et al.
Published: (2026)
Robust Roadside Perception: an Automated Data Synthesis Pipeline Minimizing Human Annotation
by: Zhang, Rusheng, et al.
Published: (2023)
by: Zhang, Rusheng, et al.
Published: (2023)
DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction
by: Li, Siyu, et al.
Published: (2024)
by: Li, Siyu, et al.
Published: (2024)
WLTCL: Wide Field-of-View 3-D LiDAR Truck Compartment Automatic Localization System
by: Sun, Guodong, et al.
Published: (2025)
by: Sun, Guodong, et al.
Published: (2025)
TIR-Diffusion: Diffusion-based Thermal Infrared Image Denoising via Latent and Wavelet Domain Optimization
by: Rhee, Tai Hyoung, et al.
Published: (2025)
by: Rhee, Tai Hyoung, et al.
Published: (2025)
Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation
by: Padusinski, Hubert, et al.
Published: (2025)
by: Padusinski, Hubert, et al.
Published: (2025)
Can we Trust Unreliable Voxels? Exploring 3D Semantic Occupancy Prediction under Label Noise
by: Li, Wenxin, et al.
Published: (2026)
by: Li, Wenxin, et al.
Published: (2026)
One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes
by: Jia, Wanjun, et al.
Published: (2025)
by: Jia, Wanjun, et al.
Published: (2025)
OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
by: Shi, Hao, et al.
Published: (2025)
by: Shi, Hao, et al.
Published: (2025)
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
by: Huang, Yizhou, et al.
Published: (2025)
by: Huang, Yizhou, et al.
Published: (2025)
UniFucGrasp: Human-Hand-Inspired Unified Functional Grasp Annotation Strategy and Dataset for Diverse Dexterous Hands
by: Lin, Haoran, et al.
Published: (2025)
by: Lin, Haoran, et al.
Published: (2025)
Event-aided Semantic Scene Completion
by: Guo, Shangwei, et al.
Published: (2025)
by: Guo, Shangwei, et al.
Published: (2025)
Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2025)
by: Yang, Fan, et al.
Published: (2025)
Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2024)
by: Yang, Fan, et al.
Published: (2024)
ViPE: Video Pose Engine for 3D Geometric Perception
by: Huang, Jiahui, et al.
Published: (2025)
by: Huang, Jiahui, et al.
Published: (2025)
Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving
by: Shi, Hao, et al.
Published: (2024)
by: Shi, Hao, et al.
Published: (2024)
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model
by: Zeng, Kang, et al.
Published: (2024)
by: Zeng, Kang, et al.
Published: (2024)
Towards Anytime Optical Flow Estimation with Event Cameras
by: Ye, Yaozu, et al.
Published: (2023)
by: Ye, Yaozu, et al.
Published: (2023)
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models
by: Li, Siyu, et al.
Published: (2025)
by: Li, Siyu, et al.
Published: (2025)
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation
by: Zhang, Xu, et al.
Published: (2023)
by: Zhang, Xu, et al.
Published: (2023)
Unsupervised Multi-view UAV Image Geo-localization via Iterative Rendering
by: Li, Haoyuan, et al.
Published: (2024)
by: Li, Haoyuan, et al.
Published: (2024)
FishDetector-R1: Unified MLLM-Based Framework with Reinforcement Fine-Tuning for Weakly Supervised Fish Detection, Segmentation, and Counting
by: Liu, Yi, et al.
Published: (2025)
by: Liu, Yi, et al.
Published: (2025)
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors
by: Li, Siyu, et al.
Published: (2025)
by: Li, Siyu, et al.
Published: (2025)
S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection
by: He, Xuan, et al.
Published: (2023)
by: He, Xuan, et al.
Published: (2023)
Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection
by: Qin, Hongda, et al.
Published: (2025)
by: Qin, Hongda, et al.
Published: (2025)
$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs
by: Lin, Kaixin, et al.
Published: (2026)
by: Lin, Kaixin, et al.
Published: (2026)
Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers
by: Jiao, Jianbin, et al.
Published: (2024)
by: Jiao, Jianbin, et al.
Published: (2024)
PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
by: Zhu, Guoliang, et al.
Published: (2026)
by: Zhu, Guoliang, et al.
Published: (2026)
Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation
by: Zheng, Yuanfan, et al.
Published: (2026)
by: Zheng, Yuanfan, et al.
Published: (2026)
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
by: Zhao, Jiayi, et al.
Published: (2025)
by: Zhao, Jiayi, et al.
Published: (2025)
LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
by: Teng, Fei, et al.
Published: (2024)
by: Teng, Fei, et al.
Published: (2024)
O3N: Omnidirectional Open-Vocabulary Occupancy Prediction
by: Duan, Mengfei, et al.
Published: (2026)
by: Duan, Mengfei, et al.
Published: (2026)
InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing
by: Yang, Yebin, et al.
Published: (2026)
by: Yang, Yebin, et al.
Published: (2026)
Similar Items
-
RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception
by: Li, Lantao, et al.
Published: (2025) -
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025) -
Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
by: Qin, Yu, et al.
Published: (2026) -
CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception
by: Kong, Lingzhao, et al.
Published: (2025) -
On the Benefits of Visual Stabilization for Frame- and Event-based Perception
by: Rodriguez-Gomez, Juan Pablo, et al.
Published: (2024)