Saved in:
| Main Authors: | Kong, Lingzhao, Lin, Jiacheng, Li, Siyu, Luo, Kai, Li, Zhiyong, Yang, Kailun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.17107 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model
by: Zeng, Kang, et al.
Published: (2024)
by: Zeng, Kang, et al.
Published: (2024)
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025)
by: Hong, Xinying, et al.
Published: (2025)
DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction
by: Li, Siyu, et al.
Published: (2024)
by: Li, Siyu, et al.
Published: (2024)
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation
by: Zhang, Xu, et al.
Published: (2023)
by: Zhang, Xu, et al.
Published: (2023)
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
by: Li, Siyu, et al.
Published: (2024)
by: Li, Siyu, et al.
Published: (2024)
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models
by: Li, Siyu, et al.
Published: (2025)
by: Li, Siyu, et al.
Published: (2025)
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors
by: Li, Siyu, et al.
Published: (2025)
by: Li, Siyu, et al.
Published: (2025)
Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2025)
by: Yang, Fan, et al.
Published: (2025)
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
by: Zhao, Jiayi, et al.
Published: (2025)
by: Zhao, Jiayi, et al.
Published: (2025)
CFMW: Cross-modality Fusion Mamba for Robust Object Detection under Adverse Weather
by: Li, Haoyuan, et al.
Published: (2024)
by: Li, Haoyuan, et al.
Published: (2024)
Panoramic Out-of-Distribution Segmentation
by: Duan, Mengfei, et al.
Published: (2025)
by: Duan, Mengfei, et al.
Published: (2025)
Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
by: Qin, Yu, et al.
Published: (2026)
by: Qin, Yu, et al.
Published: (2026)
S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection
by: He, Xuan, et al.
Published: (2023)
by: He, Xuan, et al.
Published: (2023)
Out-of-Distribution Semantic Occupancy Prediction
by: Zhang, Yuheng, et al.
Published: (2025)
by: Zhang, Yuheng, et al.
Published: (2025)
Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting
by: Teng, Fei, et al.
Published: (2025)
by: Teng, Fei, et al.
Published: (2025)
AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
by: Lin, Jiacheng, et al.
Published: (2023)
by: Lin, Jiacheng, et al.
Published: (2023)
NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving
by: Luo, Kai, et al.
Published: (2026)
by: Luo, Kai, et al.
Published: (2026)
EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
by: Lin, Jiacheng, et al.
Published: (2024)
by: Lin, Jiacheng, et al.
Published: (2024)
PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
by: Zhu, Guoliang, et al.
Published: (2026)
by: Zhu, Guoliang, et al.
Published: (2026)
Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Dexterous Grasping
by: Yang, Fan, et al.
Published: (2024)
by: Yang, Fan, et al.
Published: (2024)
A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion
by: Fadili, Maryem, et al.
Published: (2025)
by: Fadili, Maryem, et al.
Published: (2025)
DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking
by: Deng, Buyin, et al.
Published: (2025)
by: Deng, Buyin, et al.
Published: (2025)
LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
by: Teng, Fei, et al.
Published: (2024)
by: Teng, Fei, et al.
Published: (2024)
O3N: Omnidirectional Open-Vocabulary Occupancy Prediction
by: Duan, Mengfei, et al.
Published: (2026)
by: Duan, Mengfei, et al.
Published: (2026)
Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection
by: Qin, Hongda, et al.
Published: (2025)
by: Qin, Hongda, et al.
Published: (2025)
Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction
by: Yang, Zhe, et al.
Published: (2026)
by: Yang, Zhe, et al.
Published: (2026)
Towards Consistent Object Detection via LiDAR-Camera Synergy
by: Luo, Kai, et al.
Published: (2024)
by: Luo, Kai, et al.
Published: (2024)
Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts
by: Huang, Yizhou, et al.
Published: (2025)
by: Huang, Yizhou, et al.
Published: (2025)
OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation
by: Teng, Fei, et al.
Published: (2023)
by: Teng, Fei, et al.
Published: (2023)
Towards Source-free Domain Adaptive Semantic Segmentation via Importance-aware and Prototype-contrast Learning
by: Cao, Yihong, et al.
Published: (2023)
by: Cao, Yihong, et al.
Published: (2023)
DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion
by: Jiao, Jianbin, et al.
Published: (2025)
by: Jiao, Jianbin, et al.
Published: (2025)
One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes
by: Jia, Wanjun, et al.
Published: (2025)
by: Jia, Wanjun, et al.
Published: (2025)
OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback
by: Luo, Kai, et al.
Published: (2025)
by: Luo, Kai, et al.
Published: (2025)
OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras
by: Lin, Yongzhi, et al.
Published: (2026)
by: Lin, Yongzhi, et al.
Published: (2026)
QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots
by: Wu, Sheng, et al.
Published: (2025)
by: Wu, Sheng, et al.
Published: (2025)
Segment-to-Act: Label-Noise-Robust Action-Prompted Video Segmentation Towards Embodied Intelligence
by: Li, Wenxin, et al.
Published: (2025)
by: Li, Wenxin, et al.
Published: (2025)
Panoramic Multimodal Semantic Occupancy Prediction for Quadruped Robots
by: Zhao, Guoqiang, et al.
Published: (2026)
by: Zhao, Guoqiang, et al.
Published: (2026)
Omnidirectional Multi-Object Tracking
by: Luo, Kai, et al.
Published: (2025)
by: Luo, Kai, et al.
Published: (2025)
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
by: Zhang, Jiaming, et al.
Published: (2022)
by: Zhang, Jiaming, et al.
Published: (2022)
EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data
by: Yi, Zhonghua, et al.
Published: (2024)
by: Yi, Zhonghua, et al.
Published: (2024)
Similar Items
-
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model
by: Zeng, Kang, et al.
Published: (2024) -
TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping
by: Hong, Xinying, et al.
Published: (2025) -
DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction
by: Li, Siyu, et al.
Published: (2024) -
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation
by: Zhang, Xu, et al.
Published: (2023) -
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
by: Li, Siyu, et al.
Published: (2024)