Saved in:
| Main Authors: | Du, Songlin, Lu, Xiaoyong, Yan, Yaping, Xiao, Guobao, Lu, Xiaobo, Ikenaga, Takeshi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.13941 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba
by: Lu, Xiaoyong, et al.
Published: (2025)
by: Lu, Xiaoyong, et al.
Published: (2025)
Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching
by: Lu, Xiaoyong, et al.
Published: (2024)
by: Lu, Xiaoyong, et al.
Published: (2024)
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
by: Wu, Shuang, et al.
Published: (2024)
by: Wu, Shuang, et al.
Published: (2024)
RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding
by: Liu, Xiyan, et al.
Published: (2025)
by: Liu, Xiyan, et al.
Published: (2025)
CycleSAM: Few-Shot Surgical Scene Segmentation with Cycle- and Scene-Consistent Feature Matching
by: Murali, Aditya, et al.
Published: (2024)
by: Murali, Aditya, et al.
Published: (2024)
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction
by: Zuo, Sicheng, et al.
Published: (2025)
by: Zuo, Sicheng, et al.
Published: (2025)
MegaScenes: Scene-Level View Synthesis at Scale
by: Tung, Joseph, et al.
Published: (2024)
by: Tung, Joseph, et al.
Published: (2024)
MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation
by: Pan, Zhenyu, et al.
Published: (2025)
by: Pan, Zhenyu, et al.
Published: (2025)
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations
by: Yuan, Zhihao, et al.
Published: (2025)
by: Yuan, Zhihao, et al.
Published: (2025)
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation
by: Gong, Yan, et al.
Published: (2025)
by: Gong, Yan, et al.
Published: (2025)
Hypergraph Convolutional Network based Weakly Supervised Point Cloud Semantic Segmentation with Scene-Level Annotations
by: Lu, Zhuheng, et al.
Published: (2022)
by: Lu, Zhuheng, et al.
Published: (2022)
4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction
by: Mazur, Kirill, et al.
Published: (2025)
by: Mazur, Kirill, et al.
Published: (2025)
Surgical Scene Segmentation by Transformer With Asymmetric Feature Enhancement
by: Yuan, Cheng, et al.
Published: (2024)
by: Yuan, Cheng, et al.
Published: (2024)
Aggregated Text Transformer for Scene Text Detection
by: Zhou, Zhao, et al.
Published: (2022)
by: Zhou, Zhao, et al.
Published: (2022)
MDiff4STR: Mask Diffusion Model for Scene Text Recognition
by: Du, Yongkun, et al.
Published: (2025)
by: Du, Yongkun, et al.
Published: (2025)
Deep Feature Gaussian Processes for Single-Scene Aerosol Optical Depth Reconstruction
by: Liu, Shengjie, et al.
Published: (2024)
by: Liu, Shengjie, et al.
Published: (2024)
FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow
by: Yang, Zhifei, et al.
Published: (2026)
by: Yang, Zhifei, et al.
Published: (2026)
EAFormer: Scene Text Segmentation with Edge-Aware Transformers
by: Yu, Haiyang, et al.
Published: (2024)
by: Yu, Haiyang, et al.
Published: (2024)
SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency
by: Song, Quanjian, et al.
Published: (2025)
by: Song, Quanjian, et al.
Published: (2025)
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
by: Zhai, Guangyao, et al.
Published: (2023)
by: Zhai, Guangyao, et al.
Published: (2023)
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation
by: Khandelwal, Naitik, et al.
Published: (2023)
by: Khandelwal, Naitik, et al.
Published: (2023)
GeoSceneGraph: Geometric Scene Graph Diffusion Model for Text-guided 3D Indoor Scene Synthesis
by: Ruiz, Antonio, et al.
Published: (2025)
by: Ruiz, Antonio, et al.
Published: (2025)
SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation
by: Luo, Jun, et al.
Published: (2026)
by: Luo, Jun, et al.
Published: (2026)
Controllable 3D Outdoor Scene Generation via Scene Graphs
by: Liu, Yuheng, et al.
Published: (2025)
by: Liu, Yuheng, et al.
Published: (2025)
TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes
by: Xu, Liangyu, et al.
Published: (2024)
by: Xu, Liangyu, et al.
Published: (2024)
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
by: Yu, Zhu, et al.
Published: (2024)
by: Yu, Zhu, et al.
Published: (2024)
Multimodal 3D Reasoning Segmentation with Complex Scenes
by: Jiang, Xueying, et al.
Published: (2024)
by: Jiang, Xueying, et al.
Published: (2024)
Compositional Feature Augmentation for Unbiased Scene Graph Generation
by: Li, Lin, et al.
Published: (2023)
by: Li, Lin, et al.
Published: (2023)
SceneParser: Hierarchical Scene Parsing for Visual Semantics Understanding
by: Xu, Pengxin, et al.
Published: (2026)
by: Xu, Pengxin, et al.
Published: (2026)
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames
by: Chen, Chao, et al.
Published: (2023)
by: Chen, Chao, et al.
Published: (2023)
OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
by: Liu, Pei, et al.
Published: (2025)
by: Liu, Pei, et al.
Published: (2025)
SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding
by: Zhu, Dekai, et al.
Published: (2025)
by: Zhu, Dekai, et al.
Published: (2025)
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
by: Yang, Mingkun, et al.
Published: (2024)
by: Yang, Mingkun, et al.
Published: (2024)
Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation
by: Lin, Hongbin, et al.
Published: (2025)
by: Lin, Hongbin, et al.
Published: (2025)
HetScene: Heterogeneity-Aware Diffusion for Dense Indoor Scene Generation
by: Chen, Zini, et al.
Published: (2026)
by: Chen, Zini, et al.
Published: (2026)
SARA: Scene-Aware Reconstruction Accelerator
by: Lee, Jee Won, et al.
Published: (2026)
by: Lee, Jee Won, et al.
Published: (2026)
BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting
by: Lu, Yiren, et al.
Published: (2025)
by: Lu, Yiren, et al.
Published: (2025)
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
by: Srivastava, Divyansh, et al.
Published: (2025)
by: Srivastava, Divyansh, et al.
Published: (2025)
Efficient Scene Appearance Aggregation for Level-of-Detail Rendering
by: Zhou, Yang, et al.
Published: (2024)
by: Zhou, Yang, et al.
Published: (2024)
Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
by: Zeng, Shuai, et al.
Published: (2024)
by: Zeng, Shuai, et al.
Published: (2024)
Similar Items
-
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba
by: Lu, Xiaoyong, et al.
Published: (2025) -
Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching
by: Lu, Xiaoyong, et al.
Published: (2024) -
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
by: Wu, Shuang, et al.
Published: (2024) -
RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding
by: Liu, Xiyan, et al.
Published: (2025) -
CycleSAM: Few-Shot Surgical Scene Segmentation with Cycle- and Scene-Consistent Feature Matching
by: Murali, Aditya, et al.
Published: (2024)