:: Library Catalog

Bibliographic Details
Main Authors:	Huang, Zilong; He, Jun; Ye, Junyan; Jiang, Lihan; Li, Weijia; Chen, Yiping; Han, Ting
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.00387

Similar Items

MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts
by: Huang, Zilong, et al.
Published: (2025)

UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
by: He, Jun, et al.
Published: (2025)

GenClaw: Code-Driven Agentic Image Generation
by: Ye, Junyan, et al.
Published: (2026)

Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation
by: He, Jun, et al.
Published: (2026)

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
by: Yan, Zhiyuan, et al.
Published: (2025)

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
by: Ye, Junyan, et al.
Published: (2025)

Underwater360: Reconstructing Underwater Scenes from Panoramic Images with Omnidirectional Gaussian Splatting
by: Hu, Jiangbei, et al.
Published: (2026)

ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
by: Zhu, Ruijie, et al.
Published: (2025)

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
by: Sun, Wenqiang, et al.
Published: (2024)

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis
by: Li, Weijia, et al.
Published: (2024)

BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
by: Ye, Junyan, et al.
Published: (2025)

R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding
by: Wu, Qirui, et al.
Published: (2024)

Parallel Cross Strip Attention Network for Single Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)

Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image
by: Qiu, Zidian, et al.
Published: (2026)

Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration
by: Zeng, Jing, et al.
Published: (2024)

Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction
by: Yang, Zhe, et al.
Published: (2026)

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
by: Zhou, Shijie, et al.
Published: (2024)

SatSAM2: Motion-Constrained Video Object Tracking in Satellite Imagery using Promptable SAM2 and Kalman Priors
by: Fan, Ruijie, et al.
Published: (2025)

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
by: Ye, Junyan, et al.
Published: (2025)

Haze-Aware Attention Network for Single-Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)

SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting
by: Li, Wenrui, et al.
Published: (2024)

ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image
by: Gong, Tianyi, et al.
Published: (2025)

Is Your LiDAR Placement Optimized for 3D Scene Understanding?
by: Li, Ye, et al.
Published: (2024)

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
by: Zhou, Haiyang, et al.
Published: (2025)

Vision Transformer with Key-select Routing Attention for Single Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)

Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
by: Zhou, Junsheng, et al.
Published: (2024)

HELIOS: Hierarchical Exploration for Language-Grounded Interaction in Open Scenes
by: Ashton, Katrina, et al.
Published: (2025)

DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
by: Dominici, Edoardo Alberto, et al.
Published: (2025)

FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
by: Ma, Yikun, et al.
Published: (2024)

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage
by: Liu, Jiale, et al.
Published: (2026)

Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image
by: Wang, Yuxuan, et al.
Published: (2025)

Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details
by: Gao, Yifei, et al.
Published: (2025)

Sonic4D: Spatial Audio Generation for Immersive 4D Scene Exploration
by: Xie, Siyi, et al.
Published: (2025)

Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
by: Srivastava, Divyansh, et al.
Published: (2025)

Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes
by: Yang, Xijie, et al.
Published: (2025)

DC-Scene: Data-Centric Learning for 3D Scene Understanding
by: Huang, Ting, et al.
Published: (2025)

CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control
by: Popov, Stefan, et al.
Published: (2025)

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
by: Huang, Yukun, et al.
Published: (2025)

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
by: Chen, Xiaoxue, et al.
Published: (2025)

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
by: Meng, Yanxu, et al.
Published: (2025)