Saved in:
| Main Authors: | Huang, Zilong, He, Jun, Ye, Junyan, Jiang, Lihan, Li, Weijia, Chen, Yiping, Han, Ting |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.00387 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts
by: Huang, Zilong, et al.
Published: (2025)
by: Huang, Zilong, et al.
Published: (2025)
UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
by: He, Jun, et al.
Published: (2025)
by: He, Jun, et al.
Published: (2025)
GenClaw: Code-Driven Agentic Image Generation
by: Ye, Junyan, et al.
Published: (2026)
by: Ye, Junyan, et al.
Published: (2026)
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation
by: He, Jun, et al.
Published: (2026)
by: He, Jun, et al.
Published: (2026)
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
by: Yan, Zhiyuan, et al.
Published: (2025)
by: Yan, Zhiyuan, et al.
Published: (2025)
Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
by: Ye, Junyan, et al.
Published: (2025)
by: Ye, Junyan, et al.
Published: (2025)
Underwater360: Reconstructing Underwater Scenes from Panoramic Images with Omnidirectional Gaussian Splatting
by: Hu, Jiangbei, et al.
Published: (2026)
by: Hu, Jiangbei, et al.
Published: (2026)
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
by: Zhu, Ruijie, et al.
Published: (2025)
by: Zhu, Ruijie, et al.
Published: (2025)
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
by: Sun, Wenqiang, et al.
Published: (2024)
by: Sun, Wenqiang, et al.
Published: (2024)
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis
by: Li, Weijia, et al.
Published: (2024)
by: Li, Weijia, et al.
Published: (2024)
BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
by: Ye, Junyan, et al.
Published: (2025)
by: Ye, Junyan, et al.
Published: (2025)
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding
by: Wu, Qirui, et al.
Published: (2024)
by: Wu, Qirui, et al.
Published: (2024)
Parallel Cross Strip Attention Network for Single Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)
by: Tong, Lihan, et al.
Published: (2024)
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image
by: Qiu, Zidian, et al.
Published: (2026)
by: Qiu, Zidian, et al.
Published: (2026)
Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration
by: Zeng, Jing, et al.
Published: (2024)
by: Zeng, Jing, et al.
Published: (2024)
Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction
by: Yang, Zhe, et al.
Published: (2026)
by: Yang, Zhe, et al.
Published: (2026)
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
by: Zhou, Shijie, et al.
Published: (2024)
by: Zhou, Shijie, et al.
Published: (2024)
SatSAM2: Motion-Constrained Video Object Tracking in Satellite Imagery using Promptable SAM2 and Kalman Priors
by: Fan, Ruijie, et al.
Published: (2025)
by: Fan, Ruijie, et al.
Published: (2025)
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
by: Ye, Junyan, et al.
Published: (2025)
by: Ye, Junyan, et al.
Published: (2025)
Haze-Aware Attention Network for Single-Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)
by: Tong, Lihan, et al.
Published: (2024)
SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting
by: Li, Wenrui, et al.
Published: (2024)
by: Li, Wenrui, et al.
Published: (2024)
ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image
by: Gong, Tianyi, et al.
Published: (2025)
by: Gong, Tianyi, et al.
Published: (2025)
Is Your LiDAR Placement Optimized for 3D Scene Understanding?
by: Li, Ye, et al.
Published: (2024)
by: Li, Ye, et al.
Published: (2024)
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation
by: Zhou, Haiyang, et al.
Published: (2025)
by: Zhou, Haiyang, et al.
Published: (2025)
Vision Transformer with Key-select Routing Attention for Single Image Dehazing
by: Tong, Lihan, et al.
Published: (2024)
by: Tong, Lihan, et al.
Published: (2024)
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
by: Zhou, Junsheng, et al.
Published: (2024)
by: Zhou, Junsheng, et al.
Published: (2024)
HELIOS: Hierarchical Exploration for Language-Grounded Interaction in Open Scenes
by: Ashton, Katrina, et al.
Published: (2025)
by: Ashton, Katrina, et al.
Published: (2025)
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
by: Dominici, Edoardo Alberto, et al.
Published: (2025)
by: Dominici, Edoardo Alberto, et al.
Published: (2025)
FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
by: Ma, Yikun, et al.
Published: (2024)
by: Ma, Yikun, et al.
Published: (2024)
CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage
by: Liu, Jiale, et al.
Published: (2026)
by: Liu, Jiale, et al.
Published: (2026)
Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image
by: Wang, Yuxuan, et al.
Published: (2025)
by: Wang, Yuxuan, et al.
Published: (2025)
Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details
by: Gao, Yifei, et al.
Published: (2025)
by: Gao, Yifei, et al.
Published: (2025)
Sonic4D: Spatial Audio Generation for Immersive 4D Scene Exploration
by: Xie, Siyi, et al.
Published: (2025)
by: Xie, Siyi, et al.
Published: (2025)
Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers
by: Srivastava, Divyansh, et al.
Published: (2025)
by: Srivastava, Divyansh, et al.
Published: (2025)
Virtualized 3D Gaussians: Flexible Cluster-based Level-of-Detail System for Real-Time Rendering of Composed Scenes
by: Yang, Xijie, et al.
Published: (2025)
by: Yang, Xijie, et al.
Published: (2025)
DC-Scene: Data-Centric Learning for 3D Scene Understanding
by: Huang, Ting, et al.
Published: (2025)
by: Huang, Ting, et al.
Published: (2025)
CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control
by: Popov, Stefan, et al.
Published: (2025)
by: Popov, Stefan, et al.
Published: (2025)
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
by: Huang, Yukun, et al.
Published: (2025)
by: Huang, Yukun, et al.
Published: (2025)
DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
by: Chen, Xiaoxue, et al.
Published: (2025)
by: Chen, Xiaoxue, et al.
Published: (2025)
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
by: Meng, Yanxu, et al.
Published: (2025)
by: Meng, Yanxu, et al.
Published: (2025)
Similar Items
-
MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts
by: Huang, Zilong, et al.
Published: (2025) -
UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
by: He, Jun, et al.
Published: (2025) -
GenClaw: Code-Driven Agentic Image Generation
by: Ye, Junyan, et al.
Published: (2026) -
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation
by: He, Jun, et al.
Published: (2026) -
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
by: Yan, Zhiyuan, et al.
Published: (2025)