Saved in:
| Main Authors: | Zhang, Bowen, Cheng, Yiji, Yang, Jiaolong, Wang, Chunyu, Zhao, Feng, Tang, Yansong, Chen, Dong, Guo, Baining |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.19655 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
by: Zhang, Bowen, et al.
Published: (2024)
by: Zhang, Bowen, et al.
Published: (2024)
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
by: Zhang, Bowen, et al.
Published: (2025)
by: Zhang, Bowen, et al.
Published: (2025)
VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
by: Xu, Sicheng, et al.
Published: (2025)
by: Xu, Sicheng, et al.
Published: (2025)
VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder
by: Tang, Zhicong, et al.
Published: (2023)
by: Tang, Zhicong, et al.
Published: (2023)
Structured 3D Latents for Scalable and Versatile 3D Generation
by: Xiang, Jianfeng, et al.
Published: (2024)
by: Xiang, Jianfeng, et al.
Published: (2024)
Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs
by: Xu, Sicheng, et al.
Published: (2026)
by: Xu, Sicheng, et al.
Published: (2026)
Compact 3D Gaussian Representation for Radiance Field
by: Lee, Joo Chan, et al.
Published: (2023)
by: Lee, Joo Chan, et al.
Published: (2023)
Meta-CoT: Enhancing Granularity and Generalization in Image Editing
by: Zhang, Shiyi, et al.
Published: (2026)
by: Zhang, Shiyi, et al.
Published: (2026)
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
by: Shen, Yichao, et al.
Published: (2025)
by: Shen, Yichao, et al.
Published: (2025)
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
by: Xu, Sicheng, et al.
Published: (2024)
by: Xu, Sicheng, et al.
Published: (2024)
Diffusion Models without Classifier-free Guidance
by: Tang, Zhicong, et al.
Published: (2025)
by: Tang, Zhicong, et al.
Published: (2025)
Editing Implicit and Explicit Representations of Radiance Fields: A Survey
by: Hubert, Arthur, et al.
Published: (2024)
by: Hubert, Arthur, et al.
Published: (2024)
ChatUMM: Robust Context Tracking for Conversational Interleaved Generation
by: Dai, Wenxun, et al.
Published: (2026)
by: Dai, Wenxun, et al.
Published: (2026)
Native and Compact Structured Latents for 3D Generation
by: Xiang, Jianfeng, et al.
Published: (2025)
by: Xiang, Jianfeng, et al.
Published: (2025)
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
by: Zhang, Chubin, et al.
Published: (2024)
by: Zhang, Chubin, et al.
Published: (2024)
Latent Radiance Fields with 3D-aware 2D Representations
by: Zhou, Chaoyi, et al.
Published: (2025)
by: Zhou, Chaoyi, et al.
Published: (2025)
ESGaussianFace: Emotional and Stylized Audio-Driven Facial Animation via 3D Gaussian Splatting
by: Ma, Chuhang, et al.
Published: (2026)
by: Ma, Chuhang, et al.
Published: (2026)
Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem
by: Tang, Zhicong, et al.
Published: (2025)
by: Tang, Zhicong, et al.
Published: (2025)
GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting
by: Dong, Jiajun, et al.
Published: (2025)
by: Dong, Jiajun, et al.
Published: (2025)
WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images
by: Guo, Yansong, et al.
Published: (2025)
by: Guo, Yansong, et al.
Published: (2025)
PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting
by: Hong, Sunghwan, et al.
Published: (2024)
by: Hong, Sunghwan, et al.
Published: (2024)
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
by: Wang, Ruicheng, et al.
Published: (2024)
by: Wang, Ruicheng, et al.
Published: (2024)
Boosting Zero-Shot 3D Style Transfer with 2D Pre-trained Priors
by: Dong, Xin, et al.
Published: (2026)
by: Dong, Xin, et al.
Published: (2026)
PRTGaussian: Efficient Relighting Using 3D Gaussians with Precomputed Radiance Transfer
by: Zhang, Libo, et al.
Published: (2024)
by: Zhang, Libo, et al.
Published: (2024)
Simplified Diffusion Schrödinger Bridge
by: Tang, Zhicong, et al.
Published: (2024)
by: Tang, Zhicong, et al.
Published: (2024)
Map2World: Segment Map Conditioned Text to 3D World Generation
by: Chung, Jaeyoung, et al.
Published: (2026)
by: Chung, Jaeyoung, et al.
Published: (2026)
TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning
by: Feng, ZhiYuan, et al.
Published: (2026)
by: Feng, ZhiYuan, et al.
Published: (2026)
CoSTL: Comprehensive Spatial-Temporal Representation Learning for Moment Retrieval and Highlight Detection
by: Dong, Xin, et al.
Published: (2026)
by: Dong, Xin, et al.
Published: (2026)
Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure
by: Dang, Bowen, et al.
Published: (2024)
by: Dang, Bowen, et al.
Published: (2024)
RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields
by: Jurca, Mihnea-Bogdan, et al.
Published: (2024)
by: Jurca, Mihnea-Bogdan, et al.
Published: (2024)
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations
by: Tang, Zhenyu, et al.
Published: (2025)
by: Tang, Zhenyu, et al.
Published: (2025)
MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields
by: Xiao, Yuru, et al.
Published: (2024)
by: Xiao, Yuru, et al.
Published: (2024)
Augmented Radiance Field: A General Framework for Enhanced Gaussian Splatting
by: Yang, Yixin, et al.
Published: (2026)
by: Yang, Yixin, et al.
Published: (2026)
From Implicit Ambiguity to Explicit Solidity: Diagnosing Interior Geometric Degradation in Neural Radiance Fields for Dense 3D Scene Understanding
by: Zhao, Jiangsan, et al.
Published: (2026)
by: Zhao, Jiangsan, et al.
Published: (2026)
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
by: He, Runze, et al.
Published: (2026)
by: He, Runze, et al.
Published: (2026)
HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
by: Liang, Huizhi, et al.
Published: (2026)
by: Liang, Huizhi, et al.
Published: (2026)
Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields
by: Lee, Joo Chan, et al.
Published: (2024)
by: Lee, Joo Chan, et al.
Published: (2024)
Dynamic 2D Gaussians: Geometrically Accurate Radiance Fields for Dynamic Objects
by: Zhang, Shuai, et al.
Published: (2024)
by: Zhang, Shuai, et al.
Published: (2024)
Explicit Correspondence Matching for Generalizable Neural Radiance Fields
by: Chen, Yuedong, et al.
Published: (2023)
by: Chen, Yuedong, et al.
Published: (2023)
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
by: Wang, Yanhui, et al.
Published: (2023)
by: Wang, Yanhui, et al.
Published: (2023)
Similar Items
-
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
by: Zhang, Bowen, et al.
Published: (2024) -
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
by: Zhang, Bowen, et al.
Published: (2025) -
VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
by: Xu, Sicheng, et al.
Published: (2025) -
VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder
by: Tang, Zhicong, et al.
Published: (2023) -
Structured 3D Latents for Scalable and Versatile 3D Generation
by: Xiang, Jianfeng, et al.
Published: (2024)