Saved in:
| Main Authors: | Zhang, Jinzhi, Xiong, Feng, Xu, Mu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.02202 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer
by: Zhang, Jinzhi, et al.
Published: (2024)
by: Zhang, Jinzhi, et al.
Published: (2024)
MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control
by: Shao, Mingqi, et al.
Published: (2025)
by: Shao, Mingqi, et al.
Published: (2025)
Flow caching for autoregressive video generation
by: Ma, Yuexiao, et al.
Published: (2026)
by: Ma, Yuexiao, et al.
Published: (2026)
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
by: Chu, Zedong, et al.
Published: (2024)
by: Chu, Zedong, et al.
Published: (2024)
Predicting 3D representations for Dynamic Scenes
by: Qi, Di, et al.
Published: (2025)
by: Qi, Di, et al.
Published: (2025)
I2V3D: Controllable image-to-video generation with 3D guidance
by: Zhang, Zhiyuan, et al.
Published: (2025)
by: Zhang, Zhiyuan, et al.
Published: (2025)
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
by: Chen, Rui, et al.
Published: (2024)
by: Chen, Rui, et al.
Published: (2024)
Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction
by: Xu, Wenhao, et al.
Published: (2024)
by: Xu, Wenhao, et al.
Published: (2024)
Not all tokens contribute equally to diffusion learning
by: Zhang, Guoqing, et al.
Published: (2026)
by: Zhang, Guoqing, et al.
Published: (2026)
VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS
by: Meng, Ming, et al.
Published: (2025)
by: Meng, Ming, et al.
Published: (2025)
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
by: Lee, Suyoung, et al.
Published: (2024)
by: Lee, Suyoung, et al.
Published: (2024)
NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction
by: Wang, Hongsheng, et al.
Published: (2024)
by: Wang, Hongsheng, et al.
Published: (2024)
RCGDet3D: Rethinking 4D Radar-Camera Fusion-based 3D Object Detection with Enhanced Radar Feature Encoding
by: Xiong, Weiyi, et al.
Published: (2026)
by: Xiong, Weiyi, et al.
Published: (2026)
ARM3D: Attention-based relation module for indoor 3D object detection
by: Lan, Yuqing, et al.
Published: (2022)
by: Lan, Yuqing, et al.
Published: (2022)
Byte-level generative predictions for forensics multimedia carving
by: Lee, Jaewon, et al.
Published: (2026)
by: Lee, Jaewon, et al.
Published: (2026)
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
by: Lin, Chenguo, et al.
Published: (2024)
by: Lin, Chenguo, et al.
Published: (2024)
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
by: Ren, Junlong, et al.
Published: (2025)
by: Ren, Junlong, et al.
Published: (2025)
OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation
by: Lin, Yuchen, et al.
Published: (2025)
by: Lin, Yuchen, et al.
Published: (2025)
CEI-3D: Collaborative Explicit-Implicit 3D Reconstruction for Realistic and Fine-Grained Object Editing
by: Shi, Yue, et al.
Published: (2026)
by: Shi, Yue, et al.
Published: (2026)
Group Critical-token Policy Optimization for Autoregressive Image Generation
by: Zhang, Guohui, et al.
Published: (2025)
by: Zhang, Guohui, et al.
Published: (2025)
VEDAL: Variational Error-Driven Asynchronous Learning for 3D Gaussian Splatting Pruning
by: Li, Aoduo, et al.
Published: (2026)
by: Li, Aoduo, et al.
Published: (2026)
CRAG: Can 3D Generative Models Help 3D Assembly?
by: Jiang, Zeyu, et al.
Published: (2026)
by: Jiang, Zeyu, et al.
Published: (2026)
RadarGaussianDet3D: Gaussian Representation-based Real-time 3D Object Detection with 4D Automotive Radars
by: Xiong, Weiyi, et al.
Published: (2025)
by: Xiong, Weiyi, et al.
Published: (2025)
StereoDETR: Stereo-based Transformer for 3D Object Detection
by: Mu, Shiyi, et al.
Published: (2025)
by: Mu, Shiyi, et al.
Published: (2025)
SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping
by: Zhang, Mingxu, et al.
Published: (2025)
by: Zhang, Mingxu, et al.
Published: (2025)
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
by: Xu, Jiuyi, et al.
Published: (2024)
by: Xu, Jiuyi, et al.
Published: (2024)
CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
by: Chen, Runjian, et al.
Published: (2022)
by: Chen, Runjian, et al.
Published: (2022)
COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval
by: Wu, Hao, et al.
Published: (2024)
by: Wu, Hao, et al.
Published: (2024)
When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization
by: Ramanujan, Vivek, et al.
Published: (2024)
by: Ramanujan, Vivek, et al.
Published: (2024)
Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models
by: Wang, Dehui, et al.
Published: (2026)
by: Wang, Dehui, et al.
Published: (2026)
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
by: Lin, Chenguo, et al.
Published: (2024)
by: Lin, Chenguo, et al.
Published: (2024)
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
by: Zhang, Bowen, et al.
Published: (2025)
by: Zhang, Bowen, et al.
Published: (2025)
FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction
by: Dai, Yixiang, et al.
Published: (2025)
by: Dai, Yixiang, et al.
Published: (2025)
Visual enhancement and 3D representation for underwater scenes: a review
by: Huang, Guoxi, et al.
Published: (2025)
by: Huang, Guoxi, et al.
Published: (2025)
Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders
by: Guo, Jingyu, et al.
Published: (2025)
by: Guo, Jingyu, et al.
Published: (2025)
B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding
by: Xiao, Feng, et al.
Published: (2025)
by: Xiao, Feng, et al.
Published: (2025)
Resolving compositional and conformational heterogeneity in cryo-EM with deformable 3D Gaussian representations
by: He, Bintao, et al.
Published: (2025)
by: He, Bintao, et al.
Published: (2025)
Uncertainty-Aware AB3DMOT by Variational 3D Object Detection
by: Oleksiienko, Illia, et al.
Published: (2023)
by: Oleksiienko, Illia, et al.
Published: (2023)
PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment
by: Xiong, Zhexiao, et al.
Published: (2026)
by: Xiong, Zhexiao, et al.
Published: (2026)
SDesc3D: Towards Layout-Aware 3D Indoor Scene Generation from Short Descriptions
by: Feng, Jie, et al.
Published: (2026)
by: Feng, Jie, et al.
Published: (2026)
Similar Items
-
G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer
by: Zhang, Jinzhi, et al.
Published: (2024) -
MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control
by: Shao, Mingqi, et al.
Published: (2025) -
Flow caching for autoregressive video generation
by: Ma, Yuexiao, et al.
Published: (2026) -
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
by: Chu, Zedong, et al.
Published: (2024) -
Predicting 3D representations for Dynamic Scenes
by: Qi, Di, et al.
Published: (2025)