:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Jinzhi, Xiong, Feng, Xu, Mu
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.02202
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer
by: Zhang, Jinzhi, et al.
Published: (2024)

MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control
by: Shao, Mingqi, et al.
Published: (2025)

Flow caching for autoregressive video generation
by: Ma, Yuexiao, et al.
Published: (2026)

HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
by: Chu, Zedong, et al.
Published: (2024)

Predicting 3D representations for Dynamic Scenes
by: Qi, Di, et al.
Published: (2025)

I2V3D: Controllable image-to-video generation with 3D guidance
by: Zhang, Zhiyuan, et al.
Published: (2025)

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
by: Chen, Rui, et al.
Published: (2024)

Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction
by: Xu, Wenhao, et al.
Published: (2024)

Not all tokens contribute equally to diffusion learning
by: Zhang, Guoqing, et al.
Published: (2026)

VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS
by: Meng, Ming, et al.
Published: (2025)

ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
by: Lee, Suyoung, et al.
Published: (2024)

NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction
by: Wang, Hongsheng, et al.
Published: (2024)

RCGDet3D: Rethinking 4D Radar-Camera Fusion-based 3D Object Detection with Enhanced Radar Feature Encoding
by: Xiong, Weiyi, et al.
Published: (2026)

ARM3D: Attention-based relation module for indoor 3D object detection
by: Lan, Yuqing, et al.
Published: (2022)

Byte-level generative predictions for forensics multimedia carving
by: Lee, Jaewon, et al.
Published: (2026)

InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
by: Lin, Chenguo, et al.
Published: (2024)

SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
by: Ren, Junlong, et al.
Published: (2025)

OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation
by: Lin, Yuchen, et al.
Published: (2025)

CEI-3D: Collaborative Explicit-Implicit 3D Reconstruction for Realistic and Fine-Grained Object Editing
by: Shi, Yue, et al.
Published: (2026)

Group Critical-token Policy Optimization for Autoregressive Image Generation
by: Zhang, Guohui, et al.
Published: (2025)

VEDAL: Variational Error-Driven Asynchronous Learning for 3D Gaussian Splatting Pruning
by: Li, Aoduo, et al.
Published: (2026)

CRAG: Can 3D Generative Models Help 3D Assembly?
by: Jiang, Zeyu, et al.
Published: (2026)

RadarGaussianDet3D: Gaussian Representation-based Real-time 3D Object Detection with 4D Automotive Radars
by: Xiong, Weiyi, et al.
Published: (2025)

StereoDETR: Stereo-based Transformer for 3D Object Detection
by: Mu, Shiyi, et al.
Published: (2025)

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping
by: Zhang, Mingxu, et al.
Published: (2025)

Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
by: Xu, Jiuyi, et al.
Published: (2024)

CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
by: Chen, Runjian, et al.
Published: (2022)

COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval
by: Wu, Hao, et al.
Published: (2024)

When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization
by: Ramanujan, Vivek, et al.
Published: (2024)

Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models
by: Wang, Dehui, et al.
Published: (2026)

InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
by: Lin, Chenguo, et al.
Published: (2024)

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
by: Zhang, Bowen, et al.
Published: (2025)

FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction
by: Dai, Yixiang, et al.
Published: (2025)

Visual enhancement and 3D representation for underwater scenes: a review
by: Huang, Guoxi, et al.
Published: (2025)

Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders
by: Guo, Jingyu, et al.
Published: (2025)

B2N3D: Progressive Learning from Binary to N-ary Relationships for 3D Object Grounding
by: Xiao, Feng, et al.
Published: (2025)

Resolving compositional and conformational heterogeneity in cryo-EM with deformable 3D Gaussian representations
by: He, Bintao, et al.
Published: (2025)

Uncertainty-Aware AB3DMOT by Variational 3D Object Detection
by: Oleksiienko, Illia, et al.
Published: (2023)

PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment
by: Xiong, Zhexiao, et al.
Published: (2026)

SDesc3D: Towards Layout-Aware 3D Indoor Scene Generation from Short Descriptions
by: Feng, Jie, et al.
Published: (2026)