Saved in:
| Main Authors: | Huang, Hanzhuo, Liu, Yuan, Zheng, Ge, Wang, Jiepeng, Dou, Zhiyang, Yang, Sibei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.11697 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement
by: Dai, Qiyuan, et al.
Published: (2025)
by: Dai, Qiyuan, et al.
Published: (2025)
RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation
by: Huang, Hanzhuo, et al.
Published: (2026)
by: Huang, Hanzhuo, et al.
Published: (2026)
Dynamic Realms: 4D Content Analysis, Recovery and Generation with Geometric, Topological and Physical Priors
by: Dou, Zhiyang
Published: (2024)
by: Dou, Zhiyang
Published: (2024)
MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
by: Zhang, Zheng, et al.
Published: (2026)
by: Zhang, Zheng, et al.
Published: (2026)
MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion
by: Zheng, Xin-Yang, et al.
Published: (2024)
by: Zheng, Xin-Yang, et al.
Published: (2024)
Closed-Loop Transfer for Weakly-supervised Affordance Grounding
by: Tang, Jiajin, et al.
Published: (2025)
by: Tang, Jiajin, et al.
Published: (2025)
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
by: Pan, Liang, et al.
Published: (2025)
by: Pan, Liang, et al.
Published: (2025)
SuperFlow: Training Flow Matching Models with RL on the Fly
by: Chen, Kaijie, et al.
Published: (2025)
by: Chen, Kaijie, et al.
Published: (2025)
The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models
by: Shi, Cheng, et al.
Published: (2024)
by: Shi, Cheng, et al.
Published: (2024)
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
by: Qian, Jiaye, et al.
Published: (2025)
by: Qian, Jiaye, et al.
Published: (2025)
AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion
by: Huang, Yangyi, et al.
Published: (2025)
by: Huang, Yangyi, et al.
Published: (2025)
Syn4D: A Multiview Synthetic 4D Dataset
by: Jiang, Zeren, et al.
Published: (2026)
by: Jiang, Zeren, et al.
Published: (2026)
Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors
by: Feng, Yue, et al.
Published: (2024)
by: Feng, Yue, et al.
Published: (2024)
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
by: Qu, Liao, et al.
Published: (2024)
by: Qu, Liao, et al.
Published: (2024)
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own
by: Li, Tiancheng, et al.
Published: (2024)
by: Li, Tiancheng, et al.
Published: (2024)
GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models
by: Gu, Zekai, et al.
Published: (2026)
by: Gu, Zekai, et al.
Published: (2026)
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
by: Li, Peng, et al.
Published: (2024)
by: Li, Peng, et al.
Published: (2024)
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
by: Chen, Wenyue, et al.
Published: (2025)
by: Chen, Wenyue, et al.
Published: (2025)
Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
by: Zheng, Ge, et al.
Published: (2025)
by: Zheng, Ge, et al.
Published: (2025)
UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation
by: Yue, Zhengrong, et al.
Published: (2025)
by: Yue, Zhengrong, et al.
Published: (2025)
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
by: Wang, Chen, et al.
Published: (2025)
by: Wang, Chen, et al.
Published: (2025)
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
by: Gao, Quankai, et al.
Published: (2024)
by: Gao, Quankai, et al.
Published: (2024)
PixelFlow: Pixel-Space Generative Models with Flow
by: Chen, Shoufa, et al.
Published: (2025)
by: Chen, Shoufa, et al.
Published: (2025)
FlowCoMotion: Text-to-Motion Generation via Token-Latent Flow Modeling
by: Guan, Dawei, et al.
Published: (2026)
by: Guan, Dawei, et al.
Published: (2026)
CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation
by: Li, Peng, et al.
Published: (2025)
by: Li, Peng, et al.
Published: (2025)
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer
by: Shen, Ying, et al.
Published: (2025)
by: Shen, Ying, et al.
Published: (2025)
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
by: Dai, Qiyuan, et al.
Published: (2024)
by: Dai, Qiyuan, et al.
Published: (2024)
Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM
by: Dai, Qiyuan, et al.
Published: (2025)
by: Dai, Qiyuan, et al.
Published: (2025)
4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation
by: Yang, Shuzhou, et al.
Published: (2025)
by: Yang, Shuzhou, et al.
Published: (2025)
DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow
by: Lee, Kyungmin, et al.
Published: (2024)
by: Lee, Kyungmin, et al.
Published: (2024)
Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation
by: Alzayer, Hadi, et al.
Published: (2024)
by: Alzayer, Hadi, et al.
Published: (2024)
ModSkill: Physical Character Skill Modularization
by: Huang, Yiming, et al.
Published: (2025)
by: Huang, Yiming, et al.
Published: (2025)
TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs
by: Xu, Chenfan, et al.
Published: (2024)
by: Xu, Chenfan, et al.
Published: (2024)
FlowTok: Flowing Seamlessly Across Text and Image Tokens
by: He, Ju, et al.
Published: (2025)
by: He, Ju, et al.
Published: (2025)
SSRFlow: Semantic-aware Fusion with Spatial Temporal Re-embedding for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)
by: Lu, Zhiyang, et al.
Published: (2024)
Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image
by: Yang, Yuxiao, et al.
Published: (2025)
by: Yang, Yuxiao, et al.
Published: (2025)
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction
by: Fan, Chenyou, et al.
Published: (2025)
by: Fan, Chenyou, et al.
Published: (2025)
QuadLink: Autoregressive Quad-Dominant Mesh Generation via Point-Relation Learning
by: Zhang, Yiheng, et al.
Published: (2026)
by: Zhang, Yiheng, et al.
Published: (2026)
CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos
by: Zhao, Chengfeng, et al.
Published: (2026)
by: Zhao, Chengfeng, et al.
Published: (2026)
Probability-Flow Distillation: Exact Wasserstein Gradient Flow for High-Fidelity 3D Generation
by: Ramanan, Rohith, et al.
Published: (2026)
by: Ramanan, Rohith, et al.
Published: (2026)
Similar Items
-
Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement
by: Dai, Qiyuan, et al.
Published: (2025) -
RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation
by: Huang, Hanzhuo, et al.
Published: (2026) -
Dynamic Realms: 4D Content Analysis, Recovery and Generation with Geometric, Topological and Physical Priors
by: Dou, Zhiyang
Published: (2024) -
MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
by: Zhang, Zheng, et al.
Published: (2026) -
MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion
by: Zheng, Xin-Yang, et al.
Published: (2024)