:: Library Catalog

Bibliographic Details
Main Authors:	Huang, Hanzhuo; Liu, Yuan; Zheng, Ge; Wang, Jiepeng; Dou, Zhiyang; Yang, Sibei
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.11697

Similar Items

Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement
by: Dai, Qiyuan, et al.
Published: (2025)

RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation
by: Huang, Hanzhuo, et al.
Published: (2026)

Dynamic Realms: 4D Content Analysis, Recovery and Generation with Geometric, Topological and Physical Priors
by: Dou, Zhiyang
Published: (2024)

MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
by: Zhang, Zheng, et al.
Published: (2026)

MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion
by: Zheng, Xin-Yang, et al.
Published: (2024)

Closed-Loop Transfer for Weakly-supervised Affordance Grounding
by: Tang, Jiajin, et al.
Published: (2025)

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
by: Pan, Liang, et al.
Published: (2025)

SuperFlow: Training Flow Matching Models with RL on the Fly
by: Chen, Kaijie, et al.
Published: (2025)

The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models
by: Shi, Cheng, et al.
Published: (2024)

Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
by: Qian, Jiaye, et al.
Published: (2025)

AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion
by: Huang, Yangyi, et al.
Published: (2025)

Syn4D: A Multiview Synthetic 4D Dataset
by: Jiang, Zeren, et al.
Published: (2026)

Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors
by: Feng, Yue, et al.
Published: (2024)

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
by: Qu, Liao, et al.
Published: (2024)

Self-Guidance: Boosting Flow and Diffusion Generation on Their Own
by: Li, Tiancheng, et al.
Published: (2024)

GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models
by: Gu, Zekai, et al.
Published: (2026)

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
by: Li, Peng, et al.
Published: (2024)

SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
by: Chen, Wenyue, et al.
Published: (2025)

Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
by: Zheng, Ge, et al.
Published: (2025)

UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation
by: Yue, Zhengrong, et al.
Published: (2025)

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
by: Wang, Chen, et al.
Published: (2025)

GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
by: Gao, Quankai, et al.
Published: (2024)

PixelFlow: Pixel-Space Generative Models with Flow
by: Chen, Shoufa, et al.
Published: (2025)

FlowCoMotion: Text-to-Motion Generation via Token-Latent Flow Modeling
by: Guan, Dawei, et al.
Published: (2026)

CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation
by: Li, Peng, et al.
Published: (2025)

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer
by: Shen, Ying, et al.
Published: (2025)

Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
by: Dai, Qiyuan, et al.
Published: (2024)

Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM
by: Dai, Qiyuan, et al.
Published: (2025)

4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation
by: Yang, Shuzhou, et al.
Published: (2025)

DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow
by: Lee, Kyungmin, et al.
Published: (2024)

Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation
by: Alzayer, Hadi, et al.
Published: (2024)

ModSkill: Physical Character Skill Modularization
by: Huang, Yiming, et al.
Published: (2025)

TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs
by: Xu, Chenfan, et al.
Published: (2024)

FlowTok: Flowing Seamlessly Across Text and Image Tokens
by: He, Ju, et al.
Published: (2025)

SSRFlow: Semantic-aware Fusion with Spatial Temporal Re-embedding for Real-world Scene Flow
by: Lu, Zhiyang, et al.
Published: (2024)

Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image
by: Yang, Yuxiao, et al.
Published: (2025)

Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction
by: Fan, Chenyou, et al.
Published: (2025)

QuadLink: Autoregressive Quad-Dominant Mesh Generation via Point-Relation Learning
by: Zhang, Yiheng, et al.
Published: (2026)

CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos
by: Zhao, Chengfeng, et al.
Published: (2026)

Probability-Flow Distillation: Exact Wasserstein Gradient Flow for High-Fidelity 3D Generation
by: Ramanan, Rohith, et al.
Published: (2026)