:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sang, Shen, Zhi, Tiancheng, Gu, Tianpei, Liu, Jing, Luo, Linjie
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.15496
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning Feature-Preserving Portrait Editing from Generated Pairs
by: Chen, Bowei, et al.
Published: (2024)

ID-Patch: Robust ID Association for Group Photo Personalization
by: Zhang, Yimeng, et al.
Published: (2024)

Video-As-Prompt: Unified Semantic Control for Video Generation
by: Bian, Yuxuan, et al.
Published: (2025)

COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection
by: Xiao, Jinqi, et al.
Published: (2024)

Plan-X: Instruct Video Generation via Semantic Planning
by: Huang, Lun, et al.
Published: (2025)

CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration
by: Deng, Rui, et al.
Published: (2024)

DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
by: Wang, Cong, et al.
Published: (2023)

Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion
by: Luo, Ge Ya, et al.
Published: (2024)

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
by: Luo, Zekai, et al.
Published: (2025)

X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents
by: Song, Guoxian, et al.
Published: (2025)

AtomoVideo: High Fidelity Image-to-Video Generation
by: Gong, Litong, et al.
Published: (2024)

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
by: Zhang, Shilong, et al.
Published: (2025)

PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
by: Li, Hengjia, et al.
Published: (2024)

Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation
by: Chau, Lydia Kin Ching, et al.
Published: (2025)

X-Streamer: Unified Human World Modeling with Audiovisual Interaction
by: Xie, You, et al.
Published: (2025)

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models
by: Shen, Haozhan, et al.
Published: (2026)

CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx
by: Picek, Lukas, et al.
Published: (2025)

LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context
by: Bao, Jingzhi, et al.
Published: (2025)

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)

PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)

X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio
by: Zhang, Chenxu, et al.
Published: (2025)

Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation
by: Li, Yang, et al.
Published: (2024)

Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation
by: Wu, Qingxuan, et al.
Published: (2025)

InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation
by: Xiao, Jinqi, et al.
Published: (2025)

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
by: Zhao, Haoyu, et al.
Published: (2023)

Democratizing High-Fidelity Co-Speech Gesture Video Generation
by: Yang, Xu, et al.
Published: (2025)

Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation
by: Li, Weijie, et al.
Published: (2024)

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
by: Yang, Zhao, et al.
Published: (2025)

Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion
by: Zhang, Lirui, et al.
Published: (2025)

CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning
by: Lin, Zihan, et al.
Published: (2026)

Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction
by: Peng, Rui, et al.
Published: (2024)

CanonSwap: High-Fidelity and Consistent Video Face Swapping via Canonical Space Modulation
by: Luo, Xiangyang, et al.
Published: (2025)

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
by: Gu, Tiancheng, et al.
Published: (2024)

MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)

GFSR: Geometric Fidelity and Spatial Refinement for Reliable Lane Detection
by: Wang, Tiancheng, et al.
Published: (2026)

VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation
by: Chen, Zixuan, et al.
Published: (2024)

FlashFace: Human Image Personalization with High-fidelity Identity Preservation
by: Zhang, Shilong, et al.
Published: (2024)

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
by: Liu, Yufei, et al.
Published: (2024)

UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
by: Liang, Yixun, et al.
Published: (2025)

X-Dancer: Expressive Music to Human Dance Video Generation
by: Chen, Zeyuan, et al.
Published: (2025)