| Main Authors: | Sang, Shen; Zhi, Tiancheng; Gu, Tianpei; Liu, Jing; Luo, Linjie |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2509.15496 |
Similar Items
Learning Feature-Preserving Portrait Editing from Generated Pairs
by: Chen, Bowei, et al.
Published: (2024)
ID-Patch: Robust ID Association for Group Photo Personalization
by: Zhang, Yimeng, et al.
Published: (2024)
Video-As-Prompt: Unified Semantic Control for Video Generation
by: Bian, Yuxuan, et al.
Published: (2025)
COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection
by: Xiao, Jinqi, et al.
Published: (2024)
Plan-X: Instruct Video Generation via Semantic Planning
by: Huang, Lun, et al.
Published: (2025)
CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration
by: Deng, Rui, et al.
Published: (2024)
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
by: Wang, Cong, et al.
Published: (2023)
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion
by: Luo, Ge Ya, et al.
Published: (2024)
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
by: Luo, Zekai, et al.
Published: (2025)
X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents
by: Song, Guoxian, et al.
Published: (2025)
AtomoVideo: High Fidelity Image-to-Video Generation
by: Gong, Litong, et al.
Published: (2024)
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
by: Zhang, Shilong, et al.
Published: (2025)
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
by: Li, Hengjia, et al.
Published: (2024)
Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation
by: Chau, Lydia Kin Ching, et al.
Published: (2025)
X-Streamer: Unified Human World Modeling with Audiovisual Interaction
by: Xie, You, et al.
Published: (2025)
Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models
by: Shen, Haozhan, et al.
Published: (2026)
CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx
by: Picek, Lukas, et al.
Published: (2025)
LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context
by: Bao, Jingzhi, et al.
Published: (2025)
Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)
X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio
by: Zhang, Chenxu, et al.
Published: (2025)
Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation
by: Li, Yang, et al.
Published: (2024)
Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation
by: Wu, Qingxuan, et al.
Published: (2025)
InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation
by: Xiao, Jinqi, et al.
Published: (2025)
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
by: Zhao, Haoyu, et al.
Published: (2023)
Democratizing High-Fidelity Co-Speech Gesture Video Generation
by: Yang, Xu, et al.
Published: (2025)
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation
by: Li, Weijie, et al.
Published: (2024)
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
by: Yang, Zhao, et al.
Published: (2025)
Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation Diffusion
by: Zhang, Lirui, et al.
Published: (2025)
CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning
by: Lin, Zihan, et al.
Published: (2026)
Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction
by: Peng, Rui, et al.
Published: (2024)
CanonSwap: High-Fidelity and Consistent Video Face Swapping via Canonical Space Modulation
by: Luo, Xiangyang, et al.
Published: (2025)
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
by: Gu, Tiancheng, et al.
Published: (2024)
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)
GFSR: Geometric Fidelity and Spatial Refinement for Reliable Lane Detection
by: Wang, Tiancheng, et al.
Published: (2026)
VividDreamer: Towards High-Fidelity and Efficient Text-to-3D Generation
by: Chen, Zixuan, et al.
Published: (2024)
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
by: Zhang, Shilong, et al.
Published: (2024)
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
by: Liu, Yufei, et al.
Published: (2024)
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
by: Liang, Yixun, et al.
Published: (2025)
X-Dancer: Expressive Music to Human Dance Video Generation
by: Chen, Zeyuan, et al.
Published: (2025)