| Main Authors: | Tanveer, Maham, Wang, Yizhi, Wang, Ruiqi, Zhao, Nanxuan, Mahdavi-Amiri, Ali, Zhang, Hao |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.03549 |
Similar Items
MotionBridge: Dynamic Video Inbetweening with Flexible Controls
by: Tanveer, Maham, et al.
Published: (2024)
MultiCOIN: Multi-Modal COntrollable Video INbetweening
by: Tanveer, Maham, et al.
Published: (2025)
BRICS: Bi-level feature Representation of Image CollectionS
by: Yang, Dingdong, et al.
Published: (2023)
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation
by: Yang, Dingdong, et al.
Published: (2024)
SweepNet: Unsupervised Learning Shape Abstraction via Neural Sweepers
by: Zhao, Mingrui, et al.
Published: (2024)
EASI-Tex: Edge-Aware Mesh Texturing from Single Image
by: Perla, Sai Raj Kishore, et al.
Published: (2024)
Advances in 4D Representation: Geometry, Motion, and Interaction
by: Zhao, Mingrui, et al.
Published: (2025)
Functionalization via Structure Completion and Motion Rectification
by: Zhao, Mingrui, et al.
Published: (2026)
In-2-4D: Inbetweening from Two Single-View Images to 4D Generation
by: Nag, Sauradip, et al.
Published: (2025)
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
by: Liu, Yuhao, et al.
Published: (2024)
Advances in Neural 3D Mesh Texturing: A Survey
by: Perla, Sai Raj Kishore, et al.
Published: (2026)
pOps: Photo-Inspired Diffusion Operators
by: Richardson, Elad, et al.
Published: (2024)
GroupDiff: Diffusion-based Group Portrait Editing
by: Jiang, Yuming, et al.
Published: (2024)
CREward: A Type-Specific Creativity Reward Model
by: Han, Jiyeon, et al.
Published: (2025)
Survey on Modeling of Human-made Articulated Objects
by: Liu, Jiayi, et al.
Published: (2024)
ASIA: Adaptive 3D Segmentation using Few Image Annotations
by: Perla, Sai Raj Kishore, et al.
Published: (2025)
FiGO: Fine-Grained Object Counting without Annotations
by: D'Alessandro, Adriano, et al.
Published: (2025)
AFreeCA: Annotation-Free Counting for All
by: D'Alessandro, Adriano, et al.
Published: (2024)
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
by: Chen, Zigeng, et al.
Published: (2024)
Video Analysis and Generation via a Semantic Progress Function
by: Metzer, Gal, et al.
Published: (2026)
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
by: Zhang, Peiying, et al.
Published: (2025)
GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction
by: Yan, Haodong, et al.
Published: (2023)
Sound Sparks Motion: Audio and Text Tuning for Video Editing
by: Razlighi, AmirHossein Naghi, et al.
Published: (2026)
ACT-R: Adaptive Camera Trajectories for Single View 3D Reconstruction
by: Wang, Yizhi, et al.
Published: (2025)
CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense
by: Zhang, Mingkun, et al.
Published: (2024)
FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising
by: Chen, Zhihao, et al.
Published: (2025)
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention
by: Zhao, Xiaochen, et al.
Published: (2025)
CAGE: Controllable Articulation GEneration
by: Liu, Jiayi, et al.
Published: (2023)
DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising
by: Gao, Li, et al.
Published: (2025)
In-Context Sync-LoRA for Portrait Video Editing
by: Polaczek, Sagi, et al.
Published: (2025)
DisMo: Disentangled Motion Representations for Open-World Motion Transfer
by: Ressler-Antal, Thomas, et al.
Published: (2025)
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
by: Zhai, Yuanhao, et al.
Published: (2024)
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
by: Kim, Seyeon, et al.
Published: (2024)
Untwisting RoPE: Frequency Control for Shared Attention in DiTs
by: Mikaeili, Aryan, et al.
Published: (2026)
Griffin: Generative Reference and Layout Guided Image Composition
by: Mikaeili, Aryan, et al.
Published: (2025)
GeoDiffMM: Geometry-Guided Conditional Diffusion for Motion Magnification
by: Liu, Xuedeng, et al.
Published: (2025)
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
by: Gao, Jiayi, et al.
Published: (2025)
DiffDenoise: Self-Supervised Medical Image Denoising with Conditional Diffusion Models
by: Demir, Basar, et al.
Published: (2025)
MoCHA: Denoising Caption Supervision for Motion-Text Retrieval
by: Warner, Nikolai, et al.
Published: (2026)
FairDiff: Fair Segmentation with Point-Image Diffusion
by: Li, Wenyi, et al.
Published: (2024)