Saved in:
| Main Authors: | Zhao, Haoyu, Zhang, Yuang, Cheng, Junqi, Gu, Jiaxi, Lu, Zenghui, Shu, Peng, Wu, Zuxuan, Jiang, Yu-Gang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.13637 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ShoulderShot: Generating Over-the-Shoulder Dialogue Videos
by: Zhang, Yuang, et al.
Published: (2025)
by: Zhang, Yuang, et al.
Published: (2025)
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation
by: Zhao, Haoyu, et al.
Published: (2026)
by: Zhao, Haoyu, et al.
Published: (2026)
CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
by: Zhao, Haoyu, et al.
Published: (2026)
by: Zhao, Haoyu, et al.
Published: (2026)
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
by: Zhao, Haoyu, et al.
Published: (2023)
by: Zhao, Haoyu, et al.
Published: (2023)
Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives
by: Zhao, Haoyu, et al.
Published: (2025)
by: Zhao, Haoyu, et al.
Published: (2025)
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
by: Zhang, Xing, et al.
Published: (2024)
by: Zhang, Xing, et al.
Published: (2024)
GenRec: Unifying Video Generation and Recognition with Diffusion Models
by: Weng, Zejia, et al.
Published: (2024)
by: Weng, Zejia, et al.
Published: (2024)
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
by: Zhang, Yuang, et al.
Published: (2024)
by: Zhang, Yuang, et al.
Published: (2024)
Diffusion Generative Modelling for Divide-and-Conquer MCMC
by: Trojan, C., et al.
Published: (2024)
by: Trojan, C., et al.
Published: (2024)
Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery
by: Tu, Yuanpeng, et al.
Published: (2024)
by: Tu, Yuanpeng, et al.
Published: (2024)
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
by: Lu, Tianyi, et al.
Published: (2023)
by: Lu, Tianyi, et al.
Published: (2023)
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
by: Xing, Zhen, et al.
Published: (2024)
by: Xing, Zhen, et al.
Published: (2024)
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
by: Wang, Junke, et al.
Published: (2024)
by: Wang, Junke, et al.
Published: (2024)
Divide, Conquer and Unite: Hierarchical Style-Recalibrated Prototype Alignment for Federated Medical Segmentation
by: Zhao, Xingyue, et al.
Published: (2025)
by: Zhao, Xingyue, et al.
Published: (2025)
Divide and Conquer: Multimodal Video Deepfake Detection via Cross-Modal Fusion and Localization
by: Li, Qingcao, et al.
Published: (2026)
by: Li, Qingcao, et al.
Published: (2026)
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
by: Cui, Wendi, et al.
Published: (2024)
by: Cui, Wendi, et al.
Published: (2024)
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
by: Wang, Yanhui, et al.
Published: (2023)
by: Wang, Yanhui, et al.
Published: (2023)
DCARL: A Divide-and-Conquer Framework for Autoregressive Long-Trajectory Video Generation
by: Ouyang, Junyi, et al.
Published: (2026)
by: Ouyang, Junyi, et al.
Published: (2026)
A Survey on Video Diffusion Models
by: Xing, Zhen, et al.
Published: (2023)
by: Xing, Zhen, et al.
Published: (2023)
Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection
by: Kang, Xiaolu, et al.
Published: (2026)
by: Kang, Xiaolu, et al.
Published: (2026)
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
by: Zhang, Zihao, et al.
Published: (2025)
by: Zhang, Zihao, et al.
Published: (2025)
GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting
by: Feng, Qijun, et al.
Published: (2024)
by: Feng, Qijun, et al.
Published: (2024)
Control Large Language Models via Divide and Conquer
by: Li, Bingxuan, et al.
Published: (2024)
by: Li, Bingxuan, et al.
Published: (2024)
Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors
by: Janati, Yazid, et al.
Published: (2024)
by: Janati, Yazid, et al.
Published: (2024)
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
by: Zhang, Hui, et al.
Published: (2023)
by: Zhang, Hui, et al.
Published: (2023)
OmniVid: A Generative Framework for Universal Video Understanding
by: Wang, Junke, et al.
Published: (2024)
by: Wang, Junke, et al.
Published: (2024)
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
by: Wang, Cong, et al.
Published: (2024)
by: Wang, Cong, et al.
Published: (2024)
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer
by: Zhang, Lu, et al.
Published: (2024)
by: Zhang, Lu, et al.
Published: (2024)
DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection
by: Zhang, Hui, et al.
Published: (2023)
by: Zhang, Hui, et al.
Published: (2023)
Accurate and Scalable Matrix Mechanisms via Divide and Conquer
by: He, Guanlin, et al.
Published: (2026)
by: He, Guanlin, et al.
Published: (2026)
Progressive Divide-and-Conquer via Subsampling Decomposition for Accelerated MRI
by: Wang, Chong, et al.
Published: (2024)
by: Wang, Chong, et al.
Published: (2024)
Divide and Conquer: Accelerating Diffusion-Based Large Language Models via Adaptive Parallel Decoding
by: Luo, Xiangzhong, et al.
Published: (2026)
by: Luo, Xiangzhong, et al.
Published: (2026)
Divide-or-Conquer? Which Part Should You Distill Your LLM?
by: Wu, Zhuofeng, et al.
Published: (2024)
by: Wu, Zhuofeng, et al.
Published: (2024)
Structured Divide-and-Conquer for the Definite Generalized Eigenvalue Problem
by: Demmel, James, et al.
Published: (2025)
by: Demmel, James, et al.
Published: (2025)
Recursive Decomposition with Dependencies for Generic Divide-and-Conquer Reasoning
by: Hernández-Gutiérrez, Sergio, et al.
Published: (2025)
by: Hernández-Gutiérrez, Sergio, et al.
Published: (2025)
CoMP: Continual Multimodal Pre-training for Vision Foundation Models
by: Chen, Yitong, et al.
Published: (2025)
by: Chen, Yitong, et al.
Published: (2025)
Learning Accurate Segmentation Purely from Self-Supervision
by: You, Zuyao, et al.
Published: (2026)
by: You, Zuyao, et al.
Published: (2026)
Decomposition-Based Synthesis for Applying Divide-and-Conquer-Like Algorithmic Paradigms
by: Ji, Ruyi, et al.
Published: (2022)
by: Ji, Ruyi, et al.
Published: (2022)
Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model
by: Lin, Yu-Fan, et al.
Published: (2024)
by: Lin, Yu-Fan, et al.
Published: (2024)
Dividing and Conquering the Van Vleck Catastrophe
by: Simon, Sophia, et al.
Published: (2025)
by: Simon, Sophia, et al.
Published: (2025)
Similar Items
-
ShoulderShot: Generating Over-the-Shoulder Dialogue Videos
by: Zhang, Yuang, et al.
Published: (2025) -
CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation
by: Zhao, Haoyu, et al.
Published: (2026) -
CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
by: Zhao, Haoyu, et al.
Published: (2026) -
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
by: Zhao, Haoyu, et al.
Published: (2023) -
Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives
by: Zhao, Haoyu, et al.
Published: (2025)