Saved in:
| Main Authors: | Huang, Rui, Shao, Shitong, Zhou, Zikai, Zhao, Pukun, Guo, Hangyu, Ye, Tian, Bai, Lichen, Yang, Shuo, Xie, Zeke |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.05914 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
by: Shao, Shitong, et al.
Published: (2024)
by: Shao, Shitong, et al.
Published: (2024)
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
by: Shao, Shitong, et al.
Published: (2024)
by: Shao, Shitong, et al.
Published: (2024)
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster
by: Shao, Shitong, et al.
Published: (2025)
by: Shao, Shitong, et al.
Published: (2025)
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection
by: Bai, Lichen, et al.
Published: (2024)
by: Bai, Lichen, et al.
Published: (2024)
Golden Noise for Diffusion Models: A Learning Framework
by: Zhou, Zikai, et al.
Published: (2024)
by: Zhou, Zikai, et al.
Published: (2024)
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
by: Li, Haopeng, et al.
Published: (2026)
by: Li, Haopeng, et al.
Published: (2026)
Exploring Data-Free LoRA Transferability for Video Diffusion Models
by: Wang, Yuchen, et al.
Published: (2026)
by: Wang, Yuchen, et al.
Published: (2026)
Optimizing Few-Step Generation with Adaptive Matching Distillation
by: Bai, Lichen, et al.
Published: (2026)
by: Bai, Lichen, et al.
Published: (2026)
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
by: Sun, Zening, et al.
Published: (2026)
by: Sun, Zening, et al.
Published: (2026)
Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation
by: Xie, Dian, et al.
Published: (2026)
by: Xie, Dian, et al.
Published: (2026)
Efficient Video Diffusion Models: Advancements and Challenges
by: Shao, Shitong, et al.
Published: (2026)
by: Shao, Shitong, et al.
Published: (2026)
LIVEditor-14B: Lightning Unified Video Editing via In-Context Sparse Attention
by: Shao, Shitong, et al.
Published: (2026)
by: Shao, Shitong, et al.
Published: (2026)
Reflective Flow Sampling Enhancement
by: Zhou, Zikai, et al.
Published: (2026)
by: Zhou, Zikai, et al.
Published: (2026)
Weak-to-Strong Diffusion with Reflection
by: Bai, Lichen, et al.
Published: (2025)
by: Bai, Lichen, et al.
Published: (2025)
Elucidating the Design Space of Dataset Condensation
by: Shao, Shitong, et al.
Published: (2024)
by: Shao, Shitong, et al.
Published: (2024)
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
by: Liu, Buhua, et al.
Published: (2024)
by: Liu, Buhua, et al.
Published: (2024)
Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization
by: Qi, Zipeng, et al.
Published: (2024)
by: Qi, Zipeng, et al.
Published: (2024)
FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters
by: Shao, Shitong, et al.
Published: (2026)
by: Shao, Shitong, et al.
Published: (2026)
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better
by: Zhao, Ji, et al.
Published: (2026)
by: Zhao, Ji, et al.
Published: (2026)
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis
by: Shao, Shitong, et al.
Published: (2025)
by: Shao, Shitong, et al.
Published: (2025)
Multiphysics Bench: Benchmarking and Investigating Scientific Machine Learning for Multiphysics PDEs
by: Yang, Changfan, et al.
Published: (2025)
by: Yang, Changfan, et al.
Published: (2025)
Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
Learning from Ambiguous Data with Hard Labels
by: Xie, Zeke, et al.
Published: (2025)
by: Xie, Zeke, et al.
Published: (2025)
Rethinking Centered Kernel Alignment in Knowledge Distillation
by: Zhou, Zikai, et al.
Published: (2024)
by: Zhou, Zikai, et al.
Published: (2024)
Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching
by: Shao, Shitong, et al.
Published: (2023)
by: Shao, Shitong, et al.
Published: (2023)
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
by: Yi, Hongwei, et al.
Published: (2025)
by: Yi, Hongwei, et al.
Published: (2025)
Mano: Restriking Manifold Optimization for LLM Training
by: Gu, Yufei, et al.
Published: (2026)
by: Gu, Yufei, et al.
Published: (2026)
Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation
by: Song, Zhao, et al.
Published: (2023)
by: Song, Zhao, et al.
Published: (2023)
SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration
by: Zhang, Yuhang, et al.
Published: (2026)
by: Zhang, Yuhang, et al.
Published: (2026)
Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
by: Shao, Shitong, et al.
Published: (2023)
by: Shao, Shitong, et al.
Published: (2023)
EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer
by: Zhao, Pukun, et al.
Published: (2025)
by: Zhao, Pukun, et al.
Published: (2025)
Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking
by: Sun, Shitong, et al.
Published: (2023)
by: Sun, Shitong, et al.
Published: (2023)
Magic 1-For-1: Generating One Minute Video Clips within One Minute
by: Yi, Hongwei, et al.
Published: (2025)
by: Yi, Hongwei, et al.
Published: (2025)
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
by: Xie, Zikai
Published: (2024)
by: Xie, Zikai
Published: (2024)
Plug-and-Play Fidelity Optimization for Diffusion Transformer Acceleration via Cumulative Error Minimization
by: Shao, Tong, et al.
Published: (2025)
by: Shao, Tong, et al.
Published: (2025)
Provable Acceleration for Diffusion Models under Minimal Assumptions
by: Li, Gen, et al.
Published: (2024)
by: Li, Gen, et al.
Published: (2024)
Memory-Anchored Multimodal Reasoning for Explainable Video Forensics
by: Chen, Chen, et al.
Published: (2025)
by: Chen, Chen, et al.
Published: (2025)
STAR: Mitigating Cascading Errors in Spatial Reasoning via Turn-point Alignment and Segment-level DPO
by: Zhao, Pukun, et al.
Published: (2026)
by: Zhao, Pukun, et al.
Published: (2026)
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation
by: Shen, Zhiqiang, et al.
Published: (2024)
by: Shen, Zhiqiang, et al.
Published: (2024)
Rethinking and Accelerating Graph Condensation: A Training-Free Approach with Class Partition
by: Gao, Xinyi, et al.
Published: (2024)
by: Gao, Xinyi, et al.
Published: (2024)
Similar Items
-
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
by: Shao, Shitong, et al.
Published: (2024) -
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
by: Shao, Shitong, et al.
Published: (2024) -
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster
by: Shao, Shitong, et al.
Published: (2025) -
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection
by: Bai, Lichen, et al.
Published: (2024) -
Golden Noise for Diffusion Models: A Learning Framework
by: Zhou, Zikai, et al.
Published: (2024)