:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiao, Guanlong, Zhang, Chenyangguang, Xian, Jia Jun Cheng, Zhang, Zewei, Liao, Renjie
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.21466
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TrajLoom: Dense Future Trajectory Generation from Video
by: Zhang, Zewei, et al.
Published: (2026)

UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
by: Jiao, Guanlong, et al.
Published: (2025)

StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
by: Wu, Zike, et al.
Published: (2025)

A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control
by: Lin, Jiawei, et al.
Published: (2025)

Streaming Video Diffusion: Online Video Editing with Diffusion Models
by: Chen, Feng, et al.
Published: (2024)

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
by: Sun, Jiakai, et al.
Published: (2024)

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
by: Wu, Bin, et al.
Published: (2026)

Learning Streaming Video Representation via Multitask Training
by: Yan, Yibin, et al.
Published: (2025)

Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
by: Jiao, Guanlong, et al.
Published: (2024)

Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video
by: Zhang, Yulin, et al.
Published: (2025)

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer
by: Zhao, Yuyang, et al.
Published: (2026)

StreamChat: Chatting with Streaming Video
by: Liu, Jihao, et al.
Published: (2024)

DeformStream: Deformation-based Adaptive Volumetric Video Streaming
by: Li, Boyan, et al.
Published: (2024)

Stream-T1: Test-Time Scaling for Streaming Video Generation
by: Tu, Yijing, et al.
Published: (2026)

Streaming Video Instruction Tuning
by: Xia, Jiaer, et al.
Published: (2025)

Test-Time Training on Video Streams
by: Wang, Renhao, et al.
Published: (2023)

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
by: Qiu, Haonan, et al.
Published: (2025)

EvoStreaming: Your Offline Video Model Is a Natively Streaming Assistant
by: Wen, Zichen, et al.
Published: (2026)

LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
by: Liao, Zhenyi, et al.
Published: (2023)

Streaming Autoregressive Video Generation via Diagonal Distillation
by: Liu, Jinxiu, et al.
Published: (2026)

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
by: Luo, Yawen, et al.
Published: (2026)

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
by: Wang, Yiyu, et al.
Published: (2025)

Thinking in Streaming Video
by: Liu, Zikang, et al.
Published: (2026)

StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding
by: Yang, Haolin, et al.
Published: (2025)

Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
by: Di, Shangzhe, et al.
Published: (2025)

StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025)

Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation
by: Li, Ruibin, et al.
Published: (2026)

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization
by: Ding, Zihan, et al.
Published: (2024)

OmniHumanoid: Streaming Cross-Embodiment Video Generation with Paired-Free Adaptation
by: Song, Yiren, et al.
Published: (2026)

An Efficient Streaming Video Understanding Framework with Agentic Control
by: Liu, Jinming, et al.
Published: (2026)

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
by: Lu, Yunhong, et al.
Published: (2025)

WeaveTime: Stream from Earlier Frames into Emergent Memory in VideoLLMs
by: Zhang, Yulin, et al.
Published: (2026)

VideoLLM-online: Online Video Large Language Model for Streaming Video
by: Chen, Joya, et al.
Published: (2024)

Controllable Egocentric Video Generation via Occlusion-Aware Sparse 3D Hand Joints
by: Zhang, Chenyangguang, et al.
Published: (2026)

DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
by: Lv, Zhengyao, et al.
Published: (2026)

KGEdit: Ambiguity-Aware Knowledge Graphs for Training-Free Precise Video Generation and Editing
by: Cai, Mingshu, et al.
Published: (2026)

StreamingEffect: Real-Time Human-Centric Video Effect Generation
by: Song, Yiren, et al.
Published: (2026)

MotionStream: Real-Time Video Generation with Interactive Motion Controls
by: Shin, Joonghyuk, et al.
Published: (2025)

MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision
by: Zhang, Chenyangguang, et al.
Published: (2023)

Streaming Dense Video Captioning
by: Zhou, Xingyi, et al.
Published: (2024)