:: Library Catalog

Bibliographic Details
Main Authors:	Feng, Huidong; Chen, Wentao; Chen, Jie; Cai, Xinqi; Ma, Ruolong; Zheng, Yinglin; Lin, Yuxin; Zeng, Ming
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2606.00101

Similar Items

HairShifter: Consistent and High-Fidelity Video Hair Transfer via Anchor-Guided Animation
by: Shi, Wangzheng, et al.
Published: (2025)

Chameleon: Benchmarking Detection and Backtracking on Commercial-Grade AI-Generated Videos
by: Liao, Xingming, et al.
Published: (2025)

Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
by: Lin, Yuxin, et al.
Published: (2025)

RISE-T2V: Rephrasing and Injecting Semantics with LLM for Expansive Text-to-Video Generation
by: Zhang, Xiangjun, et al.
Published: (2025)

Towards Real-world Video Face Restoration: A New Benchmark
by: Chen, Ziyan, et al.
Published: (2024)

CoAgent: Collaborative Planning and Consistency Agent for Coherent Video Generation
by: Zeng, Qinglin, et al.
Published: (2025)

AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization
by: He, Dailan, et al.
Published: (2026)

LoCoT2V-Bench: Benchmarking Long-Form and Complex Text-to-Video Generation
by: Zheng, Xiangqing, et al.
Published: (2025)

VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool
by: Wang, Yan, et al.
Published: (2024)

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
by: Zhang, Zicheng, et al.
Published: (2024)

VideoDiff: Human-AI Video Co-Creation with Alternatives
by: Huh, Mina, et al.
Published: (2025)

DDI-CoCo: A Dataset For Understanding The Effect Of Color Contrast In Machine-Assisted Skin Disease Detection
by: Chiu, Ming-Chang, et al.
Published: (2024)

Achieving High Efficiency And Enhanced Beam Quality In Laser Wakefield Acceleration
by: Wang, Jia, et al.
Published: (2025)

GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
by: Chen, Weiliang, et al.
Published: (2025)

Democratizing High-Fidelity Co-Speech Gesture Video Generation
by: Yang, Xu, et al.
Published: (2025)

Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
by: Yang, Tao, et al.
Published: (2024)

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
by: Wang, Jun, et al.
Published: (2025)

VABench: A Comprehensive Benchmark for Audio-Video Generation
by: Hua, Daili, et al.
Published: (2025)

MIMOSA: Human-AI Co-Creation of Computational Spatial Audio Effects on Videos
by: Ning, Zheng, et al.
Published: (2024)

PodReels: Human-AI Co-Creation of Video Podcast Teasers
by: Wang, Sitong, et al.
Published: (2023)

Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning
by: Li, Yifei, et al.
Published: (2025)

Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model
by: Wu, Peng, et al.
Published: (2023)

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions
by: Xue, Zhucun, et al.
Published: (2025)

Video-CoM: Interactive Video Reasoning via Chain of Manipulations
by: Rasheed, Hanoona, et al.
Published: (2025)

VideoCoF: Unified Video Editing with Temporal Reasoner
by: Yang, Xiangpeng, et al.
Published: (2025)

DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
by: Chen, Haoxing, et al.
Published: (2024)

FLARE: Full-Modality Long-Video Audiovisual Retrieval Benchmark with User-Simulated Queries
by: You, Qijie, et al.
Published: (2026)

Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models
by: Chen, Shimin, et al.
Published: (2024)

TraceAV-Bench: Benchmarking Multi-Hop Trajectory Reasoning over Long Audio-Visual Videos
by: Feng, Hengyi, et al.
Published: (2026)

Training-free Video Temporal Grounding using Large-scale Pre-trained Models
by: Zheng, Minghang, et al.
Published: (2024)

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation
by: Luo, Xiangyang, et al.
Published: (2026)

Video-CoE: Reinforcing Video Event Prediction via Chain of Events
by: Su, Qile, et al.
Published: (2026)

EVQAScore: A Fine-grained Metric for Video Question Answering Data Quality Evaluation
by: Liang, Hao, et al.
Published: (2024)

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
by: Zi, Bojia, et al.
Published: (2024)

Beyond Rating: A Comprehensive Evaluation and Benchmark for AI Reviews
by: Li, Bowen, et al.
Published: (2026)

BVI-Artefact: An Artefact Detection Benchmark Dataset for Streamed Videos
by: Feng, Chen, et al.
Published: (2023)

Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising
by: Lyu, Shuangquan, et al.
Published: (2025)

Towards Unified Video Quality Assessment
by: Feng, Chen, et al.
Published: (2025)

CoRNStack: High-Quality Contrastive Data for Better Code Retrieval and Reranking
by: Suresh, Tarun, et al.
Published: (2024)

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
by: Schneider, Benjamin, et al.
Published: (2025)