:: Library Catalog

Bibliographic Details
Main Authors:	Ji, Lichuan; Lin, Yingqi; Huang, Zhenhua; Han, Yan; Xu, Xiaogang; Wu, Jiafei; Wang, Chong; Liu, Zhe
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.15343

Similar Items

Restoration-Oriented Video Frame Interpolation with Region-Distinguishable Priors from SAM
by: Han, Yan, et al.
Published: (2023)

Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance
by: Lin, Yingqi, et al.
Published: (2023)

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
by: Yang, Lihe, et al.
Published: (2024)

CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
by: Wu, Huiwen, et al.
Published: (2024)

Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning
by: Wu, Huiwen, et al.
Published: (2024)

DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
by: Wu, Huiwen, et al.
Published: (2024)

Adversarial Attacks of Vision Tasks in the Past 10 Years: A Survey
by: Zhang, Chiyu, et al.
Published: (2024)

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
by: Li, Jiajie, et al.
Published: (2025)

Segment Any Motion in Videos
by: Huang, Nan, et al.
Published: (2025)

Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts
by: Zhang, Chiyu, et al.
Published: (2025)

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs
by: Zhao, Yiran, et al.
Published: (2026)

LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation
by: Tang, Huadong, et al.
Published: (2024)

Differential Private Stochastic Optimization with Heavy-tailed Data: Towards Optimal Rates
by: Zhao, Puning, et al.
Published: (2024)

From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents
by: Zhang, Xiaolei, et al.
Published: (2026)

Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition
by: Xu, Xiaogang, et al.
Published: (2024)

Low-Light Video Enhancement with An Effective Spatial-Temporal Decomposition Paradigm
by: Xu, Xiaogang, et al.
Published: (2026)

Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition
by: Wang, Xiao, et al.
Published: (2023)

Moaw: Unleashing Motion Awareness for Video Diffusion Models
by: Zhang, Tianqi, et al.
Published: (2026)

Towards Understanding Camera Motions in Any Video
by: Lin, Zhiqiu, et al.
Published: (2025)

Exploring and Unleashing the Power of Large Language Models in CI/CD Configuration Translation
by: Wang, Chong, et al.
Published: (2025)

HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2024)

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations
by: Xu, Liang, et al.
Published: (2024)

FC-VFI: Faithful and Consistent Video Frame Interpolation for High-FPS Slow Motion Video Generation
by: Ding, Ganggui, et al.
Published: (2026)

Track Any Motions under Any Disturbances
by: Zhang, Zhikai, et al.
Published: (2025)

Unleashing the Power of Natural Audio Featuring Multiple Sound Sources
by: Cheng, Xize, et al.
Published: (2025)

HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing
by: Chen, Ruyi, et al.
Published: (2026)

MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
by: Sun, Xiaokun, et al.
Published: (2026)

Depth Any Video with Scalable Synthetic Data
by: Yang, Honghui, et al.
Published: (2024)

Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding
by: Ji, Deyi, et al.
Published: (2024)

COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
by: Wang, Jiangshan, et al.
Published: (2024)

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
by: Li, Zhuoling, et al.
Published: (2024)

Feature Disentanglement in GANs for Photorealistic Multi‐view Hair Transfer
by: Xu, Jiayi, et al.
Published: (2025)

Towards Fine-Grained Human Motion Video Captioning
by: Song, Guorui, et al.
Published: (2025)

UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks
by: Qian, Yuanbin, et al.
Published: (2025)

Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement
by: Xu, Xiaogang, et al.
Published: (2025)

Motion Anything: Any to Motion Generation
by: Zhang, Zeyu, et al.
Published: (2025)

Motion Graph Unleashed: A Novel Approach to Video Prediction
by: Zhong, Yiqi, et al.
Published: (2024)

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models
by: Yun, Kwan, et al.
Published: (2025)

MARS: Unleashing the Power of Variance Reduction for Training Large Models
by: Yuan, Huizhuo, et al.
Published: (2024)

AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource
by: Zhan, Wengyi, et al.
Published: (2024)