Saved in:
| Main Authors: | Ji, Lichuan, Lin, Yingqi, Huang, Zhenhua, Han, Yan, Xu, Xiaogang, Wu, Jiafei, Wang, Chong, Liu, Zhe |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.15343 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Restoration-Oriented Video Frame Interpolation with Region-Distinguishable Priors from SAM
by: Han, Yan, et al.
Published: (2023)
by: Han, Yan, et al.
Published: (2023)
Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance
by: Lin, Yingqi, et al.
Published: (2023)
by: Lin, Yingqi, et al.
Published: (2023)
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
by: Yang, Lihe, et al.
Published: (2024)
by: Yang, Lihe, et al.
Published: (2024)
CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
by: Wu, Huiwen, et al.
Published: (2024)
by: Wu, Huiwen, et al.
Published: (2024)
Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning
by: Wu, Huiwen, et al.
Published: (2024)
by: Wu, Huiwen, et al.
Published: (2024)
DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
by: Wu, Huiwen, et al.
Published: (2024)
by: Wu, Huiwen, et al.
Published: (2024)
Adversarial Attacks of Vision Tasks in the Past 10 Years: A Survey
by: Zhang, Chiyu, et al.
Published: (2024)
by: Zhang, Chiyu, et al.
Published: (2024)
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
by: Li, Jiajie, et al.
Published: (2025)
by: Li, Jiajie, et al.
Published: (2025)
Segment Any Motion in Videos
by: Huang, Nan, et al.
Published: (2025)
by: Huang, Nan, et al.
Published: (2025)
Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts
by: Zhang, Chiyu, et al.
Published: (2025)
by: Zhang, Chiyu, et al.
Published: (2025)
Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs
by: Zhao, Yiran, et al.
Published: (2026)
by: Zhao, Yiran, et al.
Published: (2026)
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation
by: Tang, Huadong, et al.
Published: (2024)
by: Tang, Huadong, et al.
Published: (2024)
Differential Private Stochastic Optimization with Heavy-tailed Data: Towards Optimal Rates
by: Zhao, Puning, et al.
Published: (2024)
by: Zhao, Puning, et al.
Published: (2024)
From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents
by: Zhang, Xiaolei, et al.
Published: (2026)
by: Zhang, Xiaolei, et al.
Published: (2026)
Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition
by: Xu, Xiaogang, et al.
Published: (2024)
by: Xu, Xiaogang, et al.
Published: (2024)
Low-Light Video Enhancement with An Effective Spatial-Temporal Decomposition Paradigm
by: Xu, Xiaogang, et al.
Published: (2026)
by: Xu, Xiaogang, et al.
Published: (2026)
Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition
by: Wang, Xiao, et al.
Published: (2023)
by: Wang, Xiao, et al.
Published: (2023)
Moaw: Unleashing Motion Awareness for Video Diffusion Models
by: Zhang, Tianqi, et al.
Published: (2026)
by: Zhang, Tianqi, et al.
Published: (2026)
Towards Understanding Camera Motions in Any Video
by: Lin, Zhiqiu, et al.
Published: (2025)
by: Lin, Zhiqiu, et al.
Published: (2025)
Exploringand Unleashing the Power of Large Language Models in CI/CD Configuration Translation
by: Wang, Chong, et al.
Published: (2025)
by: Wang, Chong, et al.
Published: (2025)
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation
by: Xu, Guoan, et al.
Published: (2024)
by: Xu, Guoan, et al.
Published: (2024)
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations
by: Xu, Liang, et al.
Published: (2024)
by: Xu, Liang, et al.
Published: (2024)
FC-VFI: Faithful and Consistent Video Frame Interpolation for High-FPS Slow Motion Video Generation
by: Ding, Ganggui, et al.
Published: (2026)
by: Ding, Ganggui, et al.
Published: (2026)
Track Any Motions under Any Disturbances
by: Zhang, Zhikai, et al.
Published: (2025)
by: Zhang, Zhikai, et al.
Published: (2025)
Unleashing the Power of Natural Audio Featuring Multiple Sound Sources
by: Cheng, Xize, et al.
Published: (2025)
by: Cheng, Xize, et al.
Published: (2025)
HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing
by: Chen, Ruyi, et al.
Published: (2026)
by: Chen, Ruyi, et al.
Published: (2026)
MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
by: Sun, Xiaokun, et al.
Published: (2026)
by: Sun, Xiaokun, et al.
Published: (2026)
Depth Any Video with Scalable Synthetic Data
by: Yang, Honghui, et al.
Published: (2024)
by: Yang, Honghui, et al.
Published: (2024)
Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding
by: Ji, Deyi, et al.
Published: (2024)
by: Ji, Deyi, et al.
Published: (2024)
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
by: Wang, Jiangshan, et al.
Published: (2024)
by: Wang, Jiangshan, et al.
Published: (2024)
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
by: Li, Zhuoling, et al.
Published: (2024)
by: Li, Zhuoling, et al.
Published: (2024)
Feature Disentanglement in GANs for Photorealistic Multi‐view Hair Transfer
by: Jiayi Xu, et al.
Published: (2025)
by: Jiayi Xu, et al.
Published: (2025)
Towards Fine-Grained Human Motion Video Captioning
by: Song, Guorui, et al.
Published: (2025)
by: Song, Guorui, et al.
Published: (2025)
UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks
by: Qian, Yuanbin, et al.
Published: (2025)
by: Qian, Yuanbin, et al.
Published: (2025)
Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement
by: Xu, Xiaogang, et al.
Published: (2025)
by: Xu, Xiaogang, et al.
Published: (2025)
Motion Anything: Any to Motion Generation
by: Zhang, Zeyu, et al.
Published: (2025)
by: Zhang, Zeyu, et al.
Published: (2025)
Motion Graph Unleashed: A Novel Approach to Video Prediction
by: Zhong, Yiqi, et al.
Published: (2024)
by: Zhong, Yiqi, et al.
Published: (2024)
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models
by: Yun, Kwan, et al.
Published: (2025)
by: Yun, Kwan, et al.
Published: (2025)
MARS: Unleashing the Power of Variance Reduction for Training Large Models
by: Yuan, Huizhuo, et al.
Published: (2024)
by: Yuan, Huizhuo, et al.
Published: (2024)
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource
by: Zhan, Wengyi, et al.
Published: (2024)
by: Zhan, Wengyi, et al.
Published: (2024)
Similar Items
-
Restoration-Oriented Video Frame Interpolation with Region-Distinguishable Priors from SAM
by: Han, Yan, et al.
Published: (2023) -
Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance
by: Lin, Yingqi, et al.
Published: (2023) -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
by: Yang, Lihe, et al.
Published: (2024) -
CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
by: Wu, Huiwen, et al.
Published: (2024) -
Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning
by: Wu, Huiwen, et al.
Published: (2024)