:: Library Catalog

Bibliographic Details
Main Authors:	Sun, Xiaopeng; Lin, Qinwei; Gao, Yu; Zhong, Yujie; Feng, Chengjian; Li, Dengjie; Zhao, Zheng; Hu, Jie; Ma, Lin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.03268

Similar Items

TASR: Timestep-Aware Diffusion Model for Image Super-Resolution
by: Lin, Qinwei, et al.
Published: (2024)

RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
by: Xiao, Baihui, et al.
Published: (2025)

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
by: Feng, Chengjian, et al.
Published: (2024)

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
by: Zeng, Yingsen, et al.
Published: (2024)

AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline
by: Wang, Lei, et al.
Published: (2025)

DisTime: Distribution-based Time Representation for Video Large Language Models
by: Zeng, Yingsen, et al.
Published: (2025)

Manga Generation via Layout-controllable Diffusion
by: Chen, Siyu, et al.
Published: (2024)

LinVT: Empower Your Image-level Large Language Model to Understand Videos
by: Gao, Lishuai, et al.
Published: (2024)

Matten: Video Generation with Mamba-Attention
by: Gao, Yu, et al.
Published: (2024)

RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving
by: Huang, Zhijian, et al.
Published: (2024)

RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
by: Zhong, Yufeng, et al.
Published: (2025)

Boosting Robotic Manipulation Generalization with Minimal Costly Data
by: Zheng, Liming, et al.
Published: (2025)

InstructVEdit: A Holistic Approach for Instructional Video Editing
by: Zhang, Chi, et al.
Published: (2025)

Advancing Visual Large Language Model for Multi-granular Versatile Perception
by: Xiang, Wentao, et al.
Published: (2025)

RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
by: Liu, Fanfan, et al.
Published: (2024)

RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation
by: Yan, Feng, et al.
Published: (2024)

Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2023)

RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios
by: Zheng, Liming, et al.
Published: (2024)

High-quality Image Dehazing with Diffusion Model
by: Yu, Hu, et al.
Published: (2023)

Regulating Anatomy-Aware Rewards via Trajectory-Integral Feedback for Volumetric Computed Tomography Analysis
by: Lin, Tianwei, et al.
Published: (2026)

HiMix: Reducing Computational Complexity in Large Vision-Language Models
by: Zhang, Xuange, et al.
Published: (2025)

MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference
by: Huang, Jiancheng, et al.
Published: (2024)

DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning
by: Wu, Bin, et al.
Published: (2025)

Reward-Directed Score-Based Diffusion Models via q-Learning
by: Gao, Xuefeng, et al.
Published: (2024)

CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
by: Ge, Wenhang, et al.
Published: (2026)

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
by: Zhang, Shun, et al.
Published: (2024)

LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories
by: Liang, Zhanhao, et al.
Published: (2026)

Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback
by: Ma, Xingpei, et al.
Published: (2025)

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
by: Zhang, Jiacheng, et al.
Published: (2024)

Monocular Gaussian SLAM with Language Extended Loop Closure
by: Lan, Tian, et al.
Published: (2024)

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
by: Shen, Wei, et al.
Published: (2024)

ISR: Invertible Symbolic Regression
by: Tohme, Tony, et al.
Published: (2024)

Reward-free Alignment for Conflicting Objectives
by: Chen, Peter, et al.
Published: (2026)

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2024)

MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis
by: Chen, Lei, et al.
Published: (2024)

Efficient Reasoning via Reward Model
by: Wang, Yuhao, et al.
Published: (2025)

Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling
by: Cho, Young Hyun, et al.
Published: (2026)

LiveR: Fine-Grained Elasticity via Live Reconfiguration for Model Training
by: Liu, Haoyuan, et al.
Published: (2026)

Output Feedback to Improve the Delay Margin of Linear Delay Systems
by: Hu, Renhong, et al.
Published: (2025)

DiffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution
by: Zhou, Yuanbo, et al.
Published: (2024)