Saved in:
| Main Authors: | Sun, Xiaopeng, Lin, Qinwei, Gao, Yu, Zhong, Yujie, Feng, Chengjian, Li, Dengjie, Zhao, Zheng, Hu, Jie, Ma, Lin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.03268 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TASR: Timestep-Aware Diffusion Model for Image Super-Resolution
by: Lin, Qinwei, et al.
Published: (2024)
by: Lin, Qinwei, et al.
Published: (2024)
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
by: Xiao, Baihui, et al.
Published: (2025)
by: Xiao, Baihui, et al.
Published: (2025)
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
by: Feng, Chengjian, et al.
Published: (2024)
by: Feng, Chengjian, et al.
Published: (2024)
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
by: Zeng, Yingsen, et al.
Published: (2024)
by: Zeng, Yingsen, et al.
Published: (2024)
AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline
by: Wang, Lei, et al.
Published: (2025)
by: Wang, Lei, et al.
Published: (2025)
DisTime: Distribution-based Time Representation for Video Large Language Models
by: Zeng, Yingsen, et al.
Published: (2025)
by: Zeng, Yingsen, et al.
Published: (2025)
Manga Generation via Layout-controllable Diffusion
by: Chen, Siyu, et al.
Published: (2024)
by: Chen, Siyu, et al.
Published: (2024)
LinVT: Empower Your Image-level Large Language Model to Understand Videos
by: Gao, Lishuai, et al.
Published: (2024)
by: Gao, Lishuai, et al.
Published: (2024)
Matten: Video Generation with Mamba-Attention
by: Gao, Yu, et al.
Published: (2024)
by: Gao, Yu, et al.
Published: (2024)
RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving
by: Huang, Zhijian, et al.
Published: (2024)
by: Huang, Zhijian, et al.
Published: (2024)
RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
by: Zhong, Yufeng, et al.
Published: (2025)
by: Zhong, Yufeng, et al.
Published: (2025)
Boosting Robotic Manipulation Generalization with Minimal Costly Data
by: Zheng, Liming, et al.
Published: (2025)
by: Zheng, Liming, et al.
Published: (2025)
InstructVEdit: A Holistic Approach for Instructional Video Editing
by: Zhang, Chi, et al.
Published: (2025)
by: Zhang, Chi, et al.
Published: (2025)
Advancing Visual Large Language Model for Multi-granular Versatile Perception
by: Xiang, Wentao, et al.
Published: (2025)
by: Xiang, Wentao, et al.
Published: (2025)
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
by: Liu, Fanfan, et al.
Published: (2024)
by: Liu, Fanfan, et al.
Published: (2024)
RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation
by: Yan, Feng, et al.
Published: (2024)
by: Yan, Feng, et al.
Published: (2024)
Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2023)
by: Chen, Weifeng, et al.
Published: (2023)
RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios
by: Zheng, Liming, et al.
Published: (2024)
by: Zheng, Liming, et al.
Published: (2024)
High-quality Image Dehazing with Diffusion Model
by: Yu, Hu, et al.
Published: (2023)
by: Yu, Hu, et al.
Published: (2023)
Regulating Anatomy-Aware Rewards via Trajectory-Integral Feedback for Volumetric Computed Tomography Analysis
by: Lin, Tianwei, et al.
Published: (2026)
by: Lin, Tianwei, et al.
Published: (2026)
HiMix: Reducing Computational Complexity in Large Vision-Language Models
by: Zhang, Xuange, et al.
Published: (2025)
by: Zhang, Xuange, et al.
Published: (2025)
MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference
by: Huang, Jiancheng, et al.
Published: (2024)
by: Huang, Jiancheng, et al.
Published: (2024)
DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning
by: Wu, Bin, et al.
Published: (2025)
by: Wu, Bin, et al.
Published: (2025)
Reward-Directed Score-Based Diffusion Models via q-Learning
by: Gao, Xuefeng, et al.
Published: (2024)
by: Gao, Xuefeng, et al.
Published: (2024)
CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
by: Ge, Wenhang, et al.
Published: (2026)
by: Ge, Wenhang, et al.
Published: (2026)
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
by: Zhang, Shun, et al.
Published: (2024)
by: Zhang, Shun, et al.
Published: (2024)
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories
by: Liang, Zhanhao, et al.
Published: (2026)
by: Liang, Zhanhao, et al.
Published: (2026)
Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback
by: Ma, Xingpei, et al.
Published: (2025)
by: Ma, Xingpei, et al.
Published: (2025)
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
by: Zhang, Jiacheng, et al.
Published: (2024)
by: Zhang, Jiacheng, et al.
Published: (2024)
Monocular Gaussian SLAM with Language Extended Loop Closure
by: Lan, Tian, et al.
Published: (2024)
by: Lan, Tian, et al.
Published: (2024)
Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
by: Shen, Wei, et al.
Published: (2024)
by: Shen, Wei, et al.
Published: (2024)
ISR: Invertible Symbolic Regression
by: Tohme, Tony, et al.
Published: (2024)
by: Tohme, Tony, et al.
Published: (2024)
Reward-free Alignment for Conflicting Objectives
by: Chen, Peter, et al.
Published: (2026)
by: Chen, Peter, et al.
Published: (2026)
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
by: Chen, Weifeng, et al.
Published: (2024)
by: Chen, Weifeng, et al.
Published: (2024)
MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis
by: Chen, Lei, et al.
Published: (2024)
by: Chen, Lei, et al.
Published: (2024)
Efficient Reasoning via Reward Model
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
Privacy-Preserving Reinforcement Learning from Human Feedback via Decoupled Reward Modeling
by: Cho, Young Hyun, et al.
Published: (2026)
by: Cho, Young Hyun, et al.
Published: (2026)
LiveR: Fine-Grained Elasticity via Live Reconfiguration for Model Training
by: Liu, Haoyuan, et al.
Published: (2026)
by: Liu, Haoyuan, et al.
Published: (2026)
Output Feedback to Improve the Delay Margin of Linear Delay Systems
by: Renhong Hu, et al.
Published: (2025)
by: Renhong Hu, et al.
Published: (2025)
DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution
by: Zhou, Yuanbo, et al.
Published: (2024)
by: Zhou, Yuanbo, et al.
Published: (2024)
Similar Items
-
TASR: Timestep-Aware Diffusion Model for Image Super-Resolution
by: Lin, Qinwei, et al.
Published: (2024) -
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
by: Xiao, Baihui, et al.
Published: (2025) -
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
by: Feng, Chengjian, et al.
Published: (2024) -
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
by: Zeng, Yingsen, et al.
Published: (2024) -
AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline
by: Wang, Lei, et al.
Published: (2025)