:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Qianlan, Wang, Yu-Xiong
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.04323
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
by: Zhou, Yifei, et al.
Published: (2024)

DiffusionNFT: Online Diffusion Reinforcement with Forward Process
by: Zheng, Kaiwen, et al.
Published: (2025)

IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning
by: Li, Chenghao, et al.
Published: (2026)

FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients
by: Li, DaiXun, et al.
Published: (2023)

RefDiffNet: Learning to Expose Subtle PCB Defects Before Detection
by: Edula, Vinay, et al.
Published: (2026)

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
by: Kim, Dongjun, et al.
Published: (2023)

DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization
by: Hosseintabar, Danial, et al.
Published: (2025)

DiffCLIP: Differential Attention Meets CLIP
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents
by: Yang, Rui, et al.
Published: (2026)

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
by: Xu, Yilun, et al.
Published: (2024)

DiffBlender: Composable and Versatile Multimodal Text-to-Image Diffusion Models
by: Kim, Sungnyun, et al.
Published: (2023)

SafeDreamer: Safe Reinforcement Learning with World Models
by: Huang, Weidong, et al.
Published: (2023)

STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
by: Zhou, Nan, et al.
Published: (2025)

TrajPRed: Trajectory Prediction with Region-based Relation Learning
by: Zhou, Chen, et al.
Published: (2024)

Learning Equi-angular Representations for Online Continual Learning
by: Seo, Minhyuk, et al.
Published: (2024)

Reinforcement Learning with Generalizable Gaussian Splatting
by: Wang, Jiaxu, et al.
Published: (2024)

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
by: Wang, Wenkai, et al.
Published: (2026)

GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
by: Wang, Zhichao, et al.
Published: (2024)

Diff-Instruct with Diffused Reward: Towards Principled One-step Generator RL
by: Wu, Junyi, et al.
Published: (2026)

Diffusion Reinforcement Learning via Centered Reward Distillation
by: Zhu, Yuanzhi, et al.
Published: (2026)

MVR: Multi-view Video Reward Shaping for Reinforcement Learning
by: Luo, Lirui, et al.
Published: (2026)

Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories
by: Jang, Wonbong, et al.
Published: (2026)

Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
by: Zhou, Andy, et al.
Published: (2023)

Natural Gradient Descent for Online Continual Learning
by: Khawand, Joe, et al.
Published: (2026)

Is Pre-training Truly Better Than Meta-Learning?
by: Miranda, Brando, et al.
Published: (2023)

BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving
by: Liu, Shu, et al.
Published: (2025)

DiffFinger: Advancing Synthetic Fingerprint Generation through Denoising Diffusion Probabilistic Models
by: Grabovski, Freddie, et al.
Published: (2024)

Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs
by: Adhikari, Santosh Premi, et al.
Published: (2026)

Towards Physics-informed Diffusion for Anomaly Detection in Trajectories
by: Sharma, Arun, et al.
Published: (2025)

Accelerating Heterogeneous Federated Learning with Closed-form Classifiers
by: Fanì, Eros, et al.
Published: (2024)

HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)

X-VORTEX: Spatio-Temporal Contrastive Learning for Wake Vortex Trajectory Forecasting
by: Qu, Zhan, et al.
Published: (2026)

Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
by: Gong, Chengyue, et al.
Published: (2025)

Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
by: Mou, Linzhan, et al.
Published: (2024)

Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
by: Cao, Shengcao, et al.
Published: (2024)

Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
by: Luo, Weijian
Published: (2024)

DiffRaman: A Conditional Latent Denoising Diffusion Probabilistic Model for Bacterial Raman Spectroscopy Identification Under Limited Data Conditions
by: Yao, Haiming, et al.
Published: (2024)

Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
by: Liu, Sihao, et al.
Published: (2024)

Training Diffusion Models with Reinforcement Learning
by: Black, Kevin, et al.
Published: (2023)

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
by: Yang, Senqiao, et al.
Published: (2025)