Saved in:
| Main Authors: | Yang, Qianlan, Wang, Yu-Xiong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.04323 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
by: Zhou, Yifei, et al.
Published: (2024)
by: Zhou, Yifei, et al.
Published: (2024)
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
by: Zheng, Kaiwen, et al.
Published: (2025)
by: Zheng, Kaiwen, et al.
Published: (2025)
IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning
by: Li, Chenghao, et al.
Published: (2026)
by: Li, Chenghao, et al.
Published: (2026)
FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients
by: Li, DaiXun, et al.
Published: (2023)
by: Li, DaiXun, et al.
Published: (2023)
RefDiffNet: Learning to Expose Subtle PCB Defects Before Detection
by: Edula, Vinay, et al.
Published: (2026)
by: Edula, Vinay, et al.
Published: (2026)
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
by: Kim, Dongjun, et al.
Published: (2023)
by: Kim, Dongjun, et al.
Published: (2023)
DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization
by: Hosseintabar, Danial, et al.
Published: (2025)
by: Hosseintabar, Danial, et al.
Published: (2025)
DiffCLIP: Differential Attention Meets CLIP
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents
by: Yang, Rui, et al.
Published: (2026)
by: Yang, Rui, et al.
Published: (2026)
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
by: Xu, Yilun, et al.
Published: (2024)
by: Xu, Yilun, et al.
Published: (2024)
DiffBlender: Composable and Versatile Multimodal Text-to-Image Diffusion Models
by: Kim, Sungnyun, et al.
Published: (2023)
by: Kim, Sungnyun, et al.
Published: (2023)
SafeDreamer: Safe Reinforcement Learning with World Models
by: Huang, Weidong, et al.
Published: (2023)
by: Huang, Weidong, et al.
Published: (2023)
STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
by: Zhou, Nan, et al.
Published: (2025)
by: Zhou, Nan, et al.
Published: (2025)
TrajPRed: Trajectory Prediction with Region-based Relation Learning
by: Zhou, Chen, et al.
Published: (2024)
by: Zhou, Chen, et al.
Published: (2024)
Learning Equi-angular Representations for Online Continual Learning
by: Seo, Minhyuk, et al.
Published: (2024)
by: Seo, Minhyuk, et al.
Published: (2024)
Reinforcement Learning with Generalizable Gaussian Splatting
by: Wang, Jiaxu, et al.
Published: (2024)
by: Wang, Jiaxu, et al.
Published: (2024)
Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
by: Wang, Wenkai, et al.
Published: (2026)
by: Wang, Wenkai, et al.
Published: (2026)
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
by: Wang, Zhichao, et al.
Published: (2024)
by: Wang, Zhichao, et al.
Published: (2024)
Diff-Instruct with Diffused Reward: Towards Principled One-step Generator RL
by: Wu, Junyi, et al.
Published: (2026)
by: Wu, Junyi, et al.
Published: (2026)
Diffusion Reinforcement Learning via Centered Reward Distillation
by: Zhu, Yuanzhi, et al.
Published: (2026)
by: Zhu, Yuanzhi, et al.
Published: (2026)
MVR: Multi-view Video Reward Shaping for Reinforcement Learning
by: Luo, Lirui, et al.
Published: (2026)
by: Luo, Lirui, et al.
Published: (2026)
Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories
by: Jang, Wonbong, et al.
Published: (2026)
by: Jang, Wonbong, et al.
Published: (2026)
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
by: Zhou, Andy, et al.
Published: (2023)
by: Zhou, Andy, et al.
Published: (2023)
Natural Gradient Descent for Online Continual Learning
by: Khawand, Joe, et al.
Published: (2026)
by: Khawand, Joe, et al.
Published: (2026)
Is Pre-training Truly Better Than Meta-Learning?
by: Miranda, Brando, et al.
Published: (2023)
by: Miranda, Brando, et al.
Published: (2023)
BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving
by: Liu, Shu, et al.
Published: (2025)
by: Liu, Shu, et al.
Published: (2025)
DiffFinger: Advancing Synthetic Fingerprint Generation through Denoising Diffusion Probabilistic Models
by: Grabovski, Freddie, et al.
Published: (2024)
by: Grabovski, Freddie, et al.
Published: (2024)
Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs
by: Adhikari, Santosh Premi, et al.
Published: (2026)
by: Adhikari, Santosh Premi, et al.
Published: (2026)
Towards Physics-informed Diffusion for Anomaly Detection in Trajectories
by: Sharma, Arun, et al.
Published: (2025)
by: Sharma, Arun, et al.
Published: (2025)
Accelerating Heterogeneous Federated Learning with Closed-form Classifiers
by: Fanì, Eros, et al.
Published: (2024)
by: Fanì, Eros, et al.
Published: (2024)
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
by: Hiranaka, Ayano, et al.
Published: (2024)
by: Hiranaka, Ayano, et al.
Published: (2024)
X-VORTEX: Spatio-Temporal Contrastive Learning for Wake Vortex Trajectory Forecasting
by: Qu, Zhan, et al.
Published: (2026)
by: Qu, Zhan, et al.
Published: (2026)
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
by: Gong, Chengyue, et al.
Published: (2025)
by: Gong, Chengyue, et al.
Published: (2025)
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
by: Mou, Linzhan, et al.
Published: (2024)
by: Mou, Linzhan, et al.
Published: (2024)
Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
by: Cao, Shengcao, et al.
Published: (2024)
by: Cao, Shengcao, et al.
Published: (2024)
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
by: Luo, Weijian
Published: (2024)
by: Luo, Weijian
Published: (2024)
DiffRaman: A Conditional Latent Denoising Diffusion Probabilistic Model for Bacterial Raman Spectroscopy Identification Under Limited Data Conditions
by: Yao, Haiming, et al.
Published: (2024)
by: Yao, Haiming, et al.
Published: (2024)
Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
by: Liu, Sihao, et al.
Published: (2024)
by: Liu, Sihao, et al.
Published: (2024)
Training Diffusion Models with Reinforcement Learning
by: Black, Kevin, et al.
Published: (2023)
by: Black, Kevin, et al.
Published: (2023)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
by: Yang, Senqiao, et al.
Published: (2025)
by: Yang, Senqiao, et al.
Published: (2025)
Similar Items
-
Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
by: Zhou, Yifei, et al.
Published: (2024) -
DiffusionNFT: Online Diffusion Reinforcement with Forward Process
by: Zheng, Kaiwen, et al.
Published: (2025) -
IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning
by: Li, Chenghao, et al.
Published: (2026) -
FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients
by: Li, DaiXun, et al.
Published: (2023) -
RefDiffNet: Learning to Expose Subtle PCB Defects Before Detection
by: Edula, Vinay, et al.
Published: (2026)