| Main Authors: | Guo, Zhengyi; Li, Jiatu; Tang, Wenpin; Yao, David D. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.03898 |
Similar Items
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
by: Zhao, Hanyang, et al.
Published: (2025)
Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
by: Sheng, Jiayuan, et al.
Published: (2025)
DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
by: Zhao, Hanyang, et al.
Published: (2025)
RPO: Fine-Tuning Visual Generative Models via Rich Vision-Language Preferences
by: Zhao, Hanyang, et al.
Published: (2025)
Conditional Diffusion Guidance under Hard Constraint: A Stochastic Analysis Approach
by: Guo, Zhengyi, et al.
Published: (2026)
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
by: Zhao, Hanyang, et al.
Published: (2024)
MallowsPO: Fine-Tune Your LLM with Preference Dispersions
by: Chen, Haoxian, et al.
Published: (2024)
Improved techniques for fine-tuning flow models via adjoint matching: a deterministic control pipeline
by: Guo, Zhengyi, et al.
Published: (2026)
ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule
by: Huang, Yilie, et al.
Published: (2026)
OPD+: Rethinking the Advantage Design for On-Policy Distillation
by: Zhao, Hanyang, et al.
Published: (2026)
CoSA: Compressed Sensing-Based Adaptation of Large Language Models
by: Wei, Songtao, et al.
Published: (2026)
Evolution Meets Diffusion: Efficient Neural Architecture Generation
by: Zhou, Bingye, et al.
Published: (2025)
When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models
by: Zhang, Nan, et al.
Published: (2025)
Diffusion Models Meet Contextual Bandits
by: Aouali, Imad
Published: (2024)
Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems
by: Gunn, Sean, et al.
Published: (2026)
Generative AI Meets Wireless Sensing: Towards Wireless Foundation Model
by: Yang, Zheng, et al.
Published: (2025)
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models
by: Dong, Yingkai, et al.
Published: (2024)
Verified Neural Compressed Sensing
by: Bunel, Rudy, et al.
Published: (2024)
LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard
by: Rao, Varun, et al.
Published: (2025)
FaultDiffusion: Few-Shot Fault Time Series Generation with Diffusion Model
by: Xu, Yi, et al.
Published: (2025)
Training-free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing
by: Tang, Chaoqing, et al.
Published: (2025)
When Continue Learning Meets Multimodal Large Language Model: A Survey
by: Huo, Yukang, et al.
Published: (2025)
ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model
by: Tang, Kai, et al.
Published: (2024)
Contractive Diffusion Probabilistic Models
by: Tang, Wenpin, et al.
Published: (2024)
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression
by: Kim, Dohyun, et al.
Published: (2025)
Multi-level Self-supervised Pretraining on Compositional Hierarchical Graph for Molecular Property Prediction
by: Liu, Xiayu, et al.
Published: (2026)
Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications
by: Cao, Yi, et al.
Published: (2026)
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
by: Feng, Zeyu, et al.
Published: (2024)
GeneZip: Region-Aware Compression for Long Context DNA Modeling
by: Zhao, Jianan, et al.
Published: (2026)
Efficient Multi-Task Modeling through Automated Fusion of Trained Models
by: Zhou, Jingxuan, et al.
Published: (2025)
Sink-Aware Pruning for Diffusion Language Models
by: Myrzakhan, Aidar, et al.
Published: (2026)
Information-Theoretic Optimization for Task-Adapted Compressed Sensing Magnetic Resonance Imaging
by: Peng, Xinyu, et al.
Published: (2026)
LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation
by: Khastagir, Subhojyoti, et al.
Published: (2025)
Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys
by: Yang, Xu, et al.
Published: (2026)
Sub-graph Based Diffusion Model for Link Prediction
by: Li, Hang, et al.
Published: (2024)
A Score-Based Density Formula, with Applications in Diffusion Generative Models
by: Li, Gen, et al.
Published: (2024)
PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling
by: Yuan, Xinyu, et al.
Published: (2026)
Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints
by: Guo, Lei, et al.
Published: (2024)
Alquist 5.0: Dialogue Trees Meet Generative Models. A Novel Approach for Enhancing SocialBot Conversations
by: Kobza, Ondřej, et al.
Published: (2023)
DEER: Draft with Diffusion, Verify with Autoregressive Models
by: Cheng, Zicong, et al.
Published: (2025)