Saved in:
| Main Authors: | Miao, Yanting, Loh, William, Poupart, Pacal, Kothawade, Suraj |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.12036 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
by: Miao, Yanting, et al.
Published: (2024)
by: Miao, Yanting, et al.
Published: (2024)
Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
by: Mohebbi, Hossein, et al.
Published: (2025)
by: Mohebbi, Hossein, et al.
Published: (2025)
Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024)
by: Chopra, Shivang, et al.
Published: (2024)
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning
by: Gupta, Shashank, et al.
Published: (2025)
by: Gupta, Shashank, et al.
Published: (2025)
TDHook: A Lightweight Framework for Interpretability
by: Poupart, Yoann
Published: (2025)
by: Poupart, Yoann
Published: (2025)
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
by: Kim, Kyuyoung, et al.
Published: (2024)
by: Kim, Kyuyoung, et al.
Published: (2024)
Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
by: Chopra, Shivang, et al.
Published: (2023)
by: Chopra, Shivang, et al.
Published: (2023)
Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models
by: Lewandowski, Basile, et al.
Published: (2025)
by: Lewandowski, Basile, et al.
Published: (2025)
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
by: Yang, Ningyuan, et al.
Published: (2025)
by: Yang, Ningyuan, et al.
Published: (2025)
Membership Inference Attacks Against Fine-tuned Diffusion Language Models
by: Chen, Yuetian, et al.
Published: (2026)
by: Chen, Yuetian, et al.
Published: (2026)
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis
by: Khwaja, Emaad, et al.
Published: (2024)
by: Khwaja, Emaad, et al.
Published: (2024)
Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization
by: Xue, Yuan, et al.
Published: (2026)
by: Xue, Yuan, et al.
Published: (2026)
Faster Convergence for Transformer Fine-tuning with Line Search Methods
by: Kenneweg, Philip, et al.
Published: (2024)
by: Kenneweg, Philip, et al.
Published: (2024)
Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models
by: Na, Byeonghu, et al.
Published: (2025)
by: Na, Byeonghu, et al.
Published: (2025)
Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models
by: Miao, Yanting, et al.
Published: (2026)
by: Miao, Yanting, et al.
Published: (2026)
How Robust is Model Editing after Fine-Tuning? An Empirical Study on Text-to-Image Diffusion Models
by: He, Feng, et al.
Published: (2025)
by: He, Feng, et al.
Published: (2025)
Why Online Reinforcement Learning is Causal
by: Schulte, Oliver, et al.
Published: (2024)
by: Schulte, Oliver, et al.
Published: (2024)
Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens
by: Jeong, Jihwan, et al.
Published: (2025)
by: Jeong, Jihwan, et al.
Published: (2025)
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models
by: Chen, Jingyi, et al.
Published: (2024)
by: Chen, Jingyi, et al.
Published: (2024)
Measures of Variability for Risk-averse Policy Gradient
by: Luo, Yudong, et al.
Published: (2025)
by: Luo, Yudong, et al.
Published: (2025)
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
by: Fan, Chenyou, et al.
Published: (2024)
by: Fan, Chenyou, et al.
Published: (2024)
Fine-tuning Large Language Model for Automated Algorithm Design
by: Liu, Fei, et al.
Published: (2025)
by: Liu, Fei, et al.
Published: (2025)
Fine-tuning Flow Matching Generative Models with Intermediate Feedback
by: Fan, Jiajun, et al.
Published: (2025)
by: Fan, Jiajun, et al.
Published: (2025)
Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion Models
by: Na, Byeonghu, et al.
Published: (2025)
by: Na, Byeonghu, et al.
Published: (2025)
MINTS: Minimalist Thompson Sampling
by: Wang, Kaizheng
Published: (2026)
by: Wang, Kaizheng
Published: (2026)
Exploring Memorization in Fine-tuned Language Models
by: Zeng, Shenglai, et al.
Published: (2023)
by: Zeng, Shenglai, et al.
Published: (2023)
KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study
by: Ranganath, Suraj, et al.
Published: (2026)
by: Ranganath, Suraj, et al.
Published: (2026)
TuneComp: Joint Fine-tuning and Compression for Large Foundation Models
by: Chen, Xiangyu, et al.
Published: (2025)
by: Chen, Xiangyu, et al.
Published: (2025)
Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models
by: Fan, Jiajun, et al.
Published: (2025)
by: Fan, Jiajun, et al.
Published: (2025)
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
by: Kim, Hyunjae, et al.
Published: (2024)
by: Kim, Hyunjae, et al.
Published: (2024)
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation
by: Subia-Waud, Christopher
Published: (2025)
by: Subia-Waud, Christopher
Published: (2025)
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts
by: Zhang, Ming, et al.
Published: (2025)
by: Zhang, Ming, et al.
Published: (2025)
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
by: Yang, Kai, et al.
Published: (2023)
by: Yang, Kai, et al.
Published: (2023)
A Minimalist Bayesian Framework for Stochastic Optimization
by: Wang, Kaizheng
Published: (2025)
by: Wang, Kaizheng
Published: (2025)
Gaussian Match-and-Copy: A Minimalist Benchmark for Studying Transformer Induction
by: Gonon, Antoine, et al.
Published: (2026)
by: Gonon, Antoine, et al.
Published: (2026)
A Generic Method for Fine-grained Category Discovery in Natural Language Texts
by: Tian, Chang, et al.
Published: (2024)
by: Tian, Chang, et al.
Published: (2024)
Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices
by: Matsutani, Hiroki, et al.
Published: (2024)
by: Matsutani, Hiroki, et al.
Published: (2024)
Non-Uniform Class-Wise Coreset Selection for Vision Model Fine-tuning
by: Zhang, Hanyu, et al.
Published: (2025)
by: Zhang, Hanyu, et al.
Published: (2025)
Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning
by: Liu, Jia, et al.
Published: (2025)
by: Liu, Jia, et al.
Published: (2025)
REFINE-DP: Diffusion Policy Fine-tuning for Humanoid Loco-manipulation via Reinforcement Learning
by: Gu, Zhaoyuan, et al.
Published: (2026)
by: Gu, Zhaoyuan, et al.
Published: (2026)
Similar Items
-
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
by: Miao, Yanting, et al.
Published: (2024) -
Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
by: Mohebbi, Hossein, et al.
Published: (2025) -
Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024) -
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning
by: Gupta, Shashank, et al.
Published: (2025) -
TDHook: A Lightweight Framework for Interpretability
by: Poupart, Yoann
Published: (2025)