| Main Authors: | Chen, Deshu; Liu, Yuchen; Zhou, Zhijian; Qu, Chao; Qi, Yuan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.23087 |
Similar Items
Project and Generate: Divergence-Free Neural Operators for Incompressible Flows
by: Li, Xigui, et al.
Published: (2026)
Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation
by: An, Junyi, et al.
Published: (2026)
Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation
by: Zhou, Zhijian, et al.
Published: (2025)
Distributional Off-Policy Evaluation with Deep Quantile Process Regression
by: Kuang, Qi, et al.
Published: (2026)
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
by: Zhuo, Zhijian, et al.
Published: (2024)
A Kernel Distribution Closeness Testing
by: Zhou, Zhijian, et al.
Published: (2025)
Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
by: Zhu, Yuchen, et al.
Published: (2025)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
Nested Spatio-Temporal Time Series Forecasting
by: Ai, Yinghao, et al.
Published: (2026)
DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay
by: Li, Long, et al.
Published: (2026)
MARS: Unleashing the Power of Variance Reduction for Training Large Models
by: Yuan, Huizhuo, et al.
Published: (2024)
Inverse Flow and Consistency Models
by: Zhang, Yuchen, et al.
Published: (2025)
Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
by: Xu, Weichao, et al.
Published: (2024)
PolicyFlow: Policy Optimization with Continuous Normalizing Flow in Reinforcement Learning
by: Yang, Shunpeng, et al.
Published: (2026)
Optimal Convergence Analysis of DDPM for General Distributions
by: Jiao, Yuchen, et al.
Published: (2025)
Deep Transfer Learning: Model Framework and Error Analysis
by: Jiao, Yuling, et al.
Published: (2024)
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
by: Li, Long, et al.
Published: (2025)
Distributional Treatment Effect Estimation across Heterogeneous Sites via Optimal Transport
by: Bateni, Borna, et al.
Published: (2025)
ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning
by: Zhang, Tonghe, et al.
Published: (2025)
MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT
by: Ouyang, Xiaomin, et al.
Published: (2024)
Continual Policy Distillation from Distributed Reinforcement Learning Teachers
by: Li, Yuxuan, et al.
Published: (2026)
TFTF: Training-Free Targeted Flow for Conditional Sampling
by: Qu, Qianqian, et al.
Published: (2026)
Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics
by: Chen, Ming-Hong, et al.
Published: (2026)
Anchor-based Maximum Discrepancy for Relative Similarity Testing
by: Zhou, Zhijian, et al.
Published: (2025)
TrafficKAN-GCN: Graph Convolutional-based Kolmogorov-Arnold Network for Traffic Flow Optimization
by: Zhang, Jiayi, et al.
Published: (2025)
M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision
by: Liu, Che, et al.
Published: (2025)
Lipschitz-Regularized Critics Lead to Policy Robustness Against Transition Dynamics Uncertainty
by: Chen, Xulin, et al.
Published: (2024)
DUAL: Learning Diverse Kernels for Aggregated Two-sample and Independence Testing
by: Zhou, Zhijian, et al.
Published: (2025)
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
by: Zhou, Zihan, et al.
Published: (2025)
Dynamic Gaussian Graph Operator: Learning parametric partial differential equations in arbitrary discrete mechanics problems
by: Wang, Chu, et al.
Published: (2024)
Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
by: Hao, Ruijie, et al.
Published: (2026)
Decision Flow Policy Optimization
by: Hu, Jifeng, et al.
Published: (2025)
Functional Critics Are Essential for Actor-Critic: From Off-Policy Stability to Efficient Exploration
by: Bai, Qinxun, et al.
Published: (2025)
Perturbations in the Orthogonal Complement Subspace for Efficient Out-of-Distribution Detection
by: Huang, Zhexiao, et al.
Published: (2025)
Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients
by: DeWeese, Alex, et al.
Published: (2026)
Distributional Reinforcement Learning with Diffusion Bridge Critics
by: Ding, Shutong, et al.
Published: (2026)
Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning
by: Zhang, Chubin, et al.
Published: (2025)
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
by: Hu, Xixi, et al.
Published: (2024)
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
by: Zhou, Xinyu, et al.
Published: (2024)
Improving DAPO from a Mixed-Policy Perspective
by: Tan, Hongze, et al.
Published: (2025)