Library Catalog

Bibliographic Details
Main Authors:	Chen, Deshu; Liu, Yuchen; Zhou, Zhijian; Qu, Chao; Qi, Yuan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2509.23087

Similar Items

Project and Generate: Divergence-Free Neural Operators for Incompressible Flows
by: Li, Xigui, et al.
Published: (2026)

Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation
by: An, Junyi, et al.
Published: (2026)

Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation
by: Zhou, Zhijian, et al.
Published: (2025)

Distributional Off-Policy Evaluation with Deep Quantile Process Regression
by: Kuang, Qi, et al.
Published: (2026)

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
by: Zhuo, Zhijian, et al.
Published: (2024)

A Kernel Distribution Closeness Testing
by: Zhou, Zhijian, et al.
Published: (2025)

Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
by: Zhu, Yuchen, et al.
Published: (2025)

Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)

Nested Spatio-Temporal Time Series Forecasting
by: Ai, Yinghao, et al.
Published: (2026)

DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay
by: Li, Long, et al.
Published: (2026)

MARS: Unleashing the Power of Variance Reduction for Training Large Models
by: Yuan, Huizhuo, et al.
Published: (2024)

Inverse Flow and Consistency Models
by: Zhang, Yuchen, et al.
Published: (2025)

Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
by: Xu, Weichao, et al.
Published: (2024)

PolicyFlow: Policy Optimization with Continuous Normalizing Flow in Reinforcement Learning
by: Yang, Shunpeng, et al.
Published: (2026)

Optimal Convergence Analysis of DDPM for General Distributions
by: Jiao, Yuchen, et al.
Published: (2025)

Deep Transfer Learning: Model Framework and Error Analysis
by: Jiao, Yuling, et al.
Published: (2024)

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
by: Li, Long, et al.
Published: (2025)

Distributional Treatment Effect Estimation across Heterogeneous Sites via Optimal Transport
by: Bateni, Borna, et al.
Published: (2025)

ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning
by: Zhang, Tonghe, et al.
Published: (2025)

MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT
by: Ouyang, Xiaomin, et al.
Published: (2024)

Continual Policy Distillation from Distributed Reinforcement Learning Teachers
by: Li, Yuxuan, et al.
Published: (2026)

TFTF: Training-Free Targeted Flow for Conditional Sampling
by: Qu, Qianqian, et al.
Published: (2026)

Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics
by: Chen, Ming-Hong, et al.
Published: (2026)

Anchor-based Maximum Discrepancy for Relative Similarity Testing
by: Zhou, Zhijian, et al.
Published: (2025)

TrafficKAN-GCN: Graph Convolutional-based Kolmogorov-Arnold Network for Traffic Flow Optimization
by: Zhang, Jiayi, et al.
Published: (2025)

M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision
by: Liu, Che, et al.
Published: (2025)

Lipschitz-Regularized Critics Lead to Policy Robustness Against Transition Dynamics Uncertainty
by: Chen, Xulin, et al.
Published: (2024)

DUAL: Learning Diverse Kernels for Aggregated Two-sample and Independence Testing
by: Zhou, Zhijian, et al.
Published: (2025)

DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
by: Zhou, Zihan, et al.
Published: (2025)

Dynamic Gaussian Graph Operator: Learning parametric partial differential equations in arbitrary discrete mechanics problems
by: Wang, Chu, et al.
Published: (2024)

Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
by: Hao, Ruijie, et al.
Published: (2026)

Decision Flow Policy Optimization
by: Hu, Jifeng, et al.
Published: (2025)

Functional Critics Are Essential for Actor-Critic: From Off-Policy Stability to Efficient Exploration
by: Bai, Qinxun, et al.
Published: (2025)

Perturbations in the Orthogonal Complement Subspace for Efficient Out-of-Distribution Detection
by: Huang, Zhexiao, et al.
Published: (2025)

Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients
by: DeWeese, Alex, et al.
Published: (2026)

Distributional Reinforcement Learning with Diffusion Bridge Critics
by: Ding, Shutong, et al.
Published: (2026)

Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning
by: Zhang, Chubin, et al.
Published: (2025)

AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
by: Hu, Xixi, et al.
Published: (2024)

HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
by: Zhou, Xinyu, et al.
Published: (2024)

Improving DAPO from a Mixed-Policy Perspective
by: Tan, Hongze, et al.
Published: (2025)