| Main Authors: | Fang, Zijian, Liu, Zongkai, Yu, Chao, Hu, Chaohao |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.00533 |
Similar Items
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)
Policy-regularized Offline Multi-objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)
Iterative Minimax Games with Coupled Linear Constraints
by: Zhang, Huiling, et al.
Published: (2022)
Learning Word Embedding with Better Distance Weighting and Window Size Scheduling
by: Yang, Chaohao, et al.
Published: (2024)
Momentum Contrastive Learning with Enhanced Negative Sampling and Hard Negative Filtering
by: Hoang, Duy, et al.
Published: (2025)
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
by: Liu, Zongkai, et al.
Published: (2025)
GAGPO: Generalized Advantage Grouped Policy Optimization
by: Zhu, Siyuan, et al.
Published: (2026)
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)
Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning
by: Zhang, Haobin, et al.
Published: (2024)
Minimax-Optimal Policy Regret in Partially Observable Markov Games
by: Arora, Raman
Published: (2026)
Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation
by: Zhou, Zhijian, et al.
Published: (2025)
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
by: An, Zijian, et al.
Published: (2025)
Negative Momentum for Convex-Concave Optimization
by: Shugart, Henry, et al.
Published: (2026)
Minimax Statistical Estimation under Wasserstein Contamination
by: Chao, Patrick, et al.
Published: (2023)
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
by: Xu, Zelai, et al.
Published: (2023)
Deep SOR Minimax Q-learning for Two-player Zero-sum Game
by: Gautam, Saksham, et al.
Published: (2025)
Minimax Signal Detection in Sparse Additive Models
by: Kotekal, Subhodh, et al.
Published: (2023)
On Statistical Rates of Conditional Diffusion Transformers: Approximation, Estimation and Minimax Optimality
by: Hu, Jerry Yao-Chieh, et al.
Published: (2024)
Minimax Rates for Hyperbolic Hierarchical Learning
by: Rawal, Divit, et al.
Published: (2026)
Minimax Generalized Cross-Entropy
by: Bondugula, Kartheek, et al.
Published: (2026)
ParaFormer: A Generalized PageRank Graph Transformer for Graph Representation Learning
by: Yuan, Chaohao, et al.
Published: (2025)
AdaPM: a Partial Momentum Algorithm for LLM Training
by: Zhang, Yimu, et al.
Published: (2025)
Dynamic Momentum Recalibration in Online Gradient Learning
by: Yao, Zhipeng, et al.
Published: (2026)
Decouple before Integration: Test-time Synthesis of SFT and RLVR Task Vectors
by: Yuan, Chaohao, et al.
Published: (2026)
ASD Classification on Dynamic Brain Connectome using Temporal Random Walk with Transformer-based Dynamic Network Embedding
by: Piriyasatit, Suchanuch, et al.
Published: (2025)
Minimax Optimal Reinforcement Learning with Quasi-Optimism
by: Lee, Harin, et al.
Published: (2025)
Minimax Optimal Q Learning with Nearest Neighbors
by: Zhao, Puning, et al.
Published: (2023)
Minimax Regret Learning for Data with Heterogeneous Subgroups
by: Mo, Weibin, et al.
Published: (2024)
Automated discovery of symbolic laws governing skill acquisition from naturally occurring data
by: Liu, Sannyuya, et al.
Published: (2024)
Momentum Approximation in Asynchronous Private Federated Learning
by: Yu, Tao, et al.
Published: (2024)
Learning What to Recommend: Minimax Optimal Simple Regret in Logistic Bandits
by: Liu, Shuai, et al.
Published: (2026)
Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
by: Liu, Haolin, et al.
Published: (2024)
A Multi-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games
by: R, Shreyas S, et al.
Published: (2024)
Unifying Structural Proximity and Equivalence for Enhanced Dynamic Network Embedding
by: Piriyasatit, Suchanuch, et al.
Published: (2025)
Minimax-Optimal Multi-Agent Robust Reinforcement Learning
by: Jiao, Yuchen, et al.
Published: (2024)
Efficient Large-Scale Learning of Minimax Risk Classifiers
by: Bondugula, Kartheek, et al.
Published: (2025)
Momentum Further Constrains Sharpness at the Edge of Stochastic Stability
by: Andreyev, Arseniy, et al.
Published: (2026)
Towards Sharper Risk Bounds for Minimax Problems
by: Zhu, Bowei, et al.
Published: (2024)
Penalty-Based First-Order Methods for Bilevel Optimization with Minimax and Constrained Lower-Level Problems
by: Shen, Yiyang, et al.
Published: (2026)
Minimax Semiparametric Learning With Approximate Sparsity
by: Bradic, Jelena, et al.
Published: (2019)