Saved in:
| Main Authors: | Liao, Luofeng, Fu, Zuyue, Yang, Zhuoran, Wang, Yixin, Kolar, Mladen, Wang, Zhaoran |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2102.09907 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
by: Lu, Miao, et al.
Published: (2022)
by: Lu, Miao, et al.
Published: (2022)
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
by: Bai, Chenjia, et al.
Published: (2024)
by: Bai, Chenjia, et al.
Published: (2024)
An Instrumental Value for Data Production and its Application to Data Pricing
by: Ai, Rui, et al.
Published: (2024)
by: Ai, Rui, et al.
Published: (2024)
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
by: Zhang, Dake, et al.
Published: (2024)
by: Zhang, Dake, et al.
Published: (2024)
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
by: Cai, Qi, et al.
Published: (2022)
by: Cai, Qi, et al.
Published: (2022)
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
by: Qiu, Shuang, et al.
Published: (2022)
by: Qiu, Shuang, et al.
Published: (2022)
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
by: Li, Zihao, et al.
Published: (2024)
by: Li, Zihao, et al.
Published: (2024)
Federated Offline Reinforcement Learning
by: Zhou, Doudou, et al.
Published: (2022)
by: Zhou, Doudou, et al.
Published: (2022)
SMART: A Spectral Transfer Approach to Multi-Task Learning
by: Zhao, Boxin, et al.
Published: (2026)
by: Zhao, Boxin, et al.
Published: (2026)
Trans-Glasso: A Transfer Learning Approach to Precision Matrix Estimation
by: Zhao, Boxin, et al.
Published: (2024)
by: Zhao, Boxin, et al.
Published: (2024)
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
by: Wang, Lingxiao, et al.
Published: (2022)
by: Wang, Lingxiao, et al.
Published: (2022)
Gradient Clipping Beyond Vector Norms: A Spectral Approach for Matrix-Valued Parameters
by: Yukhimchuk, Alexander, et al.
Published: (2026)
by: Yukhimchuk, Alexander, et al.
Published: (2026)
A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
by: Ai, Rui, et al.
Published: (2022)
by: Ai, Rui, et al.
Published: (2022)
High-dimensional Functional Graphical Model Structure Learning via Neighborhood Selection Approach
by: Zhao, Boxin, et al.
Published: (2021)
by: Zhao, Boxin, et al.
Published: (2021)
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
by: Di, Qiwei, et al.
Published: (2023)
by: Di, Qiwei, et al.
Published: (2023)
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
by: Zhang, Yufeng, et al.
Published: (2020)
by: Zhang, Yufeng, et al.
Published: (2020)
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
by: Qiu, Shuang, et al.
Published: (2022)
by: Qiu, Shuang, et al.
Published: (2022)
Provably Efficient Exploration in Policy Optimization
by: Cai, Qi, et al.
Published: (2019)
by: Cai, Qi, et al.
Published: (2019)
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
by: Jin, Ying, et al.
Published: (2022)
by: Jin, Ying, et al.
Published: (2022)
Variational Transport: A Convergent Particle-BasedAlgorithm for Distributional Optimization
by: Yang, Zhuoran, et al.
Published: (2020)
by: Yang, Zhuoran, et al.
Published: (2020)
High-Dimensional Markov-switching Ordinary Differential Processes
by: Tsai, Katherine, et al.
Published: (2024)
by: Tsai, Katherine, et al.
Published: (2024)
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
by: Zhong, Han, et al.
Published: (2021)
by: Zhong, Han, et al.
Published: (2021)
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
by: Zhang, Yufeng, et al.
Published: (2021)
by: Zhang, Yufeng, et al.
Published: (2021)
Contextual Dynamic Pricing with Strategic Buyers
by: Liu, Pangpang, et al.
Published: (2023)
by: Liu, Pangpang, et al.
Published: (2023)
Provable Accelerated Bayesian Optimization with Knowledge Transfer
by: Lin, Haitao, et al.
Published: (2025)
by: Lin, Haitao, et al.
Published: (2025)
Confounded Causal Imitation Learning with Instrumental Variables
by: Zeng, Yan, et al.
Published: (2025)
by: Zeng, Yan, et al.
Published: (2025)
Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)
by: Liu, Xuefeng, et al.
Published: (2025)
A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization
by: Zhu, Yuchen, et al.
Published: (2024)
by: Zhu, Yuchen, et al.
Published: (2024)
Adaptive Client Sampling in Federated Learning via Online Learning with Bandit Feedback
by: Zhao, Boxin, et al.
Published: (2021)
by: Zhao, Boxin, et al.
Published: (2021)
Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data
by: Wang, Danyang, et al.
Published: (2024)
by: Wang, Danyang, et al.
Published: (2024)
High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
by: Williams, Daniel J., et al.
Published: (2024)
by: Williams, Daniel J., et al.
Published: (2024)
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
by: Qiao, Zhongjian, et al.
Published: (2025)
by: Qiao, Zhongjian, et al.
Published: (2025)
AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods
by: Lau, Tim Tsz-Kit, et al.
Published: (2024)
by: Lau, Tim Tsz-Kit, et al.
Published: (2024)
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
by: Wang, Siyu, et al.
Published: (2025)
by: Wang, Siyu, et al.
Published: (2025)
Personalized Binomial DAGs Learning with Network Structured Covariates
by: Zhao, Boxin, et al.
Published: (2024)
by: Zhao, Boxin, et al.
Published: (2024)
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
by: Wang, Qi, et al.
Published: (2023)
by: Wang, Qi, et al.
Published: (2023)
MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment
by: Wang, Ziyan, et al.
Published: (2023)
by: Wang, Ziyan, et al.
Published: (2023)
Causal GNNs: A GNN-Driven Instrumental Variable Approach for Causal Inference in Networks
by: Du, Xiaojing, et al.
Published: (2024)
by: Du, Xiaojing, et al.
Published: (2024)
CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation
by: Zhang, Zhengzhe, et al.
Published: (2025)
by: Zhang, Zhengzhe, et al.
Published: (2025)
Similar Items
-
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
by: Lu, Miao, et al.
Published: (2022) -
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
by: Bai, Chenjia, et al.
Published: (2024) -
An Instrumental Value for Data Production and its Application to Data Pricing
by: Ai, Rui, et al.
Published: (2024) -
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
by: Zhang, Dake, et al.
Published: (2024) -
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
by: Cai, Qi, et al.
Published: (2022)