Saved in:
| Main Authors: | Corrado, Nicholas E., Hanna, Josiah P. |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.17786 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
by: Corrado, Nicholas E., et al.
Published: (2023)
by: Corrado, Nicholas E., et al.
Published: (2023)
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
by: Corrado, Nicholas E., et al.
Published: (2023)
by: Corrado, Nicholas E., et al.
Published: (2023)
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
by: Corrado, Nicholas E., et al.
Published: (2026)
by: Corrado, Nicholas E., et al.
Published: (2026)
Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
by: Corrado, Nicholas E., et al.
Published: (2025)
by: Corrado, Nicholas E., et al.
Published: (2025)
When Can Model-Free Reinforcement Learning be Enough for Thinking?
by: Hanna, Josiah P., et al.
Published: (2025)
by: Hanna, Josiah P., et al.
Published: (2025)
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
by: Pavse, Brahma S., et al.
Published: (2023)
by: Pavse, Brahma S., et al.
Published: (2023)
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)
by: Jain, Arushi, et al.
Published: (2024)
Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)
by: Harish, Abhinav Narayan, et al.
Published: (2024)
Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics
by: Kratz, Josiah C., et al.
Published: (2024)
by: Kratz, Josiah C., et al.
Published: (2024)
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
Stable Offline Value Function Learning with Bisimulation-based Representations
by: Pavse, Brahma S., et al.
Published: (2024)
by: Pavse, Brahma S., et al.
Published: (2024)
Sparsely Multimodal Data Fusion
by: Bjorgaard, Josiah
Published: (2024)
by: Bjorgaard, Josiah
Published: (2024)
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
by: Mukherjee, Subhojyoti, et al.
Published: (2023)
by: Mukherjee, Subhojyoti, et al.
Published: (2023)
An Empirical Study on the Power of Future Prediction in Partially Observable Environments
by: Kwon, Jeongyeol, et al.
Published: (2024)
by: Kwon, Jeongyeol, et al.
Published: (2024)
Adversarial Label Invariant Graph Data Augmentations for Out-of-Distribution Generalization
by: Zhang, Simon, et al.
Published: (2026)
by: Zhang, Simon, et al.
Published: (2026)
Augmentation Invariant Manifold Learning
by: Wang, Shulei
Published: (2022)
by: Wang, Shulei
Published: (2022)
Copy-Augmented Representation for Structure Invariant Template-Free Retrosynthesis
by: Zhuang, Jiaxi, et al.
Published: (2025)
by: Zhuang, Jiaxi, et al.
Published: (2025)
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
by: Zhou, Hongyi, et al.
Published: (2025)
by: Zhou, Hongyi, et al.
Published: (2025)
On the Benefit of Optimal Transport for Curriculum Reinforcement Learning
by: Klink, Pascal, et al.
Published: (2023)
by: Klink, Pascal, et al.
Published: (2023)
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
by: Voelcker, Claas A, et al.
Published: (2024)
by: Voelcker, Claas A, et al.
Published: (2024)
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
by: Song, Minhak, et al.
Published: (2025)
by: Song, Minhak, et al.
Published: (2025)
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
by: Cao, Hongye, et al.
Published: (2025)
by: Cao, Hongye, et al.
Published: (2025)
Reinforcement Learning for Graph Coloring: Understanding the Power and Limits of Non-Label Invariant Representations
by: Cummins, Chase, et al.
Published: (2024)
by: Cummins, Chase, et al.
Published: (2024)
Reviving Stale Updates: Data-Free Knowledge Distillation for Asynchronous Federated Learning
by: Askin, Baris, et al.
Published: (2025)
by: Askin, Baris, et al.
Published: (2025)
Understanding the Generalization Benefits of Late Learning Rate Decay
by: Ren, Yinuo, et al.
Published: (2024)
by: Ren, Yinuo, et al.
Published: (2024)
Fundamental Benefit of Alternating Updates in Minimax Optimization
by: Lee, Jaewook, et al.
Published: (2024)
by: Lee, Jaewook, et al.
Published: (2024)
Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon
by: Das, Rudrajit, et al.
Published: (2026)
by: Das, Rudrajit, et al.
Published: (2026)
Trajectory-Level Data Augmentation for Offline Reinforcement Learning
by: Schmähling, Tobias, et al.
Published: (2026)
by: Schmähling, Tobias, et al.
Published: (2026)
T-JEPA: Augmentation-Free Self-Supervised Learning for Tabular Data
by: Thimonier, Hugo, et al.
Published: (2024)
by: Thimonier, Hugo, et al.
Published: (2024)
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
by: Lu, Rui, et al.
Published: (2025)
by: Lu, Rui, et al.
Published: (2025)
An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation
by: Akbar, Uzair, et al.
Published: (2025)
by: Akbar, Uzair, et al.
Published: (2025)
Intentional Updates for Streaming Reinforcement Learning
by: Sharifnassab, Arsalan, et al.
Published: (2026)
by: Sharifnassab, Arsalan, et al.
Published: (2026)
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
by: Weissenbacher, Matthias, et al.
Published: (2024)
by: Weissenbacher, Matthias, et al.
Published: (2024)
Optimized Local Updates in Federated Learning via Reinforcement Learning
by: Murad, Ali, et al.
Published: (2025)
by: Murad, Ali, et al.
Published: (2025)
On the Benefits of Active Data Collection in Operator Learning
by: Subedi, Unique, et al.
Published: (2024)
by: Subedi, Unique, et al.
Published: (2024)
Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)
by: Russo, Alessio, et al.
Published: (2024)
The Quantization Benefits of Residual-Free Transformers
by: Ji, Yiping, et al.
Published: (2026)
by: Ji, Yiping, et al.
Published: (2026)
Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
by: Labiosa, Adam, et al.
Published: (2024)
by: Labiosa, Adam, et al.
Published: (2024)
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning
by: Huang, Xingshuai, et al.
Published: (2024)
by: Huang, Xingshuai, et al.
Published: (2024)
Similar Items
-
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
by: Corrado, Nicholas E., et al.
Published: (2023) -
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
by: Corrado, Nicholas E., et al.
Published: (2023) -
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
by: Corrado, Nicholas E., et al.
Published: (2026) -
Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
by: Corrado, Nicholas E., et al.
Published: (2025) -
When Can Model-Free Reinforcement Learning be Enough for Thinking?
by: Hanna, Josiah P., et al.
Published: (2025)