:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Corrado, Nicholas E., Hanna, Josiah P.
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2310.17786
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
by: Corrado, Nicholas E., et al.
Published: (2023)

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
by: Corrado, Nicholas E., et al.
Published: (2023)

Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
by: Corrado, Nicholas E., et al.
Published: (2026)

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
by: Corrado, Nicholas E., et al.
Published: (2025)

When Can Model-Free Reinforcement Learning be Enough for Thinking?
by: Hanna, Josiah P., et al.
Published: (2025)

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
by: Pavse, Brahma S., et al.
Published: (2023)

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)

Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)

Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)

Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics
by: Kratz, Josiah C., et al.
Published: (2024)

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
by: Mukherjee, Subhojyoti, et al.
Published: (2024)

Stable Offline Value Function Learning with Bisimulation-based Representations
by: Pavse, Brahma S., et al.
Published: (2024)

Sparsely Multimodal Data Fusion
by: Bjorgaard, Josiah
Published: (2024)

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
by: Mukherjee, Subhojyoti, et al.
Published: (2023)

An Empirical Study on the Power of Future Prediction in Partially Observable Environments
by: Kwon, Jeongyeol, et al.
Published: (2024)

Adversarial Label Invariant Graph Data Augmentations for Out-of-Distribution Generalization
by: Zhang, Simon, et al.
Published: (2026)

Augmentation Invariant Manifold Learning
by: Wang, Shulei
Published: (2022)

Copy-Augmented Representation for Structure Invariant Template-Free Retrosynthesis
by: Zhuang, Jiaxi, et al.
Published: (2025)

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
by: Zhou, Hongyi, et al.
Published: (2025)

On the Benefit of Optimal Transport for Curriculum Reinforcement Learning
by: Klink, Pascal, et al.
Published: (2023)

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
by: Voelcker, Claas A, et al.
Published: (2024)

Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
by: Song, Minhak, et al.
Published: (2025)

Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
by: Cao, Hongye, et al.
Published: (2025)

Reinforcement Learning for Graph Coloring: Understanding the Power and Limits of Non-Label Invariant Representations
by: Cummins, Chase, et al.
Published: (2024)

Reviving Stale Updates: Data-Free Knowledge Distillation for Asynchronous Federated Learning
by: Askin, Baris, et al.
Published: (2025)

Understanding the Generalization Benefits of Late Learning Rate Decay
by: Ren, Yinuo, et al.
Published: (2024)

Fundamental Benefit of Alternating Updates in Minimax Optimization
by: Lee, Jaewook, et al.
Published: (2024)

Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon
by: Das, Rudrajit, et al.
Published: (2026)

Trajectory-Level Data Augmentation for Offline Reinforcement Learning
by: Schmähling, Tobias, et al.
Published: (2026)

T-JEPA: Augmentation-Free Self-Supervised Learning for Tabular Data
by: Thimonier, Hugo, et al.
Published: (2024)

Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
by: Lu, Rui, et al.
Published: (2025)

An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation
by: Akbar, Uzair, et al.
Published: (2025)

Intentional Updates for Streaming Reinforcement Learning
by: Sharifnassab, Arsalan, et al.
Published: (2026)

SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
by: Weissenbacher, Matthias, et al.
Published: (2024)

Optimized Local Updates in Federated Learning via Reinforcement Learning
by: Murad, Ali, et al.
Published: (2025)

On the Benefits of Active Data Collection in Operator Learning
by: Subedi, Unique, et al.
Published: (2024)

Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)

The Quantization Benefits of Residual-Free Transformers
by: Ji, Yiping, et al.
Published: (2026)

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
by: Labiosa, Adam, et al.
Published: (2024)

Goal-Conditioned Data Augmentation for Offline Reinforcement Learning
by: Huang, Xingshuai, et al.
Published: (2024)