| Main Authors: | Singh, Saurav, Sanchez, Rodney, Ororbia, Alexander, Heard, Jamison |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.02530 |
Similar Items
Intrinsic Vicarious Conditioning for Deep Reinforcement Learning
by: Sanchez, Rodney A, et al.
Published: (2026)
Avoiding Death through Fear Intrinsic Conditioning
by: Sanchez, Rodney, et al.
Published: (2025)
Human Comfortability Index Estimation in Industrial Human-Robot Collaboration Task
by: Savur, Celal, et al.
Published: (2023)
Robust Speech-Workload Estimation for Intelligent Human-Robot Systems
by: Fortune, Julian, et al.
Published: (2025)
Optimizing Neurorobot Policy under Limited Demonstration Data through Preference Regret
by: Nguyen, Viet Dung, et al.
Published: (2026)
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control
by: Kim, Donghu, et al.
Published: (2026)
Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration
by: Li, Guopeng, et al.
Published: (2026)
Off Policy Lyapunov Stability in Reinforcement Learning
by: Gill, Sarvan, et al.
Published: (2025)
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
by: Li, Ge, et al.
Published: (2024)
REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback
by: Chakraborty, Souradip, et al.
Published: (2023)
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
by: Singh, Nikhil Kumar, et al.
Published: (2024)
Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking
by: Smit, Igor G., et al.
Published: (2024)
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated Interactions
by: Ayub, Ali, et al.
Published: (2023)
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
by: Goli, Hossein, et al.
Published: (2025)
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
by: Xu, Charles, et al.
Published: (2024)
Residual Off-Policy RL for Finetuning Behavior Cloning Policies
by: Ankile, Lars, et al.
Published: (2025)
Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains
by: Nobrega, Lucas Nogueira, et al.
Published: (2024)
Scalable Multi-Objective Robot Reinforcement Learning through Gradient Conflict Resolution
by: Munn, Humphrey, et al.
Published: (2025)
TADPO: Reinforcement Learning Goes Off-road
by: Wu, Zhouchonghao, et al.
Published: (2026)
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
by: Hou, Muhan, et al.
Published: (2025)
Robustness Evaluation of Offline Reinforcement Learning for Robot Control Against Action Perturbations
by: Ayabe, Shingo, et al.
Published: (2024)
Contrast Sets for Evaluating Language-Guided Robot Policies
by: Anwar, Abrar, et al.
Published: (2024)
Deep Reinforcement Learning for Robotic Manipulation under Distribution Shift with Bounded Extremum Seeking
by: Saxena, Shaifalee, et al.
Published: (2026)
Bipedalism for Quadrupedal Robots: Versatile Loco-Manipulation through Risk-Adaptive Reinforcement Learning
by: Zhang, Yuyou, et al.
Published: (2025)
Robot Fleet Learning via Policy Merging
by: Wang, Lirui, et al.
Published: (2023)
PolicyFlow: Policy Optimization with Continuous Normalizing Flow in Reinforcement Learning
by: Yang, Shunpeng, et al.
Published: (2026)
Tool-as-Interface: Learning Robot Policies from Observing Human Tool Use
by: Chen, Haonan, et al.
Published: (2025)
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
by: Gao, Tianci, et al.
Published: (2024)
Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation
by: Li, Lanpei, et al.
Published: (2024)
Simulation-Aided Policy Tuning for Black-Box Robot Learning
by: He, Shiming, et al.
Published: (2024)
Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
by: Wang, Ruhan, et al.
Published: (2024)
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
by: Guo, Yihong, et al.
Published: (2025)
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
by: Nakanishi, Kosuke, et al.
Published: (2025)
Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning
by: Wang, Yixiao, et al.
Published: (2024)
Reinforcement Learning with Lie Group Orientations for Robotics
by: Schuck, Martin, et al.
Published: (2024)
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
by: Biza, Ondrej, et al.
Published: (2024)
RoboArena: Distributed Real-World Evaluation of Generalist Robot Policies
by: Atreya, Pranav, et al.
Published: (2025)
PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies
by: Jain, Arhan, et al.
Published: (2025)
Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception
by: Liu, Yandi, et al.
Published: (2025)
Riemannian Flow Matching Policy for Robot Motion Learning
by: Braun, Max, et al.
Published: (2024)