| Main Authors: | Beaussant, Samuel, Mounsif, Mehdi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.13892 |
Similar Items
Large Reasoning Models are not thinking straight: on the unreliability of thinking trajectories
by: Cuesta-Ramirez, Jhouben, et al.
Published: (2025)
Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation
by: Li, Lanpei, et al.
Published: (2024)
Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms
by: Zhou, Zehao
Published: (2024)
Embodiment-Aware Generalist Specialist Distillation for Unified Humanoid Whole-Body Control
by: Peng, Quanquan, et al.
Published: (2026)
Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution
by: Seyde, Tim, et al.
Published: (2024)
Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous Control
by: Cho, Seongwoong, et al.
Published: (2024)
RIZE: Adaptive Regularization for Imitation Learning
by: Karimi, Adib, et al.
Published: (2025)
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
by: Wang, Wenlong, et al.
Published: (2024)
Confounding Robust Continuous Control via Automatic Reward Shaping
by: Juliani, Mateo, et al.
Published: (2026)
Learning Pareto Set for Multi-Objective Continuous Robot Control
by: Shu, Tianye, et al.
Published: (2024)
A Review of Online Diffusion Policy RL Algorithms for Scalable Robotic Control
by: Choi, Wonhyeok, et al.
Published: (2026)
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
by: Kachaev, Nikita, et al.
Published: (2025)
Select before Act: Spatially Decoupled Action Repetition for Continuous Control
by: Nie, Buqing, et al.
Published: (2025)
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
by: Korkmaz, Yigit, et al.
Published: (2025)
EfficientTDMPC: Improved MPC Objectives for Sample-Efficient Continuous Control
by: Evers, Thomas, et al.
Published: (2026)
Visualizing Critic Match Loss Landscapes for Interpretation of Online Reinforcement Learning Control Algorithms
by: Liu, Jingyi, et al.
Published: (2026)
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
by: Eschmann, Jonas, et al.
Published: (2023)
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
by: Wang, Shengjie, et al.
Published: (2024)
MemER: Scaling Up Memory for Robot Control via Experience Retrieval
by: Sridhar, Ajay, et al.
Published: (2025)
Reconciling Spatial and Temporal Abstractions for Goal Representation
by: Zadem, Mehdi, et al.
Published: (2024)
Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)
Variational Distillation of Diffusion Policies into Mixture of Experts
by: Zhou, Hongyi, et al.
Published: (2024)
Spiking Neural Networks for Continuous Control via End-to-End Model-Based Learning
by: Huebotter, Justus, et al.
Published: (2025)
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
by: Xu, Charles, et al.
Published: (2024)
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention
by: Ravichandran, Zachary, et al.
Published: (2025)
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
by: Zha, Lihan, et al.
Published: (2023)
Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation
by: Levy, Jacob, et al.
Published: (2026)
Random Network Distillation Based Deep Reinforcement Learning for AGV Path Planning
by: Yin, Huilin, et al.
Published: (2024)
VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing
by: Wang, Yixiao, et al.
Published: (2025)
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation
by: Zhao, Ke, et al.
Published: (2024)
Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation
by: Gupta, Piyush, et al.
Published: (2024)
Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression
by: Acero, Fernando, et al.
Published: (2024)
The Autonomy-Alignment Problem in Open-Ended Learning Robots: Formalising the Purpose Framework
by: Baldassarre, Gianluca, et al.
Published: (2024)
Toward Accurate Long-Horizon Robotic Manipulation: Language-to-Action with Foundation Models via Scene Graphs
by: Dinesh, Sushil Samuel, et al.
Published: (2025)
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
by: Obando-Ceron, Johan, et al.
Published: (2025)
Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm
by: Li, Qinru, et al.
Published: (2023)
Continuous Monte Carlo Graph Search
by: Kujanpää, Kalle, et al.
Published: (2022)
Continuous Reasoning for Vision-Language-Action
by: Wu, Yueh-Hua, et al.
Published: (2026)
Robot Learning with Super-Linear Scaling
by: Torne, Marcel, et al.
Published: (2024)
Decision Transformer as a Foundation Model for Partially Observable Continuous Control
by: Zhang, Xiangyuan, et al.
Published: (2024)