| Main authors: | Gomaa, Amr, Mahdy, Bilal |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2410.21403 |
Similar items
Toward a Surgeon-in-the-Loop Ophthalmic Robotic Apprentice using Reinforcement and Imitation Learning
by: Gomaa, Amr, et al.
Published: (2023)
Reinforcement Learning via Implicit Imitation Guidance
by: Dong, Perry, et al.
Published: (2025)
Guidance is All You Need: Temperature-Guided Reasoning in Large Language Models
by: Gomaa, Eyad, et al.
Published: (2024)
AdaptoML-UX: An Adaptive User-centered GUI-based AutoML Toolkit for Non-AI Experts and HCI Researchers
by: Gomaa, Amr, et al.
Published: (2024)
Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
by: Huang, Kevin, et al.
Published: (2025)
Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
by: Ginzburg, Elinor, et al.
Published: (2025)
Imitation Bootstrapped Reinforcement Learning
by: Hu, Hengyuan, et al.
Published: (2023)
Causal Imitation Learning under Expert-Observable and Expert-Unobservable Confounding
by: Shao, Daqian, et al.
Published: (2025)
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
by: Cao, Wenjun
Published: (2025)
Improving Mixed-Criticality Scheduling with Reinforcement Learning
by: El-Mahdy, Muhammad, et al.
Published: (2025)
RILe: Reinforced Imitation Learning
by: Albaba, Mert, et al.
Published: (2024)
Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning
by: Malomgré, Elias, et al.
Published: (2025)
Looking for a better fit? An Incremental Learning Multimodal Object Referencing Framework adapting to Individual Drivers
by: Gomaa, Amr, et al.
Published: (2024)
IDIL: Imitation Learning of Intent-Driven Expert Behavior
by: Seo, Sangwon, et al.
Published: (2024)
Imitating Cost-Constrained Behaviors in Reinforcement Learning
by: Shao, Qian, et al.
Published: (2024)
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
by: Xu, Haoran, et al.
Published: (2025)
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
by: Liu, Xuefeng, et al.
Published: (2023)
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
by: Zhao, Heyang, et al.
Published: (2025)
Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections
by: Lauffer, Niklas, et al.
Published: (2025)
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
by: Li, Shangzhe, et al.
Published: (2025)
Learning What to Do and What Not To Do: Offline Imitation from Expert and Undesirable Demonstrations
by: Hoang, Huy, et al.
Published: (2025)
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
by: Hoang, Huy, et al.
Published: (2023)
Physics-informed Imitative Reinforcement Learning for Real-world Driving
by: Zhou, Hang, et al.
Published: (2024)
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
by: Sikchi, Harshit, et al.
Published: (2023)
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control
by: Zhuang, Zifeng, et al.
Published: (2025)
Imitating Language via Scalable Inverse Reinforcement Learning
by: Wulfmeier, Markus, et al.
Published: (2024)
Inverse Reinforcement Learning with Sub-optimal Experts
by: Poiani, Riccardo, et al.
Published: (2024)
Mixture-of-Experts Meets In-Context Reinforcement Learning
by: Wu, Wenhao, et al.
Published: (2025)
Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction
by: Firouzkouhi, Arad, et al.
Published: (2025)
Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model
by: Cho, Hyunsoo
Published: (2024)
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
by: Lee, Jaewoo, et al.
Published: (2024)
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
by: Mao, Liyuan, et al.
Published: (2024)
Decoupled Guidance Diffusion for Adaptive Offline Safe Reinforcement Learning
by: Chen, Rufeng, et al.
Published: (2026)
GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning
by: Liu, Ziru, et al.
Published: (2025)
Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation
by: Hasan, Mohammad Junayed, et al.
Published: (2023)
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
by: Wan, Weikang, et al.
Published: (2024)
Quantifying Generalisation in Imitation Learning
by: Gavenski, Nathan, et al.
Published: (2025)
Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories
by: Mobarakeh, Niloufar Saeidi, et al.
Published: (2024)
Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance
by: Jin, Luozhijie, et al.
Published: (2025)
MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
by: Nourzad, Narjes, et al.
Published: (2026)