Library Catalog

Bibliographic Details
Main authors:	Gomaa, Amr; Mahdy, Bilal
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online access:	https://arxiv.org/abs/2410.21403

Similar items

Toward a Surgeon-in-the-Loop Ophthalmic Robotic Apprentice using Reinforcement and Imitation Learning
by: Gomaa, Amr, et al.
Published: (2023)

Reinforcement Learning via Implicit Imitation Guidance
by: Dong, Perry, et al.
Published: (2025)

Guidance is All You Need: Temperature-Guided Reasoning in Large Language Models
by: Gomaa, Eyad, et al.
Published: (2024)

AdaptoML-UX: An Adaptive User-centered GUI-based AutoML Toolkit for Non-AI Experts and HCI Researchers
by: Gomaa, Amr, et al.
Published: (2024)

Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
by: Huang, Kevin, et al.
Published: (2025)

Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
by: Ginzburg, Elinor, et al.
Published: (2025)

Imitation Bootstrapped Reinforcement Learning
by: Hu, Hengyuan, et al.
Published: (2023)

Causal Imitation Learning under Expert-Observable and Expert-Unobservable Confounding
by: Shao, Daqian, et al.
Published: (2025)

Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
by: Cao, Wenjun
Published: (2025)

Improving Mixed-Criticality Scheduling with Reinforcement Learning
by: El-Mahdy, Muhammad, et al.
Published: (2025)

RILe: Reinforced Imitation Learning
by: Albaba, Mert, et al.
Published: (2024)

Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning
by: Malomgré, Elias, et al.
Published: (2025)

Looking for a better fit? An Incremental Learning Multimodal Object Referencing Framework adapting to Individual Drivers
by: Gomaa, Amr, et al.
Published: (2024)

IDIL: Imitation Learning of Intent-Driven Expert Behavior
by: Seo, Sangwon, et al.
Published: (2024)

Imitating Cost-Constrained Behaviors in Reinforcement Learning
by: Shao, Qian, et al.
Published: (2024)

An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
by: Xu, Haoran, et al.
Published: (2025)

Blending Imitation and Reinforcement Learning for Robust Policy Improvement
by: Liu, Xuefeng, et al.
Published: (2023)

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
by: Zhao, Heyang, et al.
Published: (2025)

Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections
by: Lauffer, Niklas, et al.
Published: (2025)

Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
by: Li, Shangzhe, et al.
Published: (2025)

Learning What to Do and What Not To Do: Offline Imitation from Expert and Undesirable Demonstrations
by: Hoang, Huy, et al.
Published: (2025)

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
by: Hoang, Huy, et al.
Published: (2023)

Physics-informed Imitative Reinforcement Learning for Real-world Driving
by: Zhou, Hang, et al.
Published: (2024)

Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
by: Sikchi, Harshit, et al.
Published: (2023)

TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control
by: Zhuang, Zifeng, et al.
Published: (2025)

Imitating Language via Scalable Inverse Reinforcement Learning
by: Wulfmeier, Markus, et al.
Published: (2024)

Inverse Reinforcement Learning with Sub-optimal Experts
by: Poiani, Riccardo, et al.
Published: (2024)

Mixture-of-Experts Meets In-Context Reinforcement Learning
by: Wu, Wenhao, et al.
Published: (2025)

Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction
by: Firouzkouhi, Arad, et al.
Published: (2025)

Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model
by: Cho, Hyunsoo
Published: (2024)

GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
by: Lee, Jaewoo, et al.
Published: (2024)

Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
by: Mao, Liyuan, et al.
Published: (2024)

Decoupled Guidance Diffusion for Adaptive Offline Safe Reinforcement Learning
by: Chen, Rufeng, et al.
Published: (2026)

GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning
by: Liu, Ziru, et al.
Published: (2025)

Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation
by: Hasan, Mohammad Junayed, et al.
Published: (2023)

DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
by: Wan, Weikang, et al.
Published: (2024)

Quantifying Generalisation in Imitation Learning
by: Gavenski, Nathan, et al.
Published: (2025)

Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories
by: Mobarakeh, Niloufar Saeidi, et al.
Published: (2024)

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance
by: Jin, Luozhijie, et al.
Published: (2025)

MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
by: Nourzad, Narjes, et al.
Published: (2026)