:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Zhang, Jenny, Heim, Steve, Jeon, Se Hwan, Kim, Sangbae
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Robotics Machine Learning
Accesso online:	https://arxiv.org/abs/2402.08662
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning
di: Li, Chenhao, et al.
Pubblicazione: (2024)

Learning Humanoid Arm Motion via Centroidal Momentum Regularized Multi-Agent Reinforcement Learning
di: Lee, Ho Jae, et al.
Pubblicazione: (2025)

Residual MPC: Blending Reinforcement Learning with GPU-Parallelized Model Predictive Control
di: Jeon, Se Hwan, et al.
Pubblicazione: (2025)

CusADi: A GPU Parallelization Framework for Symbolic Expressions and Optimal Control
di: Jeon, Se Hwan, et al.
Pubblicazione: (2024)

Learning Reactive Dexterous Grasping via Hierarchical Task-Space RL Planning and Joint-Space QP Control
di: Lee, Ho Jae, et al.
Pubblicazione: (2026)

Adaptive Querying for Reward Learning from Human Feedback
di: Anand, Yashwanthi, et al.
Pubblicazione: (2024)

Policy Learning from Large Vision-Language Model Feedback without Reward Modeling
di: Luu, Tung M., et al.
Pubblicazione: (2025)

REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback
di: Chakraborty, Souradip, et al.
Pubblicazione: (2023)

Causally Robust Reward Learning from Reason-Augmented Preference Feedback
di: Hwang, Minjune, et al.
Pubblicazione: (2026)

STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
di: Wu, Zhenwei, et al.
Pubblicazione: (2025)

Safe Value Functions
di: Massiani, Pierre-François, et al.
Pubblicazione: (2021)

Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion
di: Bohlinger, Nico, et al.
Pubblicazione: (2025)

Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
di: Choe, Jean Seong Bjorn, et al.
Pubblicazione: (2024)

Probabilistic Safety Guarantee for Stochastic Control Systems Using Average Reward MDPs
di: Omidi, Saber, et al.
Pubblicazione: (2025)

Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion
di: Mitchell, Alexander L., et al.
Pubblicazione: (2024)

Learn to Swim: Data-Driven LSTM Hydrodynamic Model for Quadruped Robot Gait Optimization
di: Han, Fei, et al.
Pubblicazione: (2025)

ProcVLM: Learning Procedure-Grounded Progress Rewards for Robotic Manipulation
di: Feng, Youhe, et al.
Pubblicazione: (2026)

CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control
di: Zhang, Yifeng, et al.
Pubblicazione: (2026)

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
di: Luu, Tung Minh, et al.
Pubblicazione: (2025)

Curriculum Reinforcement Learning for Complex Reward Functions
di: Freitag, Kilian, et al.
Pubblicazione: (2024)

On-Robot Reinforcement Learning with Goal-Contrastive Rewards
di: Biza, Ondrej, et al.
Pubblicazione: (2024)

Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
di: Vasan, Gautham, et al.
Pubblicazione: (2024)

Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
di: Ishihara, Yu, et al.
Pubblicazione: (2025)

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
di: Zheng, Qinqing, et al.
Pubblicazione: (2024)

Evidence of an Emergent "Self" in Continual Robot Learning
di: Jhunjhunwala, Adidev, et al.
Pubblicazione: (2026)

Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
di: Miranda, Victor R. F., et al.
Pubblicazione: (2022)

Rewarding DINO: Predicting Dense Rewards with Vision Foundation Models
di: Krack, Pierre, et al.
Pubblicazione: (2026)

Robot Policy Learning with Temporal Optimal Transport Reward
di: Fu, Yuwei, et al.
Pubblicazione: (2024)

The Dark Side of Rich Rewards: Understanding and Mitigating Noise in VLM Rewards
di: Huang, Sukai, et al.
Pubblicazione: (2024)

ReLAM: Learning Anticipation Model for Rewarding Visual Robotic Manipulation
di: Tang, Nan, et al.
Pubblicazione: (2025)

SCDP: Learning Humanoid Locomotion from Partial Observations via Mixed-Observation Distillation
di: Carroll, Milo, et al.
Pubblicazione: (2026)

Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment
di: Tian, Ran, et al.
Pubblicazione: (2024)

Diffusion-Reward Adversarial Imitation Learning
di: Lai, Chun-Mao, et al.
Pubblicazione: (2024)

Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control
di: Park, Seongmin, et al.
Pubblicazione: (2024)

Teaching Robots to Handle Nuclear Waste: A Teleoperation-Based Learning Approach<
di: Lee, Joong-Ku, et al.
Pubblicazione: (2025)

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
di: Zhang, Chen Bo Calvin, et al.
Pubblicazione: (2024)

Improving Generalization Ability of Robotic Imitation Learning by Resolving Causal Confusion in Observations
di: Chen, Yifei, et al.
Pubblicazione: (2025)

ELEMENTAL: Interactive Learning from Demonstrations and Vision-Language Models for Reward Design in Robotics
di: Chen, Letian, et al.
Pubblicazione: (2024)

Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning
di: Venugopal, Aravind, et al.
Pubblicazione: (2026)

Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics
di: Freitag, Kilian, et al.
Pubblicazione: (2026)