:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Choe, Jean Seong Bjorn, Choi, Bumkyu, Kim, Jong-kook
Format:	Preprint
Published:	2024
Subjects:	Robotics Machine Learning
Online Access:	https://arxiv.org/abs/2409.08938
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Average-Reward Maximum Entropy Reinforcement Learning for Global Policy in Double Pendulum Tasks
by: Choe, Jean Seong Bjorn, et al.
Published: (2025)

Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
by: Choe, Jean Seong Bjorn, et al.
Published: (2024)

Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024)

Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd 'AI Olympics with RealAIGym' Competition
by: Wiebe, Felix, et al.
Published: (2025)

Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics
by: Freitag, Kilian, et al.
Published: (2026)

The Bid Picture: Auction-Inspired Multi-player Generative Adversarial Networks Training
by: Shim, Joo Yong, et al.
Published: (2024)

Curriculum Reinforcement Learning for Complex Reward Functions
by: Freitag, Kilian, et al.
Published: (2024)

On-Robot Reinforcement Learning with Goal-Contrastive Rewards
by: Biza, Ondrej, et al.
Published: (2024)

Robotic Skill Diversification via Active Mutation of Reward Functions in Reinforcement Learning During a Liquid Pouring Task
by: van Buuren, Jannick, et al.
Published: (2025)

Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
by: Böhm, Peter, et al.
Published: (2025)

Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
by: Vasan, Gautham, et al.
Published: (2024)

Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)

Real-Time Model Predictive Control for the Swing-Up Problem of an Underactuated Double Pendulum
by: Burchard, Blanka, et al.
Published: (2025)

Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
by: Miranda, Victor R. F., et al.
Published: (2022)

Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
by: You, Bang, et al.
Published: (2025)

Multi-Task Reinforcement Learning for Quadrotors
by: Xing, Jiaxu, et al.
Published: (2024)

FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving
by: Kou, Wei-Bin, et al.
Published: (2025)

ProgVLA: Progress-Aware Robot Manipulation Skill Learning
by: Kim, Seungsu, et al.
Published: (2026)

Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
by: Yunis, David, et al.
Published: (2023)

Residual Reward Models for Preference-based Reinforcement Learning
by: Cao, Chenyang, et al.
Published: (2025)

DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
by: Diaz-Bone, Leander, et al.
Published: (2025)

DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
by: Mu, Tongzhou, et al.
Published: (2024)

Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning
by: Venugopal, Aravind, et al.
Published: (2026)

REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback
by: Chakraborty, Souradip, et al.
Published: (2023)

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
by: Xie, Tianbao, et al.
Published: (2023)

BiCQL-ML: A Bi-Level Conservative Q-Learning Framework for Maximum Likelihood Inverse Reinforcement Learning
by: Park, Junsung
Published: (2025)

Deep Reinforcement Learning for Haptic Shared Control in Unknown Tasks
by: Fernandez, Franklin Cardeñoso, et al.
Published: (2021)

Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control
by: Jiang, Sicong, et al.
Published: (2024)

MoRe-ERL: Learning Motion Residuals using Episodic Reinforcement Learning
by: Huang, Xi, et al.
Published: (2025)

Projected Task-Specific Layers for Multi-Task Reinforcement Learning
by: Roberts, Josselin Somerville, et al.
Published: (2023)

STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
by: Wu, Zhenwei, et al.
Published: (2025)

Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback
by: Zhang, Jenny, et al.
Published: (2024)

A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
by: Abouelazm, Ahmed, et al.
Published: (2024)

Beware Untrusted Simulators -- Reward-Free Backdoor Attacks in Reinforcement Learning
by: Rathbun, Ethan, et al.
Published: (2026)

Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
by: Asodia, Vinal, et al.
Published: (2025)

Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)

MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
by: Huang, Suning, et al.
Published: (2024)

Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
by: Chang, Junwoo, et al.
Published: (2025)

LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
by: Singh, Utsav, et al.
Published: (2024)

Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
by: Guo, Yihong, et al.
Published: (2024)