Saved in:
| Main Authors: | Choe, Jean Seong Bjorn, Choi, Bumkyu, Kim, Jong-kook |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.08938 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Average-Reward Maximum Entropy Reinforcement Learning for Global Policy in Double Pendulum Tasks
by: Choe, Jean Seong Bjorn, et al.
Published: (2025)
by: Choe, Jean Seong Bjorn, et al.
Published: (2025)
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
by: Choe, Jean Seong Bjorn, et al.
Published: (2024)
by: Choe, Jean Seong Bjorn, et al.
Published: (2024)
Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024)
by: Wang, Jiexin, et al.
Published: (2024)
Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd 'AI Olympics with RealAIGym' Competition
by: Wiebe, Felix, et al.
Published: (2025)
by: Wiebe, Felix, et al.
Published: (2025)
Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics
by: Freitag, Kilian, et al.
Published: (2026)
by: Freitag, Kilian, et al.
Published: (2026)
The Bid Picture: Auction-Inspired Multi-player Generative Adversarial Networks Training
by: Shim, Joo Yong, et al.
Published: (2024)
by: Shim, Joo Yong, et al.
Published: (2024)
Curriculum Reinforcement Learning for Complex Reward Functions
by: Freitag, Kilian, et al.
Published: (2024)
by: Freitag, Kilian, et al.
Published: (2024)
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
by: Biza, Ondrej, et al.
Published: (2024)
by: Biza, Ondrej, et al.
Published: (2024)
Robotic Skill Diversification via Active Mutation of Reward Functions in Reinforcement Learning During a Liquid Pouring Task
by: van Buuren, Jannick, et al.
Published: (2025)
by: van Buuren, Jannick, et al.
Published: (2025)
Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
by: Böhm, Peter, et al.
Published: (2025)
by: Böhm, Peter, et al.
Published: (2025)
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
by: Vasan, Gautham, et al.
Published: (2024)
by: Vasan, Gautham, et al.
Published: (2024)
Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)
by: Ishihara, Yu, et al.
Published: (2025)
Real-Time Model Predictive Control for the Swing-Up Problem of an Underactuated Double Pendulum
by: Burchard, Blanka, et al.
Published: (2025)
by: Burchard, Blanka, et al.
Published: (2025)
Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
by: Miranda, Victor R. F., et al.
Published: (2022)
by: Miranda, Victor R. F., et al.
Published: (2022)
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
by: You, Bang, et al.
Published: (2025)
by: You, Bang, et al.
Published: (2025)
Multi-Task Reinforcement Learning for Quadrotors
by: Xing, Jiaxu, et al.
Published: (2024)
by: Xing, Jiaxu, et al.
Published: (2024)
FedEMA: Federated Exponential Moving Averaging with Negative Entropy Regularizer in Autonomous Driving
by: Kou, Wei-Bin, et al.
Published: (2025)
by: Kou, Wei-Bin, et al.
Published: (2025)
ProgVLA: Progress-Aware Robot Manipulation Skill Learning
by: Kim, Seungsu, et al.
Published: (2026)
by: Kim, Seungsu, et al.
Published: (2026)
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
by: Yunis, David, et al.
Published: (2023)
by: Yunis, David, et al.
Published: (2023)
Residual Reward Models for Preference-based Reinforcement Learning
by: Cao, Chenyang, et al.
Published: (2025)
by: Cao, Chenyang, et al.
Published: (2025)
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
by: Diaz-Bone, Leander, et al.
Published: (2025)
by: Diaz-Bone, Leander, et al.
Published: (2025)
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
by: Mu, Tongzhou, et al.
Published: (2024)
by: Mu, Tongzhou, et al.
Published: (2024)
Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning
by: Venugopal, Aravind, et al.
Published: (2026)
by: Venugopal, Aravind, et al.
Published: (2026)
REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback
by: Chakraborty, Souradip, et al.
Published: (2023)
by: Chakraborty, Souradip, et al.
Published: (2023)
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
by: Xie, Tianbao, et al.
Published: (2023)
by: Xie, Tianbao, et al.
Published: (2023)
BiCQL-ML: A Bi-Level Conservative Q-Learning Framework for Maximum Likelihood Inverse Reinforcement Learning
by: Park, Junsung
Published: (2025)
by: Park, Junsung
Published: (2025)
Deep Reinforcement Learning for Haptic Shared Control in Unknown Tasks
by: Fernandez, Franklin Cardeñoso, et al.
Published: (2021)
by: Fernandez, Franklin Cardeñoso, et al.
Published: (2021)
Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control
by: Jiang, Sicong, et al.
Published: (2024)
by: Jiang, Sicong, et al.
Published: (2024)
MoRe-ERL: Learning Motion Residuals using Episodic Reinforcement Learning
by: Huang, Xi, et al.
Published: (2025)
by: Huang, Xi, et al.
Published: (2025)
Projected Task-Specific Layers for Multi-Task Reinforcement Learning
by: Roberts, Josselin Somerville, et al.
Published: (2023)
by: Roberts, Josselin Somerville, et al.
Published: (2023)
STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
by: Wu, Zhenwei, et al.
Published: (2025)
by: Wu, Zhenwei, et al.
Published: (2025)
Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback
by: Zhang, Jenny, et al.
Published: (2024)
by: Zhang, Jenny, et al.
Published: (2024)
A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
by: Abouelazm, Ahmed, et al.
Published: (2024)
by: Abouelazm, Ahmed, et al.
Published: (2024)
Beware Untrusted Simulators -- Reward-Free Backdoor Attacks in Reinforcement Learning
by: Rathbun, Ethan, et al.
Published: (2026)
by: Rathbun, Ethan, et al.
Published: (2026)
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
by: Asodia, Vinal, et al.
Published: (2025)
by: Asodia, Vinal, et al.
Published: (2025)
Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)
by: Harish, Abhinav Narayan, et al.
Published: (2024)
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
by: Huang, Suning, et al.
Published: (2024)
by: Huang, Suning, et al.
Published: (2024)
Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
by: Chang, Junwoo, et al.
Published: (2025)
by: Chang, Junwoo, et al.
Published: (2025)
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
by: Singh, Utsav, et al.
Published: (2024)
by: Singh, Utsav, et al.
Published: (2024)
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
by: Guo, Yihong, et al.
Published: (2024)
by: Guo, Yihong, et al.
Published: (2024)
Similar Items
-
Average-Reward Maximum Entropy Reinforcement Learning for Global Policy in Double Pendulum Tasks
by: Choe, Jean Seong Bjorn, et al.
Published: (2025) -
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
by: Choe, Jean Seong Bjorn, et al.
Published: (2024) -
Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024) -
Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd 'AI Olympics with RealAIGym' Competition
by: Wiebe, Felix, et al.
Published: (2025) -
Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics
by: Freitag, Kilian, et al.
Published: (2026)