Saved in:
| Main Authors: | Yao, Changwei, Liu, Xinzi, Li, Chen, Savvides, Marios |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.16136 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos
by: Mahesheka, Harsh, et al.
Published: (2024)
by: Mahesheka, Harsh, et al.
Published: (2024)
MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
by: Li, Chen, et al.
Published: (2025)
by: Li, Chen, et al.
Published: (2025)
BiCQL-ML: A Bi-Level Conservative Q-Learning Framework for Maximum Likelihood Inverse Reinforcement Learning
by: Park, Junsung
Published: (2025)
by: Park, Junsung
Published: (2025)
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
by: Xie, Tianbao, et al.
Published: (2023)
by: Xie, Tianbao, et al.
Published: (2023)
Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization
by: Hu, Zechen, et al.
Published: (2024)
by: Hu, Zechen, et al.
Published: (2024)
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models
by: Zhong, Linqing, et al.
Published: (2026)
by: Zhong, Linqing, et al.
Published: (2026)
Bi-Level Reinforcement Learning Control for an Underactuated Blimp via Center-of-Mass Reconfiguration
by: Wang, Xiaorui, et al.
Published: (2026)
by: Wang, Xiaorui, et al.
Published: (2026)
Embodied Learning of Reward for Musculoskeletal Control with Vision Language Models
by: Soedarmadji, Saraswati, et al.
Published: (2025)
by: Soedarmadji, Saraswati, et al.
Published: (2025)
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
by: Chen, Yuhui, et al.
Published: (2025)
by: Chen, Yuhui, et al.
Published: (2025)
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment
by: Zeng, Yuwei, et al.
Published: (2024)
by: Zeng, Yuwei, et al.
Published: (2024)
RLinf-VLA: A Unified and Efficient Framework for Reinforcement Learning of Vision-Language-Action Models
by: Zang, Hongzhi, et al.
Published: (2025)
by: Zang, Hongzhi, et al.
Published: (2025)
Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning
by: Mo, Shentong
Published: (2026)
by: Mo, Shentong
Published: (2026)
Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuning
by: Huang, Zhiyu, et al.
Published: (2024)
by: Huang, Zhiyu, et al.
Published: (2024)
A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control
by: Zarrouki, Baha, et al.
Published: (2024)
by: Zarrouki, Baha, et al.
Published: (2024)
KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition
by: Han, Gaoge, et al.
Published: (2026)
by: Han, Gaoge, et al.
Published: (2026)
Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse Reward
by: Yang, Jiarui, et al.
Published: (2025)
by: Yang, Jiarui, et al.
Published: (2025)
Reward Training Wheels: Adaptive Auxiliary Rewards for Robotics Reinforcement Learning
by: Wang, Linji, et al.
Published: (2025)
by: Wang, Linji, et al.
Published: (2025)
Design of Reward Function on Reinforcement Learning for Automated Driving
by: Goto, Takeru, et al.
Published: (2025)
by: Goto, Takeru, et al.
Published: (2025)
Residual Reward Models for Preference-based Reinforcement Learning
by: Cao, Chenyang, et al.
Published: (2025)
by: Cao, Chenyang, et al.
Published: (2025)
GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning
by: Luo, Yingbo, et al.
Published: (2025)
by: Luo, Yingbo, et al.
Published: (2025)
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
by: Yu, Beomyeol, et al.
Published: (2025)
by: Yu, Beomyeol, et al.
Published: (2025)
Task-Oriented Grasping Using Reinforcement Learning with a Contextual Reward Machine
by: Li, Hui, et al.
Published: (2025)
by: Li, Hui, et al.
Published: (2025)
Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution
by: Huang, Changxin, et al.
Published: (2024)
by: Huang, Changxin, et al.
Published: (2024)
Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions
by: Ishihara, Yu, et al.
Published: (2025)
by: Ishihara, Yu, et al.
Published: (2025)
Curriculum Reinforcement Learning for Complex Reward Functions
by: Freitag, Kilian, et al.
Published: (2024)
by: Freitag, Kilian, et al.
Published: (2024)
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
by: Biza, Ondrej, et al.
Published: (2024)
by: Biza, Ondrej, et al.
Published: (2024)
FlowVLA: Visual Chain of Thought-based Motion Reasoning for Vision-Language-Action Models
by: Zhong, Zhide, et al.
Published: (2025)
by: Zhong, Zhide, et al.
Published: (2025)
V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts
by: Chiu, Hsu-kuang, et al.
Published: (2025)
by: Chiu, Hsu-kuang, et al.
Published: (2025)
RoboReward: General-Purpose Vision-Language Reward Models for Robotics
by: Lee, Tony, et al.
Published: (2026)
by: Lee, Tony, et al.
Published: (2026)
Eureka: Human-Level Reward Design via Coding Large Language Models
by: Ma, Yecheng Jason, et al.
Published: (2023)
by: Ma, Yecheng Jason, et al.
Published: (2023)
RoVer: Robot Reward Model as Test-Time Verifier for Vision-Language-Action Model
by: Dai, Mingtong, et al.
Published: (2025)
by: Dai, Mingtong, et al.
Published: (2025)
A Graph-Based Reinforcement Learning Approach with Frontier Potential Based Reward for Safe Cluttered Environment Exploration
by: Calzolari, Gabriele, et al.
Published: (2025)
by: Calzolari, Gabriele, et al.
Published: (2025)
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
by: Singh, Utsav, et al.
Published: (2024)
by: Singh, Utsav, et al.
Published: (2024)
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
by: Vasan, Gautham, et al.
Published: (2024)
by: Vasan, Gautham, et al.
Published: (2024)
RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models
by: Chen, Yuxuan, et al.
Published: (2025)
by: Chen, Yuxuan, et al.
Published: (2025)
ELEMENTAL: Interactive Learning from Demonstrations and Vision-Language Models for Reward Design in Robotics
by: Chen, Letian, et al.
Published: (2024)
by: Chen, Letian, et al.
Published: (2024)
Reinforced Embodied Planning with Verifiable Reward for Real-World Robotic Manipulation
by: Bo, Zitong, et al.
Published: (2025)
by: Bo, Zitong, et al.
Published: (2025)
Reward-Punishment Reinforcement Learning with Maximum Entropy
by: Wang, Jiexin, et al.
Published: (2024)
by: Wang, Jiexin, et al.
Published: (2024)
STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion
by: Wu, Zhenwei, et al.
Published: (2025)
by: Wu, Zhenwei, et al.
Published: (2025)
LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control
by: Zhang, Yifeng, et al.
Published: (2026)
by: Zhang, Yifeng, et al.
Published: (2026)
Similar Items
-
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos
by: Mahesheka, Harsh, et al.
Published: (2024) -
MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
by: Li, Chen, et al.
Published: (2025) -
BiCQL-ML: A Bi-Level Conservative Q-Learning Framework for Maximum Likelihood Inverse Reinforcement Learning
by: Park, Junsung
Published: (2025) -
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
by: Xie, Tianbao, et al.
Published: (2023) -
Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization
by: Hu, Zechen, et al.
Published: (2024)