Saved in:
| Main Authors: | Feng, Yukai, Wu, Zhiheng, Wu, Zhengxing, Gu, Junwen, Yu, Junzhi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.19404 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
by: Gu, Junwen, et al.
Published: (2025)
by: Gu, Junwen, et al.
Published: (2025)
Extending Group Relative Policy Optimization to Continuous Control: A Theoretical Framework for Robotic Reinforcement Learning
by: Khanda, Rajat, et al.
Published: (2025)
by: Khanda, Rajat, et al.
Published: (2025)
AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Distributed AI Agents for Cognitive Underwater Robot Autonomy
by: Buchholz, Markus, et al.
Published: (2025)
by: Buchholz, Markus, et al.
Published: (2025)
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
by: Feng, Pu, et al.
Published: (2024)
by: Feng, Pu, et al.
Published: (2024)
Human-assisted Robotic Policy Refinement via Action Preference Optimization
by: Xia, Wenke, et al.
Published: (2025)
by: Xia, Wenke, et al.
Published: (2025)
HALO: Learning Human-Robot Collaboration via Heterogeneous-Agent Lyapunov Policy Optimization
by: Zhang, Hao, et al.
Published: (2026)
by: Zhang, Hao, et al.
Published: (2026)
Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones
by: Xiao, Jiaping, et al.
Published: (2023)
by: Xiao, Jiaping, et al.
Published: (2023)
Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning
by: Zhang, Xichen, et al.
Published: (2025)
by: Zhang, Xichen, et al.
Published: (2025)
HybridFlow: A Two-Step Generative Policy for Robotic Manipulation
by: Dong, Zhenchen, et al.
Published: (2026)
by: Dong, Zhenchen, et al.
Published: (2026)
TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization
by: Chen, Zengjue, et al.
Published: (2025)
by: Chen, Zengjue, et al.
Published: (2025)
Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback
by: Gao, Feng, et al.
Published: (2024)
by: Gao, Feng, et al.
Published: (2024)
Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimization for Multi-Robot SLAM
by: Ghanta, Sai Krishna, et al.
Published: (2025)
by: Ghanta, Sai Krishna, et al.
Published: (2025)
State-of-the-art in Robot Learning for Multi-Robot Collaboration: A Comprehensive Survey
by: Wu, Bin, et al.
Published: (2024)
by: Wu, Bin, et al.
Published: (2024)
Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy
by: Wu, Junyang, et al.
Published: (2026)
by: Wu, Junyang, et al.
Published: (2026)
EndoSERV: A Vision-based Endoluminal Robot Navigation System
by: Wu, Junyang, et al.
Published: (2026)
by: Wu, Junyang, et al.
Published: (2026)
RC-GRPO: Reward-Conditioned Group Relative Policy Optimization for Multi-Turn Tool Calling Agents
by: Zhong, Haitian, et al.
Published: (2026)
by: Zhong, Haitian, et al.
Published: (2026)
PhysiAgent: An Embodied Agent Framework in Physical World
by: Wang, Zhihao, et al.
Published: (2025)
by: Wang, Zhihao, et al.
Published: (2025)
Symmetry-Guided Multi-Agent Inverse Reinforcement Learning
by: Tian, Yongkai, et al.
Published: (2025)
by: Tian, Yongkai, et al.
Published: (2025)
UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation
by: Li, Hao, et al.
Published: (2026)
by: Li, Hao, et al.
Published: (2026)
EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control
by: Wan, Hanwen, et al.
Published: (2025)
by: Wan, Hanwen, et al.
Published: (2025)
Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation
by: Lin, Hongbin, et al.
Published: (2025)
by: Lin, Hongbin, et al.
Published: (2025)
Adaptive Diffusion Policy Optimization for Robotic Manipulation
by: Jiang, Huiyun, et al.
Published: (2025)
by: Jiang, Huiyun, et al.
Published: (2025)
Multi-Agent Systems for Robotic Autonomy with LLMs
by: Chen, Junhong, et al.
Published: (2025)
by: Chen, Junhong, et al.
Published: (2025)
HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit
by: Li, Yang, et al.
Published: (2024)
by: Li, Yang, et al.
Published: (2024)
Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning
by: Kawabe, Tomoya, et al.
Published: (2026)
by: Kawabe, Tomoya, et al.
Published: (2026)
Robot Policy Learning with Temporal Optimal Transport Reward
by: Fu, Yuwei, et al.
Published: (2024)
by: Fu, Yuwei, et al.
Published: (2024)
Neural Algorithmic Reasoners informed Large Language Model for Multi-Agent Path Finding
by: Feng, Pu, et al.
Published: (2025)
by: Feng, Pu, et al.
Published: (2025)
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization
by: Alon, Yoav, et al.
Published: (2020)
by: Alon, Yoav, et al.
Published: (2020)
ManiAgent: An Agentic Framework for General Robotic Manipulation
by: Yang, Yi, et al.
Published: (2025)
by: Yang, Yi, et al.
Published: (2025)
Embedding Autonomous Agents in Resource-Constrained Robotic Platforms
by: Halakou, Negar, et al.
Published: (2026)
by: Halakou, Negar, et al.
Published: (2026)
ESP: Exploiting Symmetry Prior for Multi-Agent Reinforcement Learning
by: Yu, Xin, et al.
Published: (2023)
by: Yu, Xin, et al.
Published: (2023)
DREAM: Domain-aware Reasoning for Efficient Autonomous Underwater Monitoring
by: Wu, Zhenqi, et al.
Published: (2025)
by: Wu, Zhenqi, et al.
Published: (2025)
Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics
by: Li, Chenhao, et al.
Published: (2025)
by: Li, Chenhao, et al.
Published: (2025)
FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency
by: Su, Yifei, et al.
Published: (2025)
by: Su, Yifei, et al.
Published: (2025)
Multi-Robot Pursuit in Parameterized Formation via Imitation Learning
by: Chen, Jinyong, et al.
Published: (2024)
by: Chen, Jinyong, et al.
Published: (2024)
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization
by: Kapoor, Aditya, et al.
Published: (2024)
by: Kapoor, Aditya, et al.
Published: (2024)
Scalable Distance-based Multi-Agent Relative State Estimation via Block Multiconvex Optimization
by: Wu, Tianyue, et al.
Published: (2024)
by: Wu, Tianyue, et al.
Published: (2024)
Reinforcement Learning Enabled Adaptive Multi-Task Control for Bipedal Soccer Robots
by: Zhang, Yulai, et al.
Published: (2026)
by: Zhang, Yulai, et al.
Published: (2026)
CUREE: A Curious Underwater Robot for Ecosystem Exploration
by: Girdhar, Yogesh, et al.
Published: (2023)
by: Girdhar, Yogesh, et al.
Published: (2023)
Similar Items
-
USIM and U0: A Vision-Language-Action Dataset and Model for General Underwater Robots
by: Gu, Junwen, et al.
Published: (2025) -
Extending Group Relative Policy Optimization to Continuous Control: A Theoretical Framework for Robotic Reinforcement Learning
by: Khanda, Rajat, et al.
Published: (2025) -
AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit
by: Li, Yang, et al.
Published: (2025) -
Distributed AI Agents for Cognitive Underwater Robot Autonomy
by: Buchholz, Markus, et al.
Published: (2025) -
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
by: Feng, Pu, et al.
Published: (2024)