Saved in:
| Main Authors: | Liu, Bingyi, He, Jinbo, Shi, Haiyong, Wang, Enshu, Han, Weizhen, Hao, Jingxiang, Wang, Peixi, Zhang, Zhuangzhuang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.05675 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Deep Reinforcement Learning in Parameterized Action Space
by: Hausknecht, Matthew, et al.
Published: (2015)
by: Hausknecht, Matthew, et al.
Published: (2015)
Model-based Reinforcement Learning for Parameterized Action Spaces
by: Zhang, Renhao, et al.
Published: (2024)
by: Zhang, Renhao, et al.
Published: (2024)
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
by: Liu, Bingyi, et al.
Published: (2025)
by: Liu, Bingyi, et al.
Published: (2025)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
Context-Sensitive Abstractions for Reinforcement Learning with Parameterized Actions
by: Nayyar, Rashmeet Kaur, et al.
Published: (2025)
by: Nayyar, Rashmeet Kaur, et al.
Published: (2025)
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
by: Zhu, Yuanyang, et al.
Published: (2024)
by: Zhu, Yuanyang, et al.
Published: (2024)
Hybrid Differential Reward: Combining Temporal Difference and Action Gradients for Efficient Multi-Agent Reinforcement Learning in Cooperative Driving
by: Han, Ye, et al.
Published: (2025)
by: Han, Ye, et al.
Published: (2025)
Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation
by: Zhang, Hao, et al.
Published: (2024)
by: Zhang, Hao, et al.
Published: (2024)
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning
by: Yu, Xiaoyang, et al.
Published: (2023)
by: Yu, Xiaoyang, et al.
Published: (2023)
MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control
by: Zhu, Liwen, et al.
Published: (2024)
by: Zhu, Liwen, et al.
Published: (2024)
Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space
by: Liu, Qianmei, et al.
Published: (2024)
by: Liu, Qianmei, et al.
Published: (2024)
Deep Reinforcement Learning with Spiking Q-learning
by: Chen, Ding, et al.
Published: (2022)
by: Chen, Ding, et al.
Published: (2022)
Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective
by: Liu, Qi, et al.
Published: (2024)
by: Liu, Qi, et al.
Published: (2024)
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control
by: Zhu, Liwen, et al.
Published: (2021)
by: Zhu, Liwen, et al.
Published: (2021)
From Noise to Control: Parameterized Diffusion Policies
by: Zhang, Renhao, et al.
Published: (2026)
by: Zhang, Renhao, et al.
Published: (2026)
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control
by: Wang, Yuxuan, et al.
Published: (2025)
by: Wang, Yuxuan, et al.
Published: (2025)
Fully Spiking Actor Network with Intra-layer Connections for Reinforcement Learning
by: Chen, Ding, et al.
Published: (2024)
by: Chen, Ding, et al.
Published: (2024)
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification
by: Lin, Zinan, et al.
Published: (2025)
by: Lin, Zinan, et al.
Published: (2025)
Neuro-symbolic Action Masking for Deep Reinforcement Learning
by: Han, Shuai, et al.
Published: (2026)
by: Han, Shuai, et al.
Published: (2026)
Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
by: Yu, Xiaoyang, et al.
Published: (2024)
by: Yu, Xiaoyang, et al.
Published: (2024)
A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space
by: He, Yonghao, et al.
Published: (2024)
by: He, Yonghao, et al.
Published: (2024)
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)
by: Li, Zongyue, et al.
Published: (2025)
CLLAP: Contrastive Learning-based LiDAR-Augmented Pretraining for Enhanced Radar-Camera Fusion
by: Liu, Bingyi, et al.
Published: (2026)
by: Liu, Bingyi, et al.
Published: (2026)
Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces
by: Pan, Chaofan, et al.
Published: (2025)
by: Pan, Chaofan, et al.
Published: (2025)
In-Context Reinforcement Learning for Variable Action Spaces
by: Sinii, Viacheslav, et al.
Published: (2023)
by: Sinii, Viacheslav, et al.
Published: (2023)
Enhancing Policy Learning with World-Action Model
by: Han, Yuci, et al.
Published: (2026)
by: Han, Yuci, et al.
Published: (2026)
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025)
by: Hu, Hao, et al.
Published: (2025)
Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)
by: Ma, Xiao, et al.
Published: (2022)
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
by: Chen, Tianyu, et al.
Published: (2024)
by: Chen, Tianyu, et al.
Published: (2024)
UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning
by: Lu, Zhengxi, et al.
Published: (2025)
by: Lu, Zhengxi, et al.
Published: (2025)
Policy Optimization in Hybrid Discrete-Continuous Action Spaces via Mixed Gradients
by: Alvo, Matias, et al.
Published: (2026)
by: Alvo, Matias, et al.
Published: (2026)
Self-Correction as Feedback Control: Error Dynamics, Stability Thresholds, and Prompt Interventions in LLMs
by: Liu, Aofan, et al.
Published: (2026)
by: Liu, Aofan, et al.
Published: (2026)
On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
by: Tiwari, Saket, et al.
Published: (2022)
by: Tiwari, Saket, et al.
Published: (2022)
When SparseMoE Meets Noisy Interactions: An Ensemble View on Denoising Recommendation
by: Chen, Weipu, et al.
Published: (2024)
by: Chen, Weipu, et al.
Published: (2024)
Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving
by: Jin, Guizhe, et al.
Published: (2025)
by: Jin, Guizhe, et al.
Published: (2025)
Low-Rank Adaptation for Critic Learning in Off-Policy Reinforcement Learning
by: Zhuang, Yuan, et al.
Published: (2026)
by: Zhuang, Yuan, et al.
Published: (2026)
Reinforcing Language Agents via Policy Optimization with Action Decomposition
by: Wen, Muning, et al.
Published: (2024)
by: Wen, Muning, et al.
Published: (2024)
Comparative Analysis of Parameterized Action Actor-Critic Reinforcement Learning Algorithms for Web Search Match Plan Generation
by: Bapoo, Ubayd, et al.
Published: (2025)
by: Bapoo, Ubayd, et al.
Published: (2025)
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
by: Aouali, Imad, et al.
Published: (2024)
by: Aouali, Imad, et al.
Published: (2024)
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
by: Zhao, Tianchen, et al.
Published: (2024)
by: Zhao, Tianchen, et al.
Published: (2024)
Similar Items
-
Deep Reinforcement Learning in Parameterized Action Space
by: Hausknecht, Matthew, et al.
Published: (2015) -
Model-based Reinforcement Learning for Parameterized Action Spaces
by: Zhang, Renhao, et al.
Published: (2024) -
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
by: Liu, Bingyi, et al.
Published: (2025) -
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024) -
Context-Sensitive Abstractions for Reinforcement Learning with Parameterized Actions
by: Nayyar, Rashmeet Kaur, et al.
Published: (2025)