Saved in:
| Main Authors: | Kong, Yilun, Mao, Hangyu, Zhao, Qi, Zhang, Bin, Ruan, Jingqing, Shen, Li, Chang, Yongzhe, Wang, Xueqian, Zhao, Rui, Tao, Dacheng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.10504 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
by: Kong, Yilun, et al.
Published: (2025)
by: Kong, Yilun, et al.
Published: (2025)
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
by: Xia, Bo, et al.
Published: (2024)
by: Xia, Bo, et al.
Published: (2024)
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
by: Jiang, Haoyuan, et al.
Published: (2024)
by: Jiang, Haoyuan, et al.
Published: (2024)
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
by: Luo, Yifu, et al.
Published: (2025)
by: Luo, Yifu, et al.
Published: (2025)
Solving Continual Offline Reinforcement Learning with Decision Transformer
by: Huang, Kaixin, et al.
Published: (2024)
by: Huang, Kaixin, et al.
Published: (2024)
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control
by: Ruan, Jingqing, et al.
Published: (2024)
by: Ruan, Jingqing, et al.
Published: (2024)
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
by: Hu, Jifeng, et al.
Published: (2025)
by: Hu, Jifeng, et al.
Published: (2025)
UACER: An Uncertainty-Adaptive Critic Ensemble Framework for Robust Adversarial Reinforcement Learning
by: Wu, Jiaxi, et al.
Published: (2025)
by: Wu, Jiaxi, et al.
Published: (2025)
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
by: Hu, Shengchao, et al.
Published: (2024)
by: Hu, Shengchao, et al.
Published: (2024)
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
by: Ruan, Jingqing, et al.
Published: (2023)
by: Ruan, Jingqing, et al.
Published: (2023)
Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
by: Sun, Haoyuan, et al.
Published: (2024)
by: Sun, Haoyuan, et al.
Published: (2024)
GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents
by: Jiang, Haoyuan, et al.
Published: (2024)
by: Jiang, Haoyuan, et al.
Published: (2024)
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
by: Fan, Ziqing, et al.
Published: (2024)
by: Fan, Ziqing, et al.
Published: (2024)
Probing the Safety Response Boundary of Large Language Models via Unsafe Decoding Path Generation
by: Wang, Haoyu, et al.
Published: (2024)
by: Wang, Haoyu, et al.
Published: (2024)
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
by: Zhang, Bin, et al.
Published: (2023)
by: Zhang, Bin, et al.
Published: (2023)
Explainable Reinforcement Learning via a Causal World Model
by: Yu, Zhongwei, et al.
Published: (2023)
by: Yu, Zhongwei, et al.
Published: (2023)
Morphology and Behavior Co-Optimization of Modular Satellites for Attitude Control
by: Wang, Yuxing, et al.
Published: (2024)
by: Wang, Yuxing, et al.
Published: (2024)
Q-value Regularized Transformer for Offline Reinforcement Learning
by: Hu, Shengchao, et al.
Published: (2024)
by: Hu, Shengchao, et al.
Published: (2024)
Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal
by: Hu, Jifeng, et al.
Published: (2024)
by: Hu, Jifeng, et al.
Published: (2024)
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2022)
by: Chen, Yiqun, et al.
Published: (2022)
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023)
by: He, Longxiang, et al.
Published: (2023)
Offline Behavior Distillation
by: Lei, Shiye, et al.
Published: (2024)
by: Lei, Shiye, et al.
Published: (2024)
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
by: Ma, Guozheng, et al.
Published: (2022)
by: Ma, Guozheng, et al.
Published: (2022)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models
by: Sun, Haoyuan, et al.
Published: (2025)
by: Sun, Haoyuan, et al.
Published: (2025)
Offline Behavioral Data Selection
by: Lei, Shiye, et al.
Published: (2025)
by: Lei, Shiye, et al.
Published: (2025)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
by: Zhao, Ziqi, et al.
Published: (2024)
State Diversity Matters in Offline Behavior Distillation
by: Lei, Shiye, et al.
Published: (2025)
by: Lei, Shiye, et al.
Published: (2025)
AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
by: Liu, Qi, et al.
Published: (2025)
by: Liu, Qi, et al.
Published: (2025)
Safety Reasoning with Guidelines
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization
by: Hu, Shengchao, et al.
Published: (2024)
by: Hu, Shengchao, et al.
Published: (2024)
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency
by: Li, Zhishuai, et al.
Published: (2024)
by: Li, Zhishuai, et al.
Published: (2024)
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
by: Ma, Guozheng, et al.
Published: (2023)
by: Ma, Guozheng, et al.
Published: (2023)
Learned Offline Query Planning via Bayesian Optimization
by: Tao, Jeffrey, et al.
Published: (2025)
by: Tao, Jeffrey, et al.
Published: (2025)
Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation
by: Luo, Yifu, et al.
Published: (2025)
by: Luo, Yifu, et al.
Published: (2025)
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies
by: Yan, Runze, et al.
Published: (2025)
by: Yan, Runze, et al.
Published: (2025)
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
by: Cao, Chenyang, et al.
Published: (2024)
by: Cao, Chenyang, et al.
Published: (2024)
Learning Causal Dynamics Models in Object-Oriented Environments
by: Yu, Zhongwei, et al.
Published: (2024)
by: Yu, Zhongwei, et al.
Published: (2024)
Image Captions are Natural Prompts for Text-to-Image Models
by: Lei, Shiye, et al.
Published: (2023)
by: Lei, Shiye, et al.
Published: (2023)
Doubly Mild Generalization for Offline Reinforcement Learning
by: Mao, Yixiu, et al.
Published: (2024)
by: Mao, Yixiu, et al.
Published: (2024)
Similar Items
-
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
by: Kong, Yilun, et al.
Published: (2025) -
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
by: Xia, Bo, et al.
Published: (2024) -
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
by: Jiang, Haoyuan, et al.
Published: (2024) -
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
by: Luo, Yifu, et al.
Published: (2025) -
Solving Continual Offline Reinforcement Learning with Decision Transformer
by: Huang, Kaixin, et al.
Published: (2024)