Saved in:
| Main Authors: | Su, Haoran, Deng, Hanxiao, Sun, Yandong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.22315 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Geometric and Dynamic Scaling in Deep Transformers
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Maximum Entropy Exploration Without the Rollouts
by: Adamczyk, Jacob, et al.
Published: (2026)
by: Adamczyk, Jacob, et al.
Published: (2026)
Online Finetuning Decision Transformers with Pure RL Gradients
by: Luo, Junkai, et al.
Published: (2026)
by: Luo, Junkai, et al.
Published: (2026)
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)
by: Yan, Kai, et al.
Published: (2024)
In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks
by: Yang, Huitao, et al.
Published: (2025)
by: Yang, Huitao, et al.
Published: (2025)
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
by: Nishimori, Soichiro, et al.
Published: (2026)
by: Nishimori, Soichiro, et al.
Published: (2026)
Decision Transformer as a Foundation Model for Partially Observable Continuous Control
by: Zhang, Xiangyuan, et al.
Published: (2024)
by: Zhang, Xiangyuan, et al.
Published: (2024)
Uncertainty-Aware Decision Transformer for Stochastic Driving Environments
by: Li, Zenan, et al.
Published: (2023)
by: Li, Zenan, et al.
Published: (2023)
Online Policy Distillation with Decision-Attention
by: Yu, Xinqiang, et al.
Published: (2024)
by: Yu, Xinqiang, et al.
Published: (2024)
Latency and Ordering Effects in Online Decisions
by: Yi, Duo
Published: (2025)
by: Yi, Duo
Published: (2025)
Online Sequential Decision-Making with Unknown Delays
by: Wu, Ping, et al.
Published: (2024)
by: Wu, Ping, et al.
Published: (2024)
Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making
by: Wang, Hanzhao, et al.
Published: (2024)
by: Wang, Hanzhao, et al.
Published: (2024)
Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)
by: Tang, Xiaohang, et al.
Published: (2024)
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
by: Huang, Sili, et al.
Published: (2024)
by: Huang, Sili, et al.
Published: (2024)
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
by: Nguyen, Minh Hoang, et al.
Published: (2025)
by: Nguyen, Minh Hoang, et al.
Published: (2025)
Efficient Imitation Without Demonstrations via Value-Penalized Auxiliary Control from Examples
by: Ablett, Trevor, et al.
Published: (2024)
by: Ablett, Trevor, et al.
Published: (2024)
On Exact Bit-level Reversible Transformers Without Changing Architectures
by: Zhang, Guoqiang, et al.
Published: (2024)
by: Zhang, Guoqiang, et al.
Published: (2024)
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
by: Ji, Luo, et al.
Published: (2024)
by: Ji, Luo, et al.
Published: (2024)
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
by: Yan, Ke
Published: (2024)
by: Yan, Ke
Published: (2024)
A Temporally Correlated Latent Exploration for Reinforcement Learning
by: Oh, SuMin, et al.
Published: (2024)
by: Oh, SuMin, et al.
Published: (2024)
Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM
by: Shetty, Mehul, et al.
Published: (2025)
by: Shetty, Mehul, et al.
Published: (2025)
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)
by: Wilcoxson, Max, et al.
Published: (2024)
Fairness Without Harm: An Influence-Guided Active Sampling Approach
by: Pang, Jinlong, et al.
Published: (2024)
by: Pang, Jinlong, et al.
Published: (2024)
Scalable Decision Focused Learning via Online Trainable Surrogates
by: Signorelli, Gaetano, et al.
Published: (2025)
by: Signorelli, Gaetano, et al.
Published: (2025)
A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language
by: Lubana, Ekdeep Singh, et al.
Published: (2024)
by: Lubana, Ekdeep Singh, et al.
Published: (2024)
The End of Reward Engineering: How LLMs Are Redefining Multi-Agent Coordination
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Adjusting the Output of Decision Transformer with Action Gradient
by: Lin, Rui, et al.
Published: (2025)
by: Lin, Rui, et al.
Published: (2025)
The Two-Stage Decision-Sampling Hypothesis: Understanding the Emergence of Self-Reflection in RL-Trained LLMs
by: Zhao, Zibo, et al.
Published: (2026)
by: Zhao, Zibo, et al.
Published: (2026)
An Integrated Forecasting Prototype for Emergency Department Boarding Time to Support Proactive Operational Decision Making
by: Vural, Orhun, et al.
Published: (2026)
by: Vural, Orhun, et al.
Published: (2026)
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
by: Zhang, Ziqi, et al.
Published: (2023)
by: Zhang, Ziqi, et al.
Published: (2023)
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)
by: Wang, Yibo, et al.
Published: (2024)
Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training
by: Lys, Jonathan, et al.
Published: (2026)
by: Lys, Jonathan, et al.
Published: (2026)
Explorative Imitation Learning: A Path Signature Approach for Continuous Environments
by: Gavenski, Nathan, et al.
Published: (2024)
by: Gavenski, Nathan, et al.
Published: (2024)
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
by: Moon, Sang Bin, et al.
Published: (2024)
by: Moon, Sang Bin, et al.
Published: (2024)
Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment
by: Rahman, Aamer Abdul, et al.
Published: (2024)
by: Rahman, Aamer Abdul, et al.
Published: (2024)
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
by: Li, Zhuohua, et al.
Published: (2025)
by: Li, Zhuohua, et al.
Published: (2025)
Solving Continual Offline Reinforcement Learning with Decision Transformer
by: Huang, Kaixin, et al.
Published: (2024)
by: Huang, Kaixin, et al.
Published: (2024)
Large EEG-U-Transformer for Time-Step Level Detection Without Pre-Training
by: Wu, Kerui, et al.
Published: (2025)
by: Wu, Kerui, et al.
Published: (2025)
Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information
by: Joung, Youngju, et al.
Published: (2025)
by: Joung, Youngju, et al.
Published: (2025)
Similar Items
-
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026) -
Geometric and Dynamic Scaling in Deep Transformers
by: Su, Haoran, et al.
Published: (2026) -
Maximum Entropy Exploration Without the Rollouts
by: Adamczyk, Jacob, et al.
Published: (2026) -
Online Finetuning Decision Transformers with Pure RL Gradients
by: Luo, Junkai, et al.
Published: (2026) -
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)