:: Library Catalog

Bibliographic Details
Main Authors:	Su, Haoran; Deng, Hanxiao; Sun, Yandong
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.22315

Similar Items

Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)

Geometric and Dynamic Scaling in Deep Transformers
by: Su, Haoran, et al.
Published: (2026)

Maximum Entropy Exploration Without the Rollouts
by: Adamczyk, Jacob, et al.
Published: (2026)

Online Finetuning Decision Transformers with Pure RL Gradients
by: Luo, Junkai, et al.
Published: (2026)

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)

In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks
by: Yang, Huitao, et al.
Published: (2025)

Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
by: Nishimori, Soichiro, et al.
Published: (2026)

Decision Transformer as a Foundation Model for Partially Observable Continuous Control
by: Zhang, Xiangyuan, et al.
Published: (2024)

Uncertainty-Aware Decision Transformer for Stochastic Driving Environments
by: Li, Zenan, et al.
Published: (2023)

Online Policy Distillation with Decision-Attention
by: Yu, Xinqiang, et al.
Published: (2024)

Latency and Ordering Effects in Online Decisions
by: Yi, Duo
Published: (2025)

Online Sequential Decision-Making with Unknown Delays
by: Wu, Ping, et al.
Published: (2024)

Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making
by: Wang, Hanzhao, et al.
Published: (2024)

Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
by: Huang, Sili, et al.
Published: (2024)

Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
by: Nguyen, Minh Hoang, et al.
Published: (2025)

Efficient Imitation Without Demonstrations via Value-Penalized Auxiliary Control from Examples
by: Ablett, Trevor, et al.
Published: (2024)

On Exact Bit-level Reversible Transformers Without Changing Architectures
by: Zhang, Guoqiang, et al.
Published: (2024)

Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
by: Ji, Luo, et al.
Published: (2024)

Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
by: Yan, Ke
Published: (2024)

A Temporally Correlated Latent Exploration for Reinforcement Learning
by: Oh, SuMin, et al.
Published: (2024)

Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM
by: Shetty, Mehul, et al.
Published: (2025)

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)

Fairness Without Harm: An Influence-Guided Active Sampling Approach
by: Pang, Jinlong, et al.
Published: (2024)

Scalable Decision Focused Learning via Online Trainable Surrogates
by: Signorelli, Gaetano, et al.
Published: (2025)

A Percolation Model of Emergence: Analyzing Transformers Trained on a Formal Language
by: Lubana, Ekdeep Singh, et al.
Published: (2024)

The End of Reward Engineering: How LLMs Are Redefining Multi-Agent Coordination
by: Su, Haoran, et al.
Published: (2026)

Adjusting the Output of Decision Transformer with Action Gradient
by: Lin, Rui, et al.
Published: (2025)

The Two-Stage Decision-Sampling Hypothesis: Understanding the Emergence of Self-Reflection in RL-Trained LLMs
by: Zhao, Zibo, et al.
Published: (2026)

An Integrated Forecasting Prototype for Emergency Department Boarding Time to Support Proactive Operational Decision Making
by: Vural, Orhun, et al.
Published: (2026)

Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
by: Zhang, Ziqi, et al.
Published: (2023)

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)

Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training
by: Lys, Jonathan, et al.
Published: (2026)

Explorative Imitation Learning: A Path Signature Approach for Continuous Environments
by: Gavenski, Nathan, et al.
Published: (2024)

Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
by: Moon, Sang Bin, et al.
Published: (2024)

Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment
by: Rahman, Aamer Abdul, et al.
Published: (2024)

Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
by: Li, Zhuohua, et al.
Published: (2025)

Solving Continual Offline Reinforcement Learning with Decision Transformer
by: Huang, Kaixin, et al.
Published: (2024)

Large EEG-U-Transformer for Time-Step Level Detection Without Pre-Training
by: Wu, Kerui, et al.
Published: (2025)

Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information
by: Joung, Youngju, et al.
Published: (2025)