Saved in:
| Main Authors: | Rietz, Finn, Smirnov, Oleg, Karimi, Sara, Cao, Lele |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.04979 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Prompt-Tuning Bandits: Enabling Few-Shot Generalization for Efficient Multi-Task Offline RL
by: Rietz, Finn, et al.
Published: (2025)
by: Rietz, Finn, et al.
Published: (2025)
Are We Really Measuring Progress? Transferring Insights from Evaluating Recommender Systems to Temporal Link Prediction
by: Cornell, Filip, et al.
Published: (2025)
by: Cornell, Filip, et al.
Published: (2025)
On the Power of Heuristics in Temporal Graphs
by: Cornell, Filip, et al.
Published: (2025)
by: Cornell, Filip, et al.
Published: (2025)
Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models
by: Ennadir, Sofiane, et al.
Published: (2025)
by: Ennadir, Sofiane, et al.
Published: (2025)
Towards Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects
by: Zólyomi, Levente, et al.
Published: (2025)
by: Zólyomi, Levente, et al.
Published: (2025)
Be Wary of Your Time Series Preprocessing
by: Ennadir, Sofiane, et al.
Published: (2026)
by: Ennadir, Sofiane, et al.
Published: (2026)
Enhancing Graph Classification Robustness with Singular Pooling
by: Ennadir, Sofiane, et al.
Published: (2025)
by: Ennadir, Sofiane, et al.
Published: (2025)
Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies
by: Rietz, Finn, et al.
Published: (2024)
by: Rietz, Finn, et al.
Published: (2024)
APC-RL: Exceeding Data-Driven Behavior Priors with Adaptive Policy Composition
by: Rietz, Finn, et al.
Published: (2026)
by: Rietz, Finn, et al.
Published: (2026)
Behavior Structformer: Learning Players Representations with Structured Tokenization
by: Smirnov, Oleg, et al.
Published: (2024)
by: Smirnov, Oleg, et al.
Published: (2024)
Understanding Players as if They Are Talking to the Game in a Customized Language: A Pilot Study
by: Wang, Tianze, et al.
Published: (2024)
by: Wang, Tianze, et al.
Published: (2024)
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
tensorflow-riemopt: A Library for Optimization on Riemannian Manifolds
by: Smirnov, Oleg
Published: (2021)
by: Smirnov, Oleg
Published: (2021)
Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit
by: Meng, Fanfei, et al.
Published: (2023)
by: Meng, Fanfei, et al.
Published: (2023)
Prompt Optimization with Logged Bandit Data
by: Kiyohara, Haruka, et al.
Published: (2025)
by: Kiyohara, Haruka, et al.
Published: (2025)
Memory Limitations of Prompt Tuning in Transformers
by: Meyer, Maxime, et al.
Published: (2025)
by: Meyer, Maxime, et al.
Published: (2025)
Frequency Matters: When Time Series Foundation Models Fail Under Spectral Shift
by: Wang, Tianze, et al.
Published: (2025)
by: Wang, Tianze, et al.
Published: (2025)
Todyformer: Towards Holistic Dynamic Graph Transformers with Structure-Aware Tokenization
by: Biparva, Mahdi, et al.
Published: (2024)
by: Biparva, Mahdi, et al.
Published: (2024)
Differentially Private Subspace Fine-Tuning for Large Language Models
by: Zheng, Lele, et al.
Published: (2026)
by: Zheng, Lele, et al.
Published: (2026)
In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks
by: Yang, Huitao, et al.
Published: (2025)
by: Yang, Huitao, et al.
Published: (2025)
Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction
by: Smirnov, Alexander
Published: (2026)
by: Smirnov, Alexander
Published: (2026)
HR-Bandit: Human-AI Collaborated Linear Recourse Bandit
by: Cao, Junyu, et al.
Published: (2024)
by: Cao, Junyu, et al.
Published: (2024)
Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers
by: Yashwanth, M, et al.
Published: (2025)
by: Yashwanth, M, et al.
Published: (2025)
On Transportability for Structural Causal Bandits
by: Park, Min Woo, et al.
Published: (2025)
by: Park, Min Woo, et al.
Published: (2025)
Learning Markov Decision Processes under Fully Bandit Feedback
by: Zhuo, Zhengjia, et al.
Published: (2026)
by: Zhuo, Zhengjia, et al.
Published: (2026)
PSP: Pre-Training and Structure Prompt Tuning for Graph Neural Networks
by: Ge, Qingqing, et al.
Published: (2023)
by: Ge, Qingqing, et al.
Published: (2023)
ADAPT to Robustify Prompt Tuning Vision Transformers
by: Eskandar, Masih, et al.
Published: (2024)
by: Eskandar, Masih, et al.
Published: (2024)
Multi-Armed Bandits-Based Optimization of Decision Trees
by: Shanto, Hasibul Karim, et al.
Published: (2025)
by: Shanto, Hasibul Karim, et al.
Published: (2025)
Joint Distribution-Informed Shapley Values for Sparse Counterfactual Explanations
by: You, Lei, et al.
Published: (2024)
by: You, Lei, et al.
Published: (2024)
Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies
by: Li, Kevin, et al.
Published: (2025)
by: Li, Kevin, et al.
Published: (2025)
Denoising diffusion models for inverse design of inflatable structures with programmable deformations
by: Karimi, Sara, et al.
Published: (2025)
by: Karimi, Sara, et al.
Published: (2025)
Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiency
by: Hu, Jerry Yao-Chieh, et al.
Published: (2024)
by: Hu, Jerry Yao-Chieh, et al.
Published: (2024)
Scalable Graph Self-Supervised Learning
by: Pasand, Ali Saheb, et al.
Published: (2024)
by: Pasand, Ali Saheb, et al.
Published: (2024)
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
by: Wang, Zhe, et al.
Published: (2024)
by: Wang, Zhe, et al.
Published: (2024)
Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype
by: Tankovic, Nikola, et al.
Published: (2025)
by: Tankovic, Nikola, et al.
Published: (2025)
Stochastic Low-rank Tensor Bandits for Multi-dimensional Online Decision Making
by: Zhou, Jie, et al.
Published: (2020)
by: Zhou, Jie, et al.
Published: (2020)
Bandit and Delayed Feedback in Online Structured Prediction
by: Shibukawa, Yuki, et al.
Published: (2025)
by: Shibukawa, Yuki, et al.
Published: (2025)
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework
by: Tu, Ruibo, et al.
Published: (2024)
by: Tu, Ruibo, et al.
Published: (2024)
S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting
by: Chen, Hongyi, et al.
Published: (2025)
by: Chen, Hongyi, et al.
Published: (2025)
Buzz, Choose, Forget: A Meta-Bandit Framework for Bee-Like Decision Making
by: Claeys, Emmanuelle, et al.
Published: (2025)
by: Claeys, Emmanuelle, et al.
Published: (2025)
Similar Items
-
Prompt-Tuning Bandits: Enabling Few-Shot Generalization for Efficient Multi-Task Offline RL
by: Rietz, Finn, et al.
Published: (2025) -
Are We Really Measuring Progress? Transferring Insights from Evaluating Recommender Systems to Temporal Link Prediction
by: Cornell, Filip, et al.
Published: (2025) -
On the Power of Heuristics in Temporal Graphs
by: Cornell, Filip, et al.
Published: (2025) -
Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models
by: Ennadir, Sofiane, et al.
Published: (2025) -
Towards Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects
by: Zólyomi, Levente, et al.
Published: (2025)