Saved in:
| Main Authors: | Rietz, Finn, Smirnov, Oleg, Karimi, Sara, Cao, Lele |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.06358 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Prompt Tuning Decision Transformers with Structured and Scalable Bandits
by: Rietz, Finn, et al.
Published: (2025)
by: Rietz, Finn, et al.
Published: (2025)
APC-RL: Exceeding Data-Driven Behavior Priors with Adaptive Policy Composition
by: Rietz, Finn, et al.
Published: (2026)
by: Rietz, Finn, et al.
Published: (2026)
Are We Really Measuring Progress? Transferring Insights from Evaluating Recommender Systems to Temporal Link Prediction
by: Cornell, Filip, et al.
Published: (2025)
by: Cornell, Filip, et al.
Published: (2025)
On the Power of Heuristics in Temporal Graphs
by: Cornell, Filip, et al.
Published: (2025)
by: Cornell, Filip, et al.
Published: (2025)
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
by: Mark, Max Sobol, et al.
Published: (2024)
by: Mark, Max Sobol, et al.
Published: (2024)
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
by: Nakamoto, Mitsuhiko, et al.
Published: (2023)
by: Nakamoto, Mitsuhiko, et al.
Published: (2023)
Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization
by: Hu, Shengchao, et al.
Published: (2024)
by: Hu, Shengchao, et al.
Published: (2024)
Towards Unified Approaches in Self-Supervised Event Stream Modeling: Progress and Prospects
by: Zólyomi, Levente, et al.
Published: (2025)
by: Zólyomi, Levente, et al.
Published: (2025)
Be Wary of Your Time Series Preprocessing
by: Ennadir, Sofiane, et al.
Published: (2026)
by: Ennadir, Sofiane, et al.
Published: (2026)
SGPT: Few-Shot Prompt Tuning for Signed Graphs
by: Zhai, Zian, et al.
Published: (2024)
by: Zhai, Zian, et al.
Published: (2024)
Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation
by: Chen, Ziru, et al.
Published: (2026)
by: Chen, Ziru, et al.
Published: (2026)
Enhancing Graph Classification Robustness with Singular Pooling
by: Ennadir, Sofiane, et al.
Published: (2025)
by: Ennadir, Sofiane, et al.
Published: (2025)
Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies
by: Rietz, Finn, et al.
Published: (2024)
by: Rietz, Finn, et al.
Published: (2024)
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
by: Li, Pengyi, et al.
Published: (2025)
by: Li, Pengyi, et al.
Published: (2025)
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL
by: Luo, Qin-Wen, et al.
Published: (2024)
by: Luo, Qin-Wen, et al.
Published: (2024)
Offline Learning for Combinatorial Multi-armed Bandits
by: Liu, Xutong, et al.
Published: (2025)
by: Liu, Xutong, et al.
Published: (2025)
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only
by: Xiao, Wei, et al.
Published: (2025)
by: Xiao, Wei, et al.
Published: (2025)
Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models
by: Ennadir, Sofiane, et al.
Published: (2025)
by: Ennadir, Sofiane, et al.
Published: (2025)
Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning
by: Brouwer, Eric, et al.
Published: (2024)
by: Brouwer, Eric, et al.
Published: (2024)
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
by: Mukherjee, Subhojyoti, et al.
Published: (2025)
by: Mukherjee, Subhojyoti, et al.
Published: (2025)
Active Few-Shot Fine-Tuning
by: Hübotter, Jonas, et al.
Published: (2024)
by: Hübotter, Jonas, et al.
Published: (2024)
Unlearning Offline Stochastic Multi-Armed Bandits
by: Ye, Zichun, et al.
Published: (2026)
by: Ye, Zichun, et al.
Published: (2026)
Efficient Adversarial Attacks on High-dimensional Offline Bandits
by: Hosseini, Seyed Mohammad Hadi, et al.
Published: (2026)
by: Hosseini, Seyed Mohammad Hadi, et al.
Published: (2026)
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation
by: Karimi, Amin, et al.
Published: (2025)
by: Karimi, Amin, et al.
Published: (2025)
A Foundational Multi-Modal Model for Few-Shot Learning
by: Dang, Pengtao, et al.
Published: (2025)
by: Dang, Pengtao, et al.
Published: (2025)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
MMSE-Calibrated Few-Shot Prompting for Alzheimer's Detection
by: Sweidan, Jana, et al.
Published: (2025)
by: Sweidan, Jana, et al.
Published: (2025)
Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property Prediction
by: Wang, Liang, et al.
Published: (2024)
by: Wang, Liang, et al.
Published: (2024)
Understanding Players as if They Are Talking to the Game in a Customized Language: A Pilot Study
by: Wang, Tianze, et al.
Published: (2024)
by: Wang, Tianze, et al.
Published: (2024)
Few-Shot Task Learning through Inverse Generative Modeling
by: Netanyahu, Aviv, et al.
Published: (2024)
by: Netanyahu, Aviv, et al.
Published: (2024)
Group-Sensitive Offline Contextual Bandits
by: Guo, Yihong, et al.
Published: (2025)
by: Guo, Yihong, et al.
Published: (2025)
Bayesian Regret Minimization in Offline Bandits
by: Petrik, Marek, et al.
Published: (2023)
by: Petrik, Marek, et al.
Published: (2023)
Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning
by: Tang, Runze, et al.
Published: (2026)
by: Tang, Runze, et al.
Published: (2026)
tensorflow-riemopt: A Library for Optimization on Riemannian Manifolds
by: Smirnov, Oleg
Published: (2021)
by: Smirnov, Oleg
Published: (2021)
Offline Multi-task Transfer RL with Representational Penalization
by: Bose, Avinandan, et al.
Published: (2024)
by: Bose, Avinandan, et al.
Published: (2024)
Efficient Multi-objective Prompt Optimization via Pure-exploration Bandits
by: Li, Donghao, et al.
Published: (2026)
by: Li, Donghao, et al.
Published: (2026)
Grounding LTL Tasks in Sub-Symbolic RL Environments for Zero-Shot Generalization
by: Pannacci, Matteo, et al.
Published: (2026)
by: Pannacci, Matteo, et al.
Published: (2026)
Explore to Generalize in Zero-Shot RL
by: Zisselman, Ev, et al.
Published: (2023)
by: Zisselman, Ev, et al.
Published: (2023)
Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms
by: Lazzati, Filippo, et al.
Published: (2024)
by: Lazzati, Filippo, et al.
Published: (2024)
Few-Shot Inspired Generative Zero-Shot Learning
by: Shohag, Md Shakil Ahamed, et al.
Published: (2025)
by: Shohag, Md Shakil Ahamed, et al.
Published: (2025)
Similar Items
-
Prompt Tuning Decision Transformers with Structured and Scalable Bandits
by: Rietz, Finn, et al.
Published: (2025) -
APC-RL: Exceeding Data-Driven Behavior Priors with Adaptive Policy Composition
by: Rietz, Finn, et al.
Published: (2026) -
Are We Really Measuring Progress? Transferring Insights from Evaluating Recommender Systems to Temporal Link Prediction
by: Cornell, Filip, et al.
Published: (2025) -
On the Power of Heuristics in Temporal Graphs
by: Cornell, Filip, et al.
Published: (2025) -
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
by: Mark, Max Sobol, et al.
Published: (2024)