Saved in:
| Main Author: | Arjmandi, Mohsen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.17683 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
by: Overman, William, et al.
Published: (2025)
by: Overman, William, et al.
Published: (2025)
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026)
by: Lou, Zhanzhi, et al.
Published: (2026)
Decocted Experience Improves Test-Time Inference in LLM Agents
by: Shen, Maohao, et al.
Published: (2026)
by: Shen, Maohao, et al.
Published: (2026)
Test Time Learning for Time Series Forecasting
by: Christou, Panayiotis, et al.
Published: (2024)
by: Christou, Panayiotis, et al.
Published: (2024)
Self-Improving LLM Agents at Test-Time
by: Acikgoz, Emre Can, et al.
Published: (2025)
by: Acikgoz, Emre Can, et al.
Published: (2025)
Learning to Discover at Test Time
by: Yuksekgonul, Mert, et al.
Published: (2026)
by: Yuksekgonul, Mert, et al.
Published: (2026)
Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates
by: Li, Yibo, et al.
Published: (2026)
by: Li, Yibo, et al.
Published: (2026)
Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance
by: He, Yufei, et al.
Published: (2025)
by: He, Yufei, et al.
Published: (2025)
Curriculum Learning for LLM Pretraining: An Analysis of Learning Dynamics
by: Elgaar, Mohamed, et al.
Published: (2026)
by: Elgaar, Mohamed, et al.
Published: (2026)
TTCS: Test-Time Curriculum Synthesis for Self-Evolving
by: Yang, Chengyi, et al.
Published: (2026)
by: Yang, Chengyi, et al.
Published: (2026)
Test Time Training for Supervised Causal Learning
by: Deng, Zizhen, et al.
Published: (2026)
by: Deng, Zizhen, et al.
Published: (2026)
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
by: Hübotter, Jonas, et al.
Published: (2025)
by: Hübotter, Jonas, et al.
Published: (2025)
ETTRL: Balancing Exploration and Exploitation in LLM Test-Time Reinforcement Learning Via Entropy Mechanism
by: Liu, Jia, et al.
Published: (2025)
by: Liu, Jia, et al.
Published: (2025)
Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series
by: Koh, Woosung, et al.
Published: (2023)
by: Koh, Woosung, et al.
Published: (2023)
Universal One-third Time Scaling in Learning Peaked Distributions
by: Liu, Yizhou, et al.
Published: (2026)
by: Liu, Yizhou, et al.
Published: (2026)
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
by: Salt, Llewyn, et al.
Published: (2025)
by: Salt, Llewyn, et al.
Published: (2025)
Titans: Learning to Memorize at Test Time
by: Behrouz, Ali, et al.
Published: (2024)
by: Behrouz, Ali, et al.
Published: (2024)
Empowering Time Series Forecasting with LLM-Agents
by: Yeh, Chin-Chia Michael, et al.
Published: (2025)
by: Yeh, Chin-Chia Michael, et al.
Published: (2025)
Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning
by: Liang, Rui, et al.
Published: (2024)
by: Liang, Rui, et al.
Published: (2024)
Test-Time Learning of Causal Structure from Interventional Data
by: Chen, Wei, et al.
Published: (2026)
by: Chen, Wei, et al.
Published: (2026)
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
by: Hübotter, Jonas, et al.
Published: (2024)
by: Hübotter, Jonas, et al.
Published: (2024)
Test-Time Compute Games
by: Velasco, Ander Artola, et al.
Published: (2026)
by: Velasco, Ander Artola, et al.
Published: (2026)
Learning Progress Driven Multi-Agent Curriculum
by: Zhao, Wenshuai, et al.
Published: (2022)
by: Zhao, Wenshuai, et al.
Published: (2022)
Test-Time Learning for Large Language Models
by: Hu, Jinwu, et al.
Published: (2025)
by: Hu, Jinwu, et al.
Published: (2025)
Learning to Reason from Feedback at Test-Time
by: Li, Yanyang, et al.
Published: (2025)
by: Li, Yanyang, et al.
Published: (2025)
Reinforcement Learning Teachers of Test Time Scaling
by: Cetin, Edoardo, et al.
Published: (2025)
by: Cetin, Edoardo, et al.
Published: (2025)
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
by: Sun, Yu, et al.
Published: (2024)
by: Sun, Yu, et al.
Published: (2024)
Best-of-$\infty$ -- Asymptotic Performance of Test-Time LLM Ensembling
by: Komiyama, Junpei, et al.
Published: (2025)
by: Komiyama, Junpei, et al.
Published: (2025)
Reinforcement Learning Agent for a 2D Shooter Game
by: Ackermann, Thomas, et al.
Published: (2025)
by: Ackermann, Thomas, et al.
Published: (2025)
TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents
by: Cai, Yifu, et al.
Published: (2025)
by: Cai, Yifu, et al.
Published: (2025)
Selector-Guided Autonomous Curriculum for One-Shot Reinforcement Learning from Verifiable Rewards
by: Dave, Rudray, et al.
Published: (2026)
by: Dave, Rudray, et al.
Published: (2026)
Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics
by: May, Victor, et al.
Published: (2026)
by: May, Victor, et al.
Published: (2026)
Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network
by: Hsiao, Vincent, et al.
Published: (2025)
by: Hsiao, Vincent, et al.
Published: (2025)
ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning
by: Zhao, Chu, et al.
Published: (2026)
by: Zhao, Chu, et al.
Published: (2026)
What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
by: Yan, Dong, et al.
Published: (2026)
by: Yan, Dong, et al.
Published: (2026)
Open-World Test-Time Training: Self-Training with Contrast Learning
by: Su, Houcheng, et al.
Published: (2024)
by: Su, Houcheng, et al.
Published: (2024)
Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning
by: Celemin, Carlos, et al.
Published: (2025)
by: Celemin, Carlos, et al.
Published: (2025)
Curriculum Abductive Learning
by: Hu, Wen-Chao, et al.
Published: (2025)
by: Hu, Wen-Chao, et al.
Published: (2025)
One Rank at a Time: Cascading Error Dynamics in Sequential Learning
by: Vandchali, Mahtab Alizadeh, et al.
Published: (2025)
by: Vandchali, Mahtab Alizadeh, et al.
Published: (2025)
Learning Game-Playing Agents with Generative Code Optimization
by: Kuang, Zhiyi, et al.
Published: (2025)
by: Kuang, Zhiyi, et al.
Published: (2025)
Similar Items
-
The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
by: Overman, William, et al.
Published: (2025) -
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026) -
Decocted Experience Improves Test-Time Inference in LLM Agents
by: Shen, Maohao, et al.
Published: (2026) -
Test Time Learning for Time Series Forecasting
by: Christou, Panayiotis, et al.
Published: (2024) -
Self-Improving LLM Agents at Test-Time
by: Acikgoz, Emre Can, et al.
Published: (2025)