Saved in:
Bibliographic Details
Main Authors: Fu, Xinghong, Li, Yanhong, Papaioannou, Georgios, Kim, Yoon
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.17634
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910027259838464
author Fu, Xinghong
Li, Yanhong
Papaioannou, Georgios
Kim, Yoon
author_facet Fu, Xinghong
Li, Yanhong
Papaioannou, Georgios
Kim, Yoon
contents Learning time series foundation models has been shown to be a promising approach for zero-shot time series forecasting across diverse time series domains. Insofar as scaling has been a critical driver of performance of foundation models in other modalities such as language and vision, much recent work on time series foundation modeling has focused on scaling. This has resulted in time series foundation models with hundreds of millions of parameters that are, while performant, inefficient and expensive to use in practice. This paper describes a simple recipe for learning efficient foundation models for zero-shot time series forecasting that are orders of magnitude smaller. We show that large-scale transformers are not necessary: small hybrid models that interleave long convolution and linear RNN layers (in particular DeltaNet layers) can match the performance of larger transformer-based models while being more than a hundred times smaller. We also describe several data augmentation and inference strategies that further improve performance. This recipe results in Reverso, a family of efficient time series foundation models for zero-shot forecasting that significantly push the performance-efficiency Pareto frontier.
format Preprint
id arxiv_https___arxiv_org_abs_2602_17634
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting
Fu, Xinghong
Li, Yanhong
Papaioannou, Georgios
Kim, Yoon
Machine Learning
Artificial Intelligence
Learning time series foundation models has been shown to be a promising approach for zero-shot time series forecasting across diverse time series domains. Insofar as scaling has been a critical driver of performance of foundation models in other modalities such as language and vision, much recent work on time series foundation modeling has focused on scaling. This has resulted in time series foundation models with hundreds of millions of parameters that are, while performant, inefficient and expensive to use in practice. This paper describes a simple recipe for learning efficient foundation models for zero-shot time series forecasting that are orders of magnitude smaller. We show that large-scale transformers are not necessary: small hybrid models that interleave long convolution and linear RNN layers (in particular DeltaNet layers) can match the performance of larger transformer-based models while being more than a hundred times smaller. We also describe several data augmentation and inference strategies that further improve performance. This recipe results in Reverso, a family of efficient time series foundation models for zero-shot forecasting that significantly push the performance-efficiency Pareto frontier.
title Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2602.17634