Saved in:
Bibliographic Details
Main Authors: Tikhonovich, Daria, Zelinskiy, Nikita, Petrov, Aleksandr V., Spirina, Mayya, Semenov, Andrei, Savchenko, Andrey V., Kuliev, Sergei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.06450
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915434996957184
author Tikhonovich, Daria
Zelinskiy, Nikita
Petrov, Aleksandr V.
Spirina, Mayya
Semenov, Andrei
Savchenko, Andrey V.
Kuliev, Sergei
author_facet Tikhonovich, Daria
Zelinskiy, Nikita
Petrov, Aleksandr V.
Spirina, Mayya
Semenov, Andrei
Savchenko, Andrey V.
Kuliev, Sergei
contents Since their introduction, Transformer-based models, such as SASRec and BERT4Rec, have become common baselines for sequential recommendations, surpassing earlier neural and non-neural methods. A number of following publications have shown that the effectiveness of these models can be improved by, for example, slightly updating the architecture of the Transformer layers, using better training objectives, and employing improved loss functions. However, the additivity of these modular improvements has not been systematically benchmarked - this is the gap we aim to close in this paper. Through our experiments, we identify a very strong model that uses SASRec's training objective, LiGR Transformer layers, and Sampled Softmax Loss. We call this combination eSASRec (Enhanced SASRec). While we primarily focus on realistic, production-like evaluation, in our preliminarily study we find that common academic benchmarks show eSASRec to be 23% more effective compared to the most recent state-of-the-art models, such as ActionPiece. In our main production-like benchmark, eSASRec resides on the Pareto frontier in terms of the accuracy-coverage tradeoff (alongside the recent industrial models HSTU and FuXi. As the modifications compared to the original SASRec are relatively straightforward and no extra features are needed (such as timestamps in HSTU), we believe that eSASRec can be easily integrated into existing recommendation pipelines and can can serve as a strong yet very simple baseline for emerging complicated algorithms. To facilitate this, we provide the open-source implementations for our models and benchmarks in repository https://github.com/blondered/transformer_benchmark
format Preprint
id arxiv_https___arxiv_org_abs_2508_06450
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle eSASRec: Enhancing Transformer-based Recommendations in a Modular Fashion
Tikhonovich, Daria
Zelinskiy, Nikita
Petrov, Aleksandr V.
Spirina, Mayya
Semenov, Andrei
Savchenko, Andrey V.
Kuliev, Sergei
Information Retrieval
Machine Learning
Since their introduction, Transformer-based models, such as SASRec and BERT4Rec, have become common baselines for sequential recommendations, surpassing earlier neural and non-neural methods. A number of following publications have shown that the effectiveness of these models can be improved by, for example, slightly updating the architecture of the Transformer layers, using better training objectives, and employing improved loss functions. However, the additivity of these modular improvements has not been systematically benchmarked - this is the gap we aim to close in this paper. Through our experiments, we identify a very strong model that uses SASRec's training objective, LiGR Transformer layers, and Sampled Softmax Loss. We call this combination eSASRec (Enhanced SASRec). While we primarily focus on realistic, production-like evaluation, in our preliminarily study we find that common academic benchmarks show eSASRec to be 23% more effective compared to the most recent state-of-the-art models, such as ActionPiece. In our main production-like benchmark, eSASRec resides on the Pareto frontier in terms of the accuracy-coverage tradeoff (alongside the recent industrial models HSTU and FuXi. As the modifications compared to the original SASRec are relatively straightforward and no extra features are needed (such as timestamps in HSTU), we believe that eSASRec can be easily integrated into existing recommendation pipelines and can can serve as a strong yet very simple baseline for emerging complicated algorithms. To facilitate this, we provide the open-source implementations for our models and benchmarks in repository https://github.com/blondered/transformer_benchmark
title eSASRec: Enhancing Transformer-based Recommendations in a Modular Fashion
topic Information Retrieval
Machine Learning
url https://arxiv.org/abs/2508.06450