Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liang, Renzhao, Xu, Sizhe, Xie, Chenggang, Chen, Jingru, Ren, Feiyang, Yang, Shu, Yabe, Takahiro
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Information Theory
Online Access:	https://arxiv.org/abs/2510.19980
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917035668144128
author	Liang, Renzhao Xu, Sizhe Xie, Chenggang Chen, Jingru Ren, Feiyang Yang, Shu Yabe, Takahiro
author_facet	Liang, Renzhao Xu, Sizhe Xie, Chenggang Chen, Jingru Ren, Feiyang Yang, Shu Yabe, Takahiro
contents	Time series forecasting plays a pivotal role in critical domains such as energy management and financial markets. Although deep learning-based approaches (e.g., MLP, RNN, Transformer) have achieved remarkable progress, the prevailing "long-sequence information gain hypothesis" exhibits inherent limitations. Through systematic experimentation, this study reveals a counterintuitive phenomenon: appropriately truncating historical data can paradoxically enhance prediction accuracy, indicating that existing models learn substantial redundant features (e.g., noise or irrelevant fluctuations) during training, thereby compromising effective signal extraction. Building upon information bottleneck theory, we propose an innovative solution termed Adaptive Masking Loss with Representation Consistency (AMRC), which features two core components: 1) Dynamic masking loss, which adaptively identified highly discriminative temporal segments to guide gradient descent during model training; 2) Representation consistency constraint, which stabilized the mapping relationships among inputs, labels, and predictions. Experimental results demonstrate that AMRC effectively suppresses redundant feature learning while significantly improving model performance. This work not only challenges conventional assumptions in temporal modeling but also provides novel theoretical insights and methodological breakthroughs for developing efficient and robust forecasting models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_19980
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency Liang, Renzhao Xu, Sizhe Xie, Chenggang Chen, Jingru Ren, Feiyang Yang, Shu Yabe, Takahiro Machine Learning Information Theory Time series forecasting plays a pivotal role in critical domains such as energy management and financial markets. Although deep learning-based approaches (e.g., MLP, RNN, Transformer) have achieved remarkable progress, the prevailing "long-sequence information gain hypothesis" exhibits inherent limitations. Through systematic experimentation, this study reveals a counterintuitive phenomenon: appropriately truncating historical data can paradoxically enhance prediction accuracy, indicating that existing models learn substantial redundant features (e.g., noise or irrelevant fluctuations) during training, thereby compromising effective signal extraction. Building upon information bottleneck theory, we propose an innovative solution termed Adaptive Masking Loss with Representation Consistency (AMRC), which features two core components: 1) Dynamic masking loss, which adaptively identified highly discriminative temporal segments to guide gradient descent during model training; 2) Representation consistency constraint, which stabilized the mapping relationships among inputs, labels, and predictions. Experimental results demonstrate that AMRC effectively suppresses redundant feature learning while significantly improving model performance. This work not only challenges conventional assumptions in temporal modeling but also provides novel theoretical insights and methodological breakthroughs for developing efficient and robust forecasting models.
title	Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency
topic	Machine Learning Information Theory
url	https://arxiv.org/abs/2510.19980

Similar Items