Saved in:
Bibliographic Details
Main Authors: Liu, Weisi, Han, Guangzeng, Huang, Xiaolei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.08825
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917920778485760
author Liu, Weisi
Han, Guangzeng
Huang, Xiaolei
author_facet Liu, Weisi
Han, Guangzeng
Huang, Xiaolei
contents Time is implicitly embedded in classification process: classifiers are usually built on existing data while to be applied on future data whose distributions (e.g., label and token) may change. However, existing state-of-the-art classification models merely consider the temporal variations and primarily focus on English corpora, which leaves temporal studies less explored, let alone under multilingual settings. In this study, we fill the gap by treating time as domains (e.g., 2024 vs. 2025), examining temporal effects, and developing a domain adaptation framework to generalize classifiers over time on multiple languages. Our framework proposes Mixture of Temporal Experts (MoTE) to leverage both semantic and data distributional shifts to learn and adapt temporal trends into classification models. Our analysis shows classification performance varies over time across different languages, and we experimentally demonstrate that MoTE can enhance classifier generalizability over temporal data shifts. Our study provides analytic insights and addresses the need for time-aware models that perform robustly in multilingual scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2502_08825
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Examining and Adapting Time for Multilingual Classification via Mixture of Temporal Experts
Liu, Weisi
Han, Guangzeng
Huang, Xiaolei
Computation and Language
Time is implicitly embedded in classification process: classifiers are usually built on existing data while to be applied on future data whose distributions (e.g., label and token) may change. However, existing state-of-the-art classification models merely consider the temporal variations and primarily focus on English corpora, which leaves temporal studies less explored, let alone under multilingual settings. In this study, we fill the gap by treating time as domains (e.g., 2024 vs. 2025), examining temporal effects, and developing a domain adaptation framework to generalize classifiers over time on multiple languages. Our framework proposes Mixture of Temporal Experts (MoTE) to leverage both semantic and data distributional shifts to learn and adapt temporal trends into classification models. Our analysis shows classification performance varies over time across different languages, and we experimentally demonstrate that MoTE can enhance classifier generalizability over temporal data shifts. Our study provides analytic insights and addresses the need for time-aware models that perform robustly in multilingual scenarios.
title Examining and Adapting Time for Multilingual Classification via Mixture of Temporal Experts
topic Computation and Language
url https://arxiv.org/abs/2502.08825