Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Han, Xue, Hu, Qian, Wang, Yitong, Gao, Wenchun, Zhang, Lianlian, Wang, Qing, Mei, Lijun, Deng, Chao, Feng, Junlan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2503.04150
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915564643942400
author	Han, Xue Hu, Qian Wang, Yitong Gao, Wenchun Zhang, Lianlian Wang, Qing Mei, Lijun Deng, Chao Feng, Junlan
author_facet	Han, Xue Hu, Qian Wang, Yitong Gao, Wenchun Zhang, Lianlian Wang, Qing Mei, Lijun Deng, Chao Feng, Junlan
contents	Large language models (LLMs) suffer from temporal misalignment issues especially across long span of time. The issue arises from knowing that LLMs are trained on large amounts of data where temporal information is rather sparse over long times, such as thousands of years, resulting in insufficient learning or catastrophic forgetting by the LLMs. This paper proposes a methodology named "Ticktack" for addressing the LLM's long-time span misalignment in a yearly setting. Specifically, we first propose to utilize the sexagenary year expression instead of the Gregorian year expression employed by LLMs, achieving a more uniform distribution in yearly granularity. Then, we employ polar coordinates to model the sexagenary cycle of 60 terms and the year order within each term, with additional temporal encoding to ensure LLMs understand them. Finally, we present a temporal representational alignment approach for post-training LLMs that effectively distinguishes time points with relevant knowledge, hence improving performance on time-related tasks, particularly over a long period. We also create a long time span benchmark for evaluation. Experimental results prove the effectiveness of our proposal.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_04150
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Temporal Alignment of LLMs through Cycle Encoding for Long-Range Time Representations Han, Xue Hu, Qian Wang, Yitong Gao, Wenchun Zhang, Lianlian Wang, Qing Mei, Lijun Deng, Chao Feng, Junlan Computation and Language Artificial Intelligence Large language models (LLMs) suffer from temporal misalignment issues especially across long span of time. The issue arises from knowing that LLMs are trained on large amounts of data where temporal information is rather sparse over long times, such as thousands of years, resulting in insufficient learning or catastrophic forgetting by the LLMs. This paper proposes a methodology named "Ticktack" for addressing the LLM's long-time span misalignment in a yearly setting. Specifically, we first propose to utilize the sexagenary year expression instead of the Gregorian year expression employed by LLMs, achieving a more uniform distribution in yearly granularity. Then, we employ polar coordinates to model the sexagenary cycle of 60 terms and the year order within each term, with additional temporal encoding to ensure LLMs understand them. Finally, we present a temporal representational alignment approach for post-training LLMs that effectively distinguishes time points with relevant knowledge, hence improving performance on time-related tasks, particularly over a long period. We also create a long time span benchmark for evaluation. Experimental results prove the effectiveness of our proposal.
title	Temporal Alignment of LLMs through Cycle Encoding for Long-Range Time Representations
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2503.04150

Similar Items