:: Library Catalog

Bibliographic Details
Main Authors:	Veisi, Ali; Amirzadeh, Hamidreza; Mansourian, Amir
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2503.08067

Similar Items

Context-aware Rotary Position Embedding
by: Veisi, Ali, et al.
Published: (2025)

How Language Models Prioritize Contextual Grammatical Cues?
by: Amirzadeh, Hamidreza, et al.
Published: (2024)

data2lang2vec: Data Driven Typological Features Completion
by: Amirzadeh, Hamidreza, et al.
Published: (2024)

In-Context Learning (and Unlearning) of Length Biases
by: Schoch, Stephanie, et al.
Published: (2025)

ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
by: Xiong, Jing, et al.
Published: (2025)

DAPE: Data-Adaptive Positional Encoding for Length Extrapolation
by: Zheng, Chuanyang, et al.
Published: (2024)

CLEX: Continuous Length Extrapolation for Large Language Models
by: Chen, Guanzheng, et al.
Published: (2023)

Extrapolation by Association: Length Generalization Transfer in Transformers
by: Cai, Ziyang, et al.
Published: (2025)

Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
by: Farhadipour, Aref, et al.
Published: (2023)

Bayesian Network Fusion of Large Language Models for Sentiment Analysis
by: Amirzadeh, Rasoul, et al.
Published: (2025)

Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms
by: Li, Kewei, et al.
Published: (2025)

Length Extrapolation of Transformers: A Survey from the Perspective of Positional Encoding
by: Zhao, Liang, et al.
Published: (2023)

Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation
by: Bianchessi, Arthur S., et al.
Published: (2025)

From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers
by: Duan, Shaoxiong, et al.
Published: (2023)

DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
by: Zheng, Chuanyang, et al.
Published: (2024)

Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation
by: Lu, Yi, et al.
Published: (2025)

Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
by: Le, Hung, et al.
Published: (2024)

Squeezed Attention: Accelerating Long Context Length LLM Inference
by: Hooper, Coleman, et al.
Published: (2024)

DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search
by: Yang, Lei, et al.
Published: (2024)

KurdSTS: The Kurdish Semantic Textual Similarity
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)

KuBERT: Central Kurdish BERT Model and Its Application for Sentiment Analysis
by: Awlla, Kozhin muhealddin, et al.
Published: (2025)

TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
by: Wu, Wei, et al.
Published: (2024)

The Role of Orthographic Consistency in Multilingual Embedding Models for Text Classification in Arabic-Script Languages
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)

Position as Probability: Self-Supervised Transformers that Think Past Their Training for Length Extrapolation
by: Lee, Philip Heejun
Published: (2025)

Evaluating Biases in Context-Dependent Health Questions
by: Levy, Sharon, et al.
Published: (2024)

Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models
by: Gao, Bo, et al.
Published: (2025)

Enhancing Kurdish Text-to-Speech with Native Corpus Training: A High-Quality WaveGlow Vocoder Approach
by: Abdullah, Abdulhady Abas, et al.
Published: (2024)

A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)
by: Li, Yan, et al.
Published: (2025)

Recall with Reasoning: Chain-of-Thought Distillation for Mamba's Long-Context Memory and Extrapolation
by: Ma, Junyu, et al.
Published: (2025)

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
by: He, Zhenyu, et al.
Published: (2024)

Systematic Biases in LLM Simulations of Debates
by: Taubenfeld, Amir, et al.
Published: (2024)

Extrapolation Merging: Keep Improving With Extrapolation and Merging
by: Lin, Yiguan, et al.
Published: (2025)

Base of RoPE Bounds Context Length
by: Men, Xin, et al.
Published: (2024)

Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
by: Lu, Junru, et al.
Published: (2024)

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
by: Xiao, Chaojun, et al.
Published: (2024)

Revisiting Context Choices for Context-aware Machine Translation
by: Rikters, Matīss, et al.
Published: (2021)

Positional Biases Shift as Inputs Approach Context Window Limits
by: Veseli, Blerta, et al.
Published: (2025)

The Impact of Role Design in In-Context Learning for Large Language Models
by: Rouzegar, Hamidreza, et al.
Published: (2025)

Bootstrap Your Own Context Length
by: Wang, Liang, et al.
Published: (2024)

CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs
by: Nikeghbal, Nafiseh, et al.
Published: (2025)