Saved in:
| Main Authors: | Veisi, Ali, Amirzadeh, Hamidreza, Mansourian, Amir |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.08067 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Context-aware Rotary Position Embedding
by: Veisi, Ali, et al.
Published: (2025)
by: Veisi, Ali, et al.
Published: (2025)
How Language Models Prioritize Contextual Grammatical Cues?
by: Amirzadeh, Hamidreza, et al.
Published: (2024)
by: Amirzadeh, Hamidreza, et al.
Published: (2024)
data2lang2vec: Data Driven Typological Features Completion
by: Amirzadeh, Hamidreza, et al.
Published: (2024)
by: Amirzadeh, Hamidreza, et al.
Published: (2024)
In-Context Learning (and Unlearning) of Length Biases
by: Schoch, Stephanie, et al.
Published: (2025)
by: Schoch, Stephanie, et al.
Published: (2025)
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
by: Xiong, Jing, et al.
Published: (2025)
by: Xiong, Jing, et al.
Published: (2025)
DAPE: Data-Adaptive Positional Encoding for Length Extrapolation
by: Zheng, Chuanyang, et al.
Published: (2024)
by: Zheng, Chuanyang, et al.
Published: (2024)
CLEX: Continuous Length Extrapolation for Large Language Models
by: Chen, Guanzheng, et al.
Published: (2023)
by: Chen, Guanzheng, et al.
Published: (2023)
Extrapolation by Association: Length Generalization Transfer in Transformers
by: Cai, Ziyang, et al.
Published: (2025)
by: Cai, Ziyang, et al.
Published: (2025)
Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment
by: Farhadipour, Aref, et al.
Published: (2023)
by: Farhadipour, Aref, et al.
Published: (2023)
Bayesian Network Fusion of Large Language Models for Sentiment Analysis
by: Amirzadeh, Rasoul, et al.
Published: (2025)
by: Amirzadeh, Rasoul, et al.
Published: (2025)
Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms
by: Li, Kewei, et al.
Published: (2025)
by: Li, Kewei, et al.
Published: (2025)
Length Extrapolation of Transformers: A Survey from the Perspective of Positional Encoding
by: Zhao, Liang, et al.
Published: (2023)
by: Zhao, Liang, et al.
Published: (2023)
Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation
by: Bianchessi, Arthur S., et al.
Published: (2025)
by: Bianchessi, Arthur S., et al.
Published: (2025)
From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers
by: Duan, Shaoxiong, et al.
Published: (2023)
by: Duan, Shaoxiong, et al.
Published: (2023)
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
by: Zheng, Chuanyang, et al.
Published: (2024)
by: Zheng, Chuanyang, et al.
Published: (2024)
Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation
by: Lu, Yi, et al.
Published: (2025)
by: Lu, Yi, et al.
Published: (2025)
Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
by: Le, Hung, et al.
Published: (2024)
by: Le, Hung, et al.
Published: (2024)
Squeezed Attention: Accelerating Long Context Length LLM Inference
by: Hooper, Coleman, et al.
Published: (2024)
by: Hooper, Coleman, et al.
Published: (2024)
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search
by: Yang, Lei, et al.
Published: (2024)
by: Yang, Lei, et al.
Published: (2024)
KurdSTS: The Kurdish Semantic Textual Similarity
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)
KuBERT: Central Kurdish BERT Model and Its Application for Sentiment Analysis
by: Awlla, Kozhin muhealddin, et al.
Published: (2025)
by: Awlla, Kozhin muhealddin, et al.
Published: (2025)
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
by: Wu, Wei, et al.
Published: (2024)
by: Wu, Wei, et al.
Published: (2024)
The Role of Orthographic Consistency in Multilingual Embedding Models for Text Classification in Arabic-Script Languages
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)
by: Abdullah, Abdulhady Abas, et al.
Published: (2025)
Position as Probability: Self-Supervised Transformers that Think Past Their Training for Length Extrapolation
by: Lee, Philip Heejun
Published: (2025)
by: Lee, Philip Heejun
Published: (2025)
Evaluating Biases in Context-Dependent Health Questions
by: Levy, Sharon, et al.
Published: (2024)
by: Levy, Sharon, et al.
Published: (2024)
Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models
by: Gao, Bo, et al.
Published: (2025)
by: Gao, Bo, et al.
Published: (2025)
Enhancing Kurdish Text-to-Speech with Native Corpus Training: A High-Quality WaveGlow Vocoder Approach
by: Abdullah, Abdulhady Abas, et al.
Published: (2024)
by: Abdullah, Abdulhady Abas, et al.
Published: (2024)
A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)
by: Li, Yan, et al.
Published: (2025)
by: Li, Yan, et al.
Published: (2025)
Recall with Reasoning: Chain-of-Thought Distillation for Mamba's Long-Context Memory and Extrapolation
by: Ma, Junyu, et al.
Published: (2025)
by: Ma, Junyu, et al.
Published: (2025)
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
by: He, Zhenyu, et al.
Published: (2024)
by: He, Zhenyu, et al.
Published: (2024)
Systematic Biases in LLM Simulations of Debates
by: Taubenfeld, Amir, et al.
Published: (2024)
by: Taubenfeld, Amir, et al.
Published: (2024)
Extrapolation Merging: Keep Improving With Extrapolation and Merging
by: Lin, Yiguan, et al.
Published: (2025)
by: Lin, Yiguan, et al.
Published: (2025)
Base of RoPE Bounds Context Length
by: Men, Xin, et al.
Published: (2024)
by: Men, Xin, et al.
Published: (2024)
Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
by: Lu, Junru, et al.
Published: (2024)
by: Lu, Junru, et al.
Published: (2024)
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
by: Xiao, Chaojun, et al.
Published: (2024)
by: Xiao, Chaojun, et al.
Published: (2024)
Revisiting Context Choices for Context-aware Machine Translation
by: Rikters, Matīss, et al.
Published: (2021)
by: Rikters, Matīss, et al.
Published: (2021)
Positional Biases Shift as Inputs Approach Context Window Limits
by: Veseli, Blerta, et al.
Published: (2025)
by: Veseli, Blerta, et al.
Published: (2025)
The Impact of Role Design in In-Context Learning for Large Language Models
by: Rouzegar, Hamidreza, et al.
Published: (2025)
by: Rouzegar, Hamidreza, et al.
Published: (2025)
Bootstrap Your Own Context Length
by: Wang, Liang, et al.
Published: (2024)
by: Wang, Liang, et al.
Published: (2024)
CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs
by: Nikeghbal, Nafiseh, et al.
Published: (2025)
by: Nikeghbal, Nafiseh, et al.
Published: (2025)
Similar Items
-
Context-aware Rotary Position Embedding
by: Veisi, Ali, et al.
Published: (2025) -
How Language Models Prioritize Contextual Grammatical Cues?
by: Amirzadeh, Hamidreza, et al.
Published: (2024) -
data2lang2vec: Data Driven Typological Features Completion
by: Amirzadeh, Hamidreza, et al.
Published: (2024) -
In-Context Learning (and Unlearning) of Length Biases
by: Schoch, Stephanie, et al.
Published: (2025) -
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
by: Xiong, Jing, et al.
Published: (2025)