| Main Authors: | Dai, Chang, Shan, Hongyu, Song, Mingyang, Liang, Di |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2509.05218 |
Similar Items
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
by: Chen, Yuhan, et al.
Published: (2024)
R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning
by: Shan, Hongyu, et al.
Published: (2025)
HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
by: Li, Haoran, et al.
Published: (2025)
PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models
by: Aggarwal, Arpit
Published: (2024)
VRoPE: Rotary Position Embedding for Video Large Language Models
by: Liu, Zikang, et al.
Published: (2025)
Rotary Positional Embeddings as Phase Modulation: Theoretical Bounds on the RoPE Base for Long-Context Transformers
by: Liu, Feilong
Published: (2026)
SeqPE: Transformer with Sequential Position Encoding
by: Li, Huayang, et al.
Published: (2025)
Bounded Hyperbolic Tangent: A Stable and Efficient Alternative to Pre-Layer Normalization in Large Language Models
by: Byun, Hoyoon, et al.
Published: (2025)
Adaptive 3D-RoPE: Physics-Aligned Rotary Positional Encoding for Wireless Foundation Models
by: Zhang, Chenyu, et al.
Published: (2026)
Rotary Position Encodings for Graphs
by: Reid, Isaac, et al.
Published: (2025)
Revisiting Catastrophic Forgetting in Large Language Model Tuning
by: Li, Hongyu, et al.
Published: (2024)
EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models
by: Zhang, Yifei, et al.
Published: (2026)
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)
Resonance RoPE: Improving Context Length Generalization of Large Language Models
by: Wang, Suyuchen, et al.
Published: (2024)
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition
by: Zhang, Shucong, et al.
Published: (2025)
Mitigating Multilingual Hallucination in Large Vision-Language Models
by: Qu, Xiaoye, et al.
Published: (2024)
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
by: Yao, Yuxuan, et al.
Published: (2024)
Language Models' Factuality Depends on the Language of Inquiry
by: Aggarwal, Tushar, et al.
Published: (2025)
Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings
by: Jang, Suhyung, et al.
Published: (2026)
Temporal Alignment of LLMs through Cycle Encoding for Long-Range Time Representations
by: Han, Xue, et al.
Published: (2025)
RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably
by: Du, Yufeng, et al.
Published: (2026)
REEF: Representation Encoding Fingerprints for Large Language Models
by: Zhang, Jie, et al.
Published: (2024)
Hyperbolic Fine-Tuning for Large Language Models
by: Yang, Menglin, et al.
Published: (2024)
Executing Natural Language-Described Algorithms with Large Language Models: An Investigation
by: Zheng, Xin, et al.
Published: (2024)
Length-Aware Rotary Position Embedding for Text-Speech Alignment
by: Kim, Hyeongju, et al.
Published: (2025)
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
by: Wang, Qingyue, et al.
Published: (2023)
Cognitive Memory in Large Language Models
by: Shan, Lianlei, et al.
Published: (2025)
Large Language Models are Contrastive Reasoners
by: Yao, Liang
Published: (2024)
Optimizing Length Compression in Large Reasoning Models
by: Cheng, Zhengxiang, et al.
Published: (2025)
The Super Weight in Large Language Models
by: Yu, Mengxia, et al.
Published: (2024)
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models
by: Wijesiriwardene, Thilini, et al.
Published: (2023)
PE: A Poincare Explanation Method for Fast Text Hierarchy Generation
by: Chen, Qian, et al.
Published: (2024)
Large Language Model Sourcing: A Survey
by: Pang, Liang, et al.
Published: (2025)
LongSafety: Evaluating Long-Context Safety of Large Language Models
by: Lu, Yida, et al.
Published: (2025)
Revisiting the Graph Reasoning Ability of Large Language Models: Case Studies in Translation, Connectivity and Shortest Path
by: Dai, Xinnan, et al.
Published: (2024)
Semantic Structure in Large Language Model Embeddings
by: Kozlowski, Austin C., et al.
Published: (2025)
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
by: Chen, Yukang, et al.
Published: (2023)
Small Language Model as Data Prospector for Large Language Model
by: Ni, Shiwen, et al.
Published: (2024)
Towards Concise and Adaptive Thinking in Large Reasoning Models: A Survey
by: Zhu, Jason, et al.
Published: (2025)
Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position
by: Xie, Zhixin, et al.
Published: (2025)