:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Xindi, Salmani, Mahsa, Omidi, Parsa, Ren, Xiangyu, Rezagholizadeh, Mehdi, Eshaghi, Armaghan
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2402.02244
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Memory-Augmented Transformers: A Systematic Review from Neuroscience Principles to Enhanced Model Architectures
by: Omidi, Parsa, et al.
Published: (2025)

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
by: Kavehzadeh, Parsa, et al.
Published: (2023)

Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
by: Jamialahmadi, Benyamin, et al.
Published: (2025)

Early Stopping for Large Reasoning Models via Confidence Dynamics
by: Hosseini, Parsa, et al.
Published: (2026)

Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling
by: Fashi, Parsa Ashrafi, et al.
Published: (2026)

Resonance RoPE: Improving Context Length Generalization of Large Language Models
by: Wang, Suyuchen, et al.
Published: (2024)

Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5
by: Lamott, Marcel, et al.
Published: (2024)

Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
by: Huang, Jerry, et al.
Published: (2024)

SLaNC: Static LayerNorm Calibration
by: Salmani, Mahsa, et al.
Published: (2024)

EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
by: Rajabzadeh, Hossein, et al.
Published: (2024)

Zebra-Llama: Towards Extremely Efficient Hybrid Models
by: Yang, Mingyu, et al.
Published: (2025)

QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning
by: Rajabzadeh, Hossein, et al.
Published: (2024)

DTRNet: Dynamic Token Routing Network to Reduce Quadratic Costs in Transformers
by: Sharma, Aman, et al.
Published: (2025)

Towards Practical Tool Usage for Continually Learning LLMs
by: Huang, Jerry, et al.
Published: (2024)

SELF: Self-Extend the Context Length With Logistic Growth Function
by: Dang, Phat Thanh, et al.
Published: (2025)

LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
by: Lu, Peng, et al.
Published: (2023)

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
by: Huang, Jerry, et al.
Published: (2024)

Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption
by: Wang, Wenxiao, et al.
Published: (2025)

Hijacking Large Language Models via Adversarial In-Context Learning
by: Zhou, Xiangyu, et al.
Published: (2023)

Beyond the Prompt in Large Language Models: Comprehension, In-Context Learning, and Chain-of-Thought
by: Jiao, Yuling, et al.
Published: (2026)

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
by: Luo, Feng, et al.
Published: (2026)

A Comprehensive Survey on Long Context Language Modeling
by: Liu, Jiaheng, et al.
Published: (2025)

LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models
by: Kostikova, Aida, et al.
Published: (2025)

Extending Input Contexts of Language Models through Training on Segmented Sequences
by: Karypis, Petros, et al.
Published: (2023)

Hansel: Output Length Controlling Framework for Large Language Models
by: Song, Seoha, et al.
Published: (2024)

Advancing Graph Representation Learning with Large Language Models: A Comprehensive Survey of Techniques
by: Mao, Qiheng, et al.
Published: (2024)

Systematic Evaluation of Optimization Techniques for Long-Context Language Models
by: Ahmed, Ammar, et al.
Published: (2025)

LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
by: Liu, Tianci, et al.
Published: (2024)

Towards Modeling Learner Performance with Large Language Models
by: Neshaei, Seyed Parsa, et al.
Published: (2024)

MULTIVERSE: Exposing Large Language Model Alignment Problems in Diverse Worlds
by: Jin, Xiaolong, et al.
Published: (2024)

A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
by: Liu, Lei, et al.
Published: (2024)

Context-Aware Initialization for Reducing Generative Path Length in Diffusion Language Models
by: Miao, Tongyuan, et al.
Published: (2025)

Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide
by: Szep, Marton, et al.
Published: (2024)

Large Language Model Selection with Limited Annotations
by: Durmazkeser, Yavuz, et al.
Published: (2026)

Model Hemorrhage and the Robustness Limits of Large Language Models
by: Ma, Ziyang, et al.
Published: (2025)

Morality is Contextual: Learning Interpretable Moral Contexts from Human Data with Probabilistic Clustering and Large Language Models
by: Morlat, Geoffroy, et al.
Published: (2025)

LongEmbed: Extending Embedding Models for Long Context Retrieval
by: Zhu, Dawei, et al.
Published: (2024)

A Survey on Mixture of Experts in Large Language Models
by: Cai, Weilin, et al.
Published: (2024)

SortedNet: A Scalable and Generalized Framework for Training Modular Deep Neural Networks
by: Valipour, Mojtaba, et al.
Published: (2023)

Out-of-Context Reasoning in Large Language Models
by: Shaki, Jonathan, et al.
Published: (2025)