:: Library Catalog

Bibliographic Details
Main Author:	Zhou, Hongxu
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Computation and Language
Online Access:	https://arxiv.org/abs/2604.05923

Similar Items

Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs
by: Hejabi, Parsa, et al.
Published: (2025)

Lost in State Space: Probing Frozen Mamba Representations
by: Wagh, Bhagyashree, et al.
Published: (2026)

The Illusion of State in State-Space Models
by: Merrill, William, et al.
Published: (2024)

State Space Models as Foundation Models: A Control Theoretic Overview
by: Alonso, Carmen Amo, et al.
Published: (2024)

Task Structure Reverses Layerwise State Encoding in Sequence Models
by: Jiang, Yuhang
Published: (2026)

Monitoring Latent World States in Language Models with Propositional Probes
by: Feng, Jiahai, et al.
Published: (2024)

Rethinking Token Reduction for State Space Models
by: Zhan, Zheng, et al.
Published: (2024)

UNDO: Understanding Distillation as Optimization
by: Jain, Kushal, et al.
Published: (2025)

Parameter-Efficient Fine-Tuning of State Space Models
by: Galim, Kevin, et al.
Published: (2024)

On Pruning State-Space LLMs
by: Ghattas, Tamer, et al.
Published: (2025)

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
by: Lv, Xingtai, et al.
Published: (2025)

MambaByte: Token-free Selective State Space Model
by: Wang, Junxiong, et al.
Published: (2024)

LOCOST: State-Space Models for Long Document Abstractive Summarization
by: Bronnec, Florian Le, et al.
Published: (2024)

Mimetic Initialization Helps State Space Models Learn to Recall
by: Trockman, Asher, et al.
Published: (2024)

MatMamba: A Matryoshka State Space Model
by: Shukla, Abhinav, et al.
Published: (2024)

The Expressive Capacity of State Space Models: A Formal Language Perspective
by: Sarrof, Yash, et al.
Published: (2024)

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
by: Ren, Liliang, et al.
Published: (2024)

Studying the Soupability of Documents in State Space Models
by: Jafari, Yasaman, et al.
Published: (2025)

Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
by: Nunez, Elvis, et al.
Published: (2024)

Stateful KV Cache Management for LLMs: Balancing Space, Time, Accuracy, and Positional Fidelity
by: Poudel, Pratik
Published: (2025)

Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)

DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models
by: He, Wei, et al.
Published: (2024)

Geometric Organization of Cognitive States in Transformer Embedding Spaces
by: Zhao, Sophie
Published: (2025)

Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling
by: Xiao, Tim Z., et al.
Published: (2025)

On Structured State-Space Duality
by: Hu, Jerry Yao-Chieh, et al.
Published: (2025)

Rethinking State Tracking in Recurrent Models Through Error Control Dynamics
by: Chung, Jiwan, et al.
Published: (2026)

MemMamba: Rethinking Memory Patterns in State Space Model
by: Wang, Youjin, et al.
Published: (2025)

PICASO: Permutation-Invariant Context Composition with State Space Models
by: Liu, Tian Yu, et al.
Published: (2025)

Semantic Structure of Feature Space in Large Language Models
by: Kozlowski, Austin C., et al.
Published: (2026)

A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures
by: Hoang, Nhat M., et al.
Published: (2025)

Enhanced Structured State Space Models via Grouped FIR Filtering and Attention Sink Mechanisms
by: Meng, Tian, et al.
Published: (2024)

Sessa: Selective State Space Attention
by: Horbatko, Liubomyr
Published: (2026)

Sectoral Coupling in Linguistic State Space
by: Dumbrava, Sebastian
Published: (2025)

Probing Semantic Routing in Large Mixture-of-Expert Models
by: Olson, Matthew Lyle, et al.
Published: (2025)

Taipan: Efficient and Expressive State Space Language Models with Selective Attention
by: Van Nguyen, Chien, et al.
Published: (2024)

On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages
by: Terzić, Aleksandar, et al.
Published: (2024)

Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
by: Baruah, Trinayan, et al.
Published: (2025)

Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula
by: Blouir, Sam, et al.
Published: (2024)

Repeat After Me: Transformers are Better than State Space Models at Copying
by: Jelassi, Samy, et al.
Published: (2024)

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
by: Pióro, Maciej, et al.
Published: (2024)