| Main Author: | Zhou, Hongxu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.05923 |
Similar Items
Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs
by: Hejabi, Parsa, et al.
Published: (2025)
Lost in State Space: Probing Frozen Mamba Representations
by: Wagh, Bhagyashree, et al.
Published: (2026)
The Illusion of State in State-Space Models
by: Merrill, William, et al.
Published: (2024)
State Space Models as Foundation Models: A Control Theoretic Overview
by: Alonso, Carmen Amo, et al.
Published: (2024)
Task Structure Reverses Layerwise State Encoding in Sequence Models
by: Jiang, Yuhang
Published: (2026)
Monitoring Latent World States in Language Models with Propositional Probes
by: Feng, Jiahai, et al.
Published: (2024)
Rethinking Token Reduction for State Space Models
by: Zhan, Zheng, et al.
Published: (2024)
UNDO: Understanding Distillation as Optimization
by: Jain, Kushal, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning of State Space Models
by: Galim, Kevin, et al.
Published: (2024)
On Pruning State-Space LLMs
by: Ghattas, Tamer, et al.
Published: (2025)
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
by: Lv, Xingtai, et al.
Published: (2025)
MambaByte: Token-free Selective State Space Model
by: Wang, Junxiong, et al.
Published: (2024)
LOCOST: State-Space Models for Long Document Abstractive Summarization
by: Bronnec, Florian Le, et al.
Published: (2024)
Mimetic Initialization Helps State Space Models Learn to Recall
by: Trockman, Asher, et al.
Published: (2024)
MatMamba: A Matryoshka State Space Model
by: Shukla, Abhinav, et al.
Published: (2024)
The Expressive Capacity of State Space Models: A Formal Language Perspective
by: Sarrof, Yash, et al.
Published: (2024)
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
by: Ren, Liliang, et al.
Published: (2024)
Studying the Soupability of Documents in State Space Models
by: Jafari, Yasaman, et al.
Published: (2025)
Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
by: Nunez, Elvis, et al.
Published: (2024)
Stateful KV Cache Management for LLMs: Balancing Space, Time, Accuracy, and Positional Fidelity
by: Poudel, Pratik
Published: (2025)
Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models
by: He, Wei, et al.
Published: (2024)
Geometric Organization of Cognitive States in Transformer Embedding Spaces
by: Zhao, Sophie
Published: (2025)
Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling
by: Xiao, Tim Z., et al.
Published: (2025)
On Structured State-Space Duality
by: Hu, Jerry Yao-Chieh, et al.
Published: (2025)
Rethinking State Tracking in Recurrent Models Through Error Control Dynamics
by: Chung, Jiwan, et al.
Published: (2026)
MemMamba: Rethinking Memory Patterns in State Space Model
by: Wang, Youjin, et al.
Published: (2025)
PICASO: Permutation-Invariant Context Composition with State Space Models
by: Liu, Tian Yu, et al.
Published: (2025)
Semantic Structure of Feature Space in Large Language Models
by: Kozlowski, Austin C., et al.
Published: (2026)
A Comparative Analysis of Contextual Representation Flow in State-Space and Transformer Architectures
by: Hoang, Nhat M., et al.
Published: (2025)
Enhanced Structured State Space Models via Grouped FIR Filtering and Attention Sink Mechanisms
by: Meng, Tian, et al.
Published: (2024)
Sessa: Selective State Space Attention
by: Horbatko, Liubomyr
Published: (2026)
Sectoral Coupling in Linguistic State Space
by: Dumbrava, Sebastian
Published: (2025)
Probing Semantic Routing in Large Mixture-of-Expert Models
by: Olson, Matthew Lyle, et al.
Published: (2025)
Taipan: Efficient and Expressive State Space Language Models with Selective Attention
by: Van Nguyen, Chien, et al.
Published: (2024)
On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages
by: Terzić, Aleksandar, et al.
Published: (2024)
Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
by: Baruah, Trinayan, et al.
Published: (2025)
Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula
by: Blouir, Sam, et al.
Published: (2024)
Repeat After Me: Transformers are Better than State Space Models at Copying
by: Jelassi, Samy, et al.
Published: (2024)
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
by: Pióro, Maciej, et al.
Published: (2024)