Saved in:
| Main Author: | Singh, Arth |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08557 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EMA Is Not All You Need: Mapping the Boundary Between Structure and Content in Recurrent Context
by: Singh, Arth
Published: (2026)
by: Singh, Arth
Published: (2026)
Machine Unlearning for Masked Diffusion Language Models
by: Lee, Georu, et al.
Published: (2026)
by: Lee, Georu, et al.
Published: (2026)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
by: Fadli, Samih
Published: (2025)
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
by: Pfau, Jacob, et al.
Published: (2024)
by: Pfau, Jacob, et al.
Published: (2024)
Truth as a Compression Artifact in Language Model Training
by: Krestnikov, Konstantin
Published: (2026)
by: Krestnikov, Konstantin
Published: (2026)
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models
by: Zhang, Gongbo, et al.
Published: (2026)
by: Zhang, Gongbo, et al.
Published: (2026)
Intention Collapse: Intention-Level Metrics for Reasoning in Language Models
by: Vera, Patricio
Published: (2026)
by: Vera, Patricio
Published: (2026)
Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models
by: Wang, Yating, et al.
Published: (2026)
by: Wang, Yating, et al.
Published: (2026)
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
by: Arnesen, Samuel, et al.
Published: (2024)
by: Arnesen, Samuel, et al.
Published: (2024)
Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence
by: Ren, Wanying, et al.
Published: (2026)
by: Ren, Wanying, et al.
Published: (2026)
Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation
by: Bouchekif, Abdessalam, et al.
Published: (2025)
by: Bouchekif, Abdessalam, et al.
Published: (2025)
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
by: Zhang, Yuxuan
Published: (2025)
by: Zhang, Yuxuan
Published: (2025)
MeMo: Towards Language Models with Associative Memory Mechanisms
by: Zanzotto, Fabio Massimo, et al.
Published: (2025)
by: Zanzotto, Fabio Massimo, et al.
Published: (2025)
Why Models Know But Don't Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models
by: Young, Richard J.
Published: (2026)
by: Young, Richard J.
Published: (2026)
RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences
by: Zhou, Yangyang, et al.
Published: (2026)
by: Zhou, Yangyang, et al.
Published: (2026)
Towards Understanding Sycophancy in Language Models
by: Sharma, Mrinank, et al.
Published: (2023)
by: Sharma, Mrinank, et al.
Published: (2023)
ImmigrationQA: A Source-Grounded Dataset and Small-Model Adaptation for U.S. Immigration Law
by: Shportun, Nazarii
Published: (2026)
by: Shportun, Nazarii
Published: (2026)
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
by: Yordanov, Yordan, et al.
Published: (2026)
by: Yordanov, Yordan, et al.
Published: (2026)
Towards Ontology-Enhanced Representation Learning for Large Language Models
by: Ronzano, Francesco, et al.
Published: (2024)
by: Ronzano, Francesco, et al.
Published: (2024)
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model
by: Pospieszny, Przemek, et al.
Published: (2025)
by: Pospieszny, Przemek, et al.
Published: (2025)
Evaluating Large Language Models for IUCN Red List Species Information
by: Uryu, Shinya
Published: (2025)
by: Uryu, Shinya
Published: (2025)
Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
by: Adapala, Sai Teja Reddy
Published: (2025)
by: Adapala, Sai Teja Reddy
Published: (2025)
Domain-Specific Pretraining of Language Models: A Comparative Study in the Medical Field
by: Kerner, Tobias
Published: (2024)
by: Kerner, Tobias
Published: (2024)
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
by: Cui, Sasha, et al.
Published: (2025)
by: Cui, Sasha, et al.
Published: (2025)
CRAFT: Clustered Regression for Adaptive Filtering of Training data
by: Panda, Parthasarathi, et al.
Published: (2026)
by: Panda, Parthasarathi, et al.
Published: (2026)
Mitigating Hallucinations in Zero-Shot Scientific Summarisation: A Pilot Study
by: Jaaouine, Imane, et al.
Published: (2025)
by: Jaaouine, Imane, et al.
Published: (2025)
The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models
by: Liu, Ming
Published: (2026)
by: Liu, Ming
Published: (2026)
Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
by: Cacioli, Jon-Paul
Published: (2026)
by: Cacioli, Jon-Paul
Published: (2026)
Beyond Hallucinations: A Composite Score for Measuring Reliability in Open-Source Large Language Models
by: Salla, Rohit Kumar, et al.
Published: (2025)
by: Salla, Rohit Kumar, et al.
Published: (2025)
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback
by: Juzek, Tom S., et al.
Published: (2025)
by: Juzek, Tom S., et al.
Published: (2025)
The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models
by: Wu, Shuai, et al.
Published: (2026)
by: Wu, Shuai, et al.
Published: (2026)
ELMTEX: Fine-Tuning Large Language Models for Structured Clinical Information Extraction. A Case Study on Clinical Reports
by: Guluzade, Aynur, et al.
Published: (2025)
by: Guluzade, Aynur, et al.
Published: (2025)
Distilling Self-Consistency into Verbal Confidence: A Pre-Registered Negative Result and Post-Hoc Rescue on Gemma 3 4B
by: Cacioli, Jon-Paul
Published: (2026)
by: Cacioli, Jon-Paul
Published: (2026)
Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning
by: Cacioli, Jon-Paul
Published: (2026)
by: Cacioli, Jon-Paul
Published: (2026)
Align and Shine: Building High-Quality Sentence-Aligned Corpora for Multilingual Text Simplification
by: Hilasaca, Kenji, et al.
Published: (2026)
by: Hilasaca, Kenji, et al.
Published: (2026)
Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs
by: Keeman, Michael
Published: (2026)
by: Keeman, Michael
Published: (2026)
The Pragmatic Persona: Discovering LLM Persona through Bridging Inference
by: Yang, Jisoo, et al.
Published: (2026)
by: Yang, Jisoo, et al.
Published: (2026)
UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop
by: Shafique, Muhammad Ali, et al.
Published: (2026)
by: Shafique, Muhammad Ali, et al.
Published: (2026)
Eyla: Toward an Identity-Anchored LLM Architecture with Integrated Biological Priors -- Vision, Implementation Attempt, and Lessons from AI-Assisted Development
by: Aditto, Arif
Published: (2026)
by: Aditto, Arif
Published: (2026)
KAConvText: Novel Approach to Burmese Sentence Classification using Kolmogorov-Arnold Convolution
by: Thu, Ye Kyaw, et al.
Published: (2025)
by: Thu, Ye Kyaw, et al.
Published: (2025)
Similar Items
-
EMA Is Not All You Need: Mapping the Boundary Between Structure and Content in Recurrent Context
by: Singh, Arth
Published: (2026) -
Machine Unlearning for Masked Diffusion Language Models
by: Lee, Georu, et al.
Published: (2026) -
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025) -
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
by: Pfau, Jacob, et al.
Published: (2024) -
Truth as a Compression Artifact in Language Model Training
by: Krestnikov, Konstantin
Published: (2026)