:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Sadasivan, Hari
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence I.2.7
Online Access:	https://arxiv.org/abs/2604.11867
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
by: Goldstein, Daniel, et al.
Published: (2025)

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)

Training Dynamics Underlying Language Model Scaling Laws: Loss Deceleration and Zero-Sum Learning
by: Mircea, Andrei, et al.
Published: (2025)

Meta-Learning at Scale for Large Language Models via Low-Rank Amortized Bayesian Meta-Learning
by: Zhang, Liyi, et al.
Published: (2025)

The Geometry of Thought: How Scale Restructures Reasoning In Large Language Models
by: Anderson, Samuel Cyrenius
Published: (2026)

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
by: Salehmohamed, Shoaib Sadiq, et al.
Published: (2026)

Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models
by: Steele, Brady, et al.
Published: (2026)

Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation
by: Venkata, Pruthvinath Jeripity
Published: (2026)

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models
by: Zhang, Gongbo, et al.
Published: (2026)

Distilling Knowledge from Large Language Models: A Concept Bottleneck Model for Hate and Counter Speech Recognition
by: Labadie-Tamayo, Roberto, et al.
Published: (2025)

Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)

Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
by: Yu, Zony, et al.
Published: (2025)

Beyond the Black Box: A Statistical Model for LLM Reasoning and Inference
by: Dalal, Siddhartha, et al.
Published: (2024)

InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
by: Zhang, Bo-Wen, et al.
Published: (2024)

Extracting Small Translation Specialists from LLMs by Aggressively Pruning Experts
by: Martin, Liu O., et al.
Published: (2026)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

Predictable Scale: Part I, Step Law -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining
by: Li, Houyi, et al.
Published: (2025)

Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models
by: Gilhuly, Colleen, et al.
Published: (2025)

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models
by: Liu, Ming
Published: (2026)

Social Learning through Interactions with Other Agents: A Survey
by: Hillier, Dylan, et al.
Published: (2024)

A Scalable Communication Protocol for Networks of Large Language Models
by: Marro, Samuele, et al.
Published: (2024)

Behavioural Analysis of Alignment Faking
by: Hadida, Nathaniel Mitrani, et al.
Published: (2026)

Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation Space
by: Vasilenko, Vladimir
Published: (2026)

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors
by: Zhang, Zhiwei, et al.
Published: (2026)

XShare: Collaborative in-Batch Expert Sharing for Faster MoE Inference
by: Vankov, Daniil, et al.
Published: (2026)

GNN for Structural Displacement Prediction
by: Chang, Hung-Fu, et al.
Published: (2026)

Attention Drift: What Autoregressive Speculative Decoding Models Learn
by: Eldenk, Doğaç, et al.
Published: (2026)

How Does Unfaithful Reasoning Emerge from Autoregressive Training? A Study of Synthetic Experiments
by: Wang, Fuxin, et al.
Published: (2026)

CoDA: Coding LM via Diffusion Adaptation
by: Chen, Haolin, et al.
Published: (2025)

End-to-End Optimization of LLM-Driven Multi-Agent Search Systems via Heterogeneous-Group-Based Reinforcement Learning
by: Chen, Guanzhong, et al.
Published: (2025)

Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
by: Zhang, Zhengxin, et al.
Published: (2024)

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression
by: Lasby, Mike, et al.
Published: (2025)

Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success
by: Griffin, Ben, et al.
Published: (2025)

Beyond Memorization: Violating Privacy Via Inference with Large Language Models
by: Staab, Robin, et al.
Published: (2023)

xInv: Explainable Optimization of Inverse Problems
by: Memery, Sean, et al.
Published: (2025)

Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents
by: Gao, Heyang, et al.
Published: (2025)

Explainable AI for Smart Greenhouse Control: Interpretability of Temporal Fusion Transformer in the Internet of Robotic Things
by: Bashir, Muhammad Jawad, et al.
Published: (2025)

Balancing Efficiency and Effectiveness: An LLM-Infused Approach for Optimized CTR Prediction
by: Zhang, Guoxiao, et al.
Published: (2024)

CortexCompile: Harnessing Cortical-Inspired Architectures for Enhanced Multi-Agent NLP Code Synthesis
by: Ramachandran, Gautham, et al.
Published: (2024)