Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Magarshak, Gregory
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence Information Theory Neural and Evolutionary Computing 68Q32, 68T05, 94A15 I.2.6; I.2.8; F.2.2
Online Access:	https://arxiv.org/abs/2605.04069
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915981838778368
author	Magarshak, Gregory
author_facet	Magarshak, Gregory
contents	We introduce LAWS (Learning from Actual Workloads Symbolically), a self-certifying inference caching architecture that builds a growing library of certified expert functions from deployment observations. Each expert covers a region of input space defined by a node in the Probabilistic Language Trie (PLT) of the base model and carries a formal error bound holding uniformly over all inputs. The central result is a self-certification theorem: for any input x, the LAWS approximation error is bounded by epsilon_fit + 2Lambda(W)C_E, where Lambda(W) is the model Lipschitz constant, C_E is the maximum embedding diameter, and epsilon_fit is the expert training error -- all checkable at deployment time without ground truth. We prove that LAWS generalizes both Mixture-of-Experts and KV prefix caching as special cases and is strictly more expressive than any fixed-K MoE or finite cache. Further results include a monotone hit rate theorem (any-match routing ensures coverage only increases), an expert library growth rate of O(2^H log N) where H is workload entropy, a fleet learning convergence theorem with Omega(K) speedup for K-unit fleets, and an over-the-air update bandwidth bound. We conjecture that LAWS is acquisition-optimal among stationary online caching algorithms and that the effective Lipschitz constant on the training distribution grows polynomially rather than exponentially in depth. Applications are developed for LLM inference, robotic control, and multi-agent edge deployment.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_04069
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	LAWS: Learning from Actual Workloads Symbolically -- A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment Magarshak, Gregory Machine Learning Artificial Intelligence Information Theory Neural and Evolutionary Computing 68Q32, 68T05, 94A15 I.2.6; I.2.8; F.2.2 We introduce LAWS (Learning from Actual Workloads Symbolically), a self-certifying inference caching architecture that builds a growing library of certified expert functions from deployment observations. Each expert covers a region of input space defined by a node in the Probabilistic Language Trie (PLT) of the base model and carries a formal error bound holding uniformly over all inputs. The central result is a self-certification theorem: for any input x, the LAWS approximation error is bounded by epsilon_fit + 2Lambda(W)C_E, where Lambda(W) is the model Lipschitz constant, C_E is the maximum embedding diameter, and epsilon_fit is the expert training error -- all checkable at deployment time without ground truth. We prove that LAWS generalizes both Mixture-of-Experts and KV prefix caching as special cases and is strictly more expressive than any fixed-K MoE or finite cache. Further results include a monotone hit rate theorem (any-match routing ensures coverage only increases), an expert library growth rate of O(2^H log N) where H is workload entropy, a fleet learning convergence theorem with Omega(K) speedup for K-unit fleets, and an over-the-air update bandwidth bound. We conjecture that LAWS is acquisition-optimal among stationary online caching algorithms and that the effective Lipschitz constant on the training distribution grows polynomially rather than exponentially in depth. Applications are developed for LLM inference, robotic control, and multi-agent edge deployment.
title	LAWS: Learning from Actual Workloads Symbolically -- A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment
topic	Machine Learning Artificial Intelligence Information Theory Neural and Evolutionary Computing 68Q32, 68T05, 94A15 I.2.6; I.2.8; F.2.2
url	https://arxiv.org/abs/2605.04069

Similar Items