Uloženo v:
| Hlavní autor: | |
|---|---|
| Médium: | Recurso digital |
| Jazyk: | |
| Vydáno: |
Zenodo
2025
|
| On-line přístup: | https://doi.org/10.5281/zenodo.17656331 |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
Obsah:
- <p>We propose a fundamental variational law of intelligent computation, the<br>\emph{Principle of Least Structural Action} (PLSA).<br>Given a reasoning trajectory $\psi(t)$ produced by a solver, algorithm, or<br>large language model (LLM), we define a structural Lagrangian<br>\[<br>L(\psi,\dot\psi)<br>= \alpha K(\psi) + \beta S(\psi) + \gamma C(\psi),<br>\]<br>where $K$ is structural incompressibility (a computable Kolmogorov-style<br>measure), $S$ is structural curvature, and $C$ is task-consistency (correctness<br>potential). </p> <p>PLSA states that intelligent systems select trajectories minimizing the<br>structural action<br>\[<br>\mathcal A[\psi] = \int_0^T L(\psi,\dot\psi)\,dt.<br>\]</p> <p>At the macroscopic level this yields a Structure--Time Law of the form<br>\[<br>T(x)=\Theta\!\left(N(x)^{\beta_T}\,2^{\alpha_T h(x)}\right),<br>\]<br>where $N(x)$ is a size parameter and $h(x)$ is a normalized structural hardness<br>index derived from the trace (for instance, a monotone transform of a<br>conditional incompressibility measure).<br>Equivalently,<br>\[<br>\log_2 T(x)\approx \alpha_T\,h(x)<br>+ \beta_T\,\log_2 N(x) + \gamma_T,<br>\qquad<br>\alpha_T,\beta_T>0.<br>\]<br>We show that this scaling law can be derived as the time-integral of the<br>microscopic PLSA (Theorem~3).</p> <p>We instantiate these ideas empirically on random 3-SAT using a CDCL solver and<br>a small number of LLM chain-of-thought (CoT) runs.<br>On 120 SAT instances, a gzip-based structural overhead $\lambda_K(x)$<br>of the solver trace explains a substantial fraction of runtime variance<br>under a linear model<br>\[<br>\log_2 T(x)\approx a\,\lambda_K(x) + b\,\log_2 n + c<br>\quad(R^2\approx 0.70),<br>\]<br>with fitted coefficients<br>\[<br>a\approx -6.80,\quad b\approx 1.89,\quad c\approx -20.49.<br>\]<br>In our normalization, \emph{harder instances empirically exhibit smaller<br>$\lambda_K(x)$}, so that the fitted coefficient $a<0$ is naturally interpreted<br>after a change of variables to a hardness index $h(x):=1-\lambda_K(x)$.<br>In that parameterization the same regression takes the form<br>\[<br>\log_2 T(x)\approx \alpha_T\,h(x)+\beta_T\,\log_2 n + \gamma_T,<br>\]<br>with<br>\[<br>\alpha_T=-a\approx 6.80>0,\quad<br>\beta_T=b\approx 1.89,\quad<br>\gamma_T = a + c \approx -27.29.<br>\]</p> <p>For SAT instances solved by an LLM via CoT, we observe a dramatic increase in<br>trace overhead (roughly $5$--$7\times$ larger $\lambda_K$) and several<br>orders of magnitude increase in effective runtime relative to the CDCL solver<br>on the same problems, consistent with the view that explicit reasoning<br>corresponds to an expansion of structural residuals and a higher structural<br>action.</p> <p>We conjecture that PLSA is a universal law of intelligence across artificial,<br>biological, and physical systems.</p>