Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Untila, Octavian
Format:	Preprint
Published:	2026
Subjects:	Software Engineering Artificial Intelligence Multiagent Systems D.2.4; I.2.6
Online Access:	https://arxiv.org/abs/2603.21149
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

An autonomous AI ecosystem (SUBSTRATE S3), generating product specifications without explicit instructions about formal methods, independently proposed the use of Z3 SMT solver across six distinct domains of AI safety: verification of LLM-generated code, tool API safety for AI agents, post-distillation reasoning correctness, CLI command validation, hardware assembly verification, and smart contract safety. These convergent discoveries, occurring across 8 products over 13 days with Jaccard similarity below 15% between variants, suggest that formal verification is not merely a useful technique for AI safety but an emergent property of any sufficiently complex system reasoning about its own safety. We propose a unified framework (substrate-guard) that applies Z3-based verification across all six output classes through a common API, and evaluate it on 181 test cases across five implemented domains, achieving 100% classification accuracy with zero false positives and zero false negatives. Our framework detected real bugs that empirical testing would miss, including an INT_MIN overflow in branchless RISC-V assembly and mathematically proved that unconstrained string parameters in tool APIs are formally unverifiable.

Similar Items