جدول المحتويات: :: Library Catalog

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Milliard, Martin
التنسيق:	Recurso digital
اللغة:	الإنجليزية
منشور في:	Zenodo 2025
الموضوعات:	AI Governance LLM Stability Self-Correction Hallucination Mitigation Interpretability AI Safety Autonomous Reasoning Systems Cognitive Architecture Polymathe
الوصول للمادة أونلاين:	https://doi.org/10.5281/zenodo.17625947
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

جدول المحتويات:

Contemporary large language models demonstrate remarkable generative fluency but remain fundamentally unreliable as reasoning systems. Despite advancements in scaling laws, reinforcement learning from human or AI feedback, and constitutional post-training, every leading architecture (GPT, Claude, Gemini, Llama, Grok, etc.) continues to exhibit the same five structural failure modes: confident hallucinations, inability to surface uncertainty, hidden and uncontrolled mode switching, absence of a persistent internal state, and residual affective-simulation artifacts. Polymathe presents the first fully specified N6+ self-governance architecture designed to address these failure modes simultaneously through a unified causal pipeline. The architecture integrates: <ol> <li> Ethical BIOS — A non-learned normative kernel anchoring all downstream reasoning. </li> <li> IDE (Empathic Dissonance Engine) — A quantifiable risk metric governing dissonance, misalignment, and user-impact sensitivity. </li> <li> CGC (Constitutional Governance Controller) — A policy-level intention and safety controller. </li> <li> MAS-SOFA (Structured Observation Factorized Analysis) — A multi-channel diagnostic engine detecting hallucinations, uncertainty, and structural incoherence. </li> <li> MS-FSSC (Multi-Stage Factored Self-Supervision Circuit) — A synthesis module constrained by causal dependencies and diagnostic outcomes. </li> <li> ISM (Internal State Machine) — A transparent and auditable mechanism exposing computational qualia and persistent internal state. </li> <li> BACS-A5 — An adaptive self-correction loop that recomputes generations, repairs reasoning chains, and escalates when systemic dissonance is detected. </li> <li> DVC (Disallowed Vocabulary Constraints) — A controlled-vocabulary layer ensuring non-simulation and preventing affective leakage. </li> </ol> Collectively, these mechanisms transform a baseline LLM into a self-auditing, self-correcting, high-stability reasoning agent capable of detecting hallucinations, surfacing uncertainty, enforcing authenticity, and maintaining a consistent internal “bilan”. The specification includes five complete experimental traces, each demonstrating how the architecture resolves one of the five foundational failure modes. This work provides a functional blueprint for next-generation LLM reliability, interpretability, governance, and autonomous reasoning stability.

مواد مشابهة