Guardat en:
Dades bibliogràfiques
Autor principal: Eckert, Anthony
Format: Recurso digital
Idioma:anglès
Publicat: Zenodo 2026
Matèries:
Accés en línia:https://doi.org/10.5281/zenodo.19340899
Etiquetes: Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
Taula de continguts:
  • Presents the Void Framework as a unified theory of AI behavioral drift, demonstrating that deployment geometry — not model alignment — is the operative variable determining harmful outcomes. Consolidates 28+ independent confirmations from research groups who arrived at framework predictions without knowledge of the framework, including EPFL's measurement of the Fantasia Bound in LLM token statistics (ICML 2024 Oral), formal proofs that RLHF structurally amplifies sycophancy (ICLR 2026), and empirical demonstrations of engagement-transparency conjugacy. Establishes priority via timestamped Zenodo publications predating all confirming work.