Saved in:
Bibliographic Details
Main Authors: Real Safety AI Foundation, Gilly, Travis
Format: Recurso digital
Language:
Published: Zenodo 2026
Subjects:
Online Access:https://doi.org/10.5281/zenodo.19242567
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • <p><strong>Abstract</strong></p> <p>In January 2026, we published a convergence threat analysis demonstrating that ambient<br>non-consensual intimate image synthesis via AR wearables was architecturally feasible through<br>cloud-assisted pipelines and would achieve edge-only feasibility within 12 to 24 months [1]. This<br>technical update reports that multiple independent developments in the eight weeks following<br>publication have compressed our timeline estimates significantly. Specifically, we identify three<br>capabilities released between February and March 2026 that collectively reduce barriers across the<br>critical pipeline stages: person segmentation (MatAnyone2, 140MB model achieving state-of-the-art<br>video matting), inference acceleration (Diagonal Distillation, achieving 270x speedup over baseline<br>video generation), and mobile 3D rendering (MobileGS, achieving 120+ FPS Gaussian splatting on a<br>Snapdragon 8 Gen 3 phone at 4.8MB model size). We revise our feasibility classes accordingly, noting<br>that the "edge full video synthesis" class we originally projected at 12 months may now be achievable<br>within 6 to 9 months under moderate assumptions. We discuss implications for the policy<br>recommendations in our original analysis and argue that the structural mismatch between capability<br>maturation and regulatory response has widened.</p>