Saved in:
| Main Author: | Pistillo, Matteo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.16500 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Assurance of Frontier AI Built for National Security
by: Pistillo, Matteo, et al.
Published: (2025)
by: Pistillo, Matteo, et al.
Published: (2025)
Internal Deployment in the AI Act
by: Pistillo, Matteo
Published: (2025)
by: Pistillo, Matteo
Published: (2025)
Defending Compute Thresholds Against Legal Loopholes
by: Pistillo, Matteo, et al.
Published: (2025)
by: Pistillo, Matteo, et al.
Published: (2025)
Pre-Deployment Information Sharing: A Zoning Taxonomy for Precursory Capabilities
by: Pistillo, Matteo, et al.
Published: (2024)
by: Pistillo, Matteo, et al.
Published: (2024)
Backchaining Loss of Control Mitigations from Mission-Specific Benchmarks in National Security
by: Pistillo, Matteo, et al.
Published: (2026)
by: Pistillo, Matteo, et al.
Published: (2026)
The Loss of Control Playbook: Degrees, Dynamics, and Preparedness
by: Stix, Charlotte, et al.
Published: (2025)
by: Stix, Charlotte, et al.
Published: (2025)
Frontier AI Auditing: Toward Rigorous Third-Party Assessment of Safety and Security Practices at Leading AI Companies
by: Brundage, Miles, et al.
Published: (2026)
by: Brundage, Miles, et al.
Published: (2026)
The California Report on Frontier AI Policy
by: Bommasani, Rishi, et al.
Published: (2025)
by: Bommasani, Rishi, et al.
Published: (2025)
Evaluating AI Providers' Frontier Safety Frameworks
by: Stelling, Lily, et al.
Published: (2025)
by: Stelling, Lily, et al.
Published: (2025)
Evaluating the Critical Risks of Amazon's Nova Premier under the Frontier Model Safety Framework
by: Krishna, Satyapriya, et al.
Published: (2025)
by: Krishna, Satyapriya, et al.
Published: (2025)
The Role of AI Safety Institutes in Contributing to International Standards for Frontier AI Safety
by: Fort, Kristina
Published: (2024)
by: Fort, Kristina
Published: (2024)
Safety Cases: A Scalable Approach to Frontier AI Safety
by: Hilton, Benjamin, et al.
Published: (2025)
by: Hilton, Benjamin, et al.
Published: (2025)
AI Behind Closed Doors: a Primer on The Governance of Internal Deployment
by: Stix, Charlotte, et al.
Published: (2025)
by: Stix, Charlotte, et al.
Published: (2025)
Emerging Practices in Frontier AI Safety Frameworks
by: Buhl, Marie Davidsen, et al.
Published: (2025)
by: Buhl, Marie Davidsen, et al.
Published: (2025)
Enabling Frontier Lab Collaboration to Mitigate AI Safety Risks
by: Felstead, Nicholas
Published: (2025)
by: Felstead, Nicholas
Published: (2025)
Towards Safe Multilingual Frontier AI
by: Kanepajs, Artūrs, et al.
Published: (2024)
by: Kanepajs, Artūrs, et al.
Published: (2024)
FORTRESS: Frontier Risk Evaluation for National Security and Public Safety
by: Knight, Christina Q., et al.
Published: (2025)
by: Knight, Christina Q., et al.
Published: (2025)
Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases
by: Feakins, Shaun, et al.
Published: (2026)
by: Feakins, Shaun, et al.
Published: (2026)
User Privacy and Large Language Models: An Analysis of Frontier Developers' Privacy Policies
by: King, Jennifer, et al.
Published: (2025)
by: King, Jennifer, et al.
Published: (2025)
PolicyLLM: Towards Excellent Comprehension of Public Policy for Large Language Models
by: Bao, Han, et al.
Published: (2026)
by: Bao, Han, et al.
Published: (2026)
Toward an African Agenda for AI Safety
by: Segun, Samuel T., et al.
Published: (2025)
by: Segun, Samuel T., et al.
Published: (2025)
ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI
by: Tong, Haibo, et al.
Published: (2026)
by: Tong, Haibo, et al.
Published: (2026)
NeurIPS Should Require Reproducibility Standards for Frontier AI Safety Claims
by: Vishwarupe, Varad, et al.
Published: (2026)
by: Vishwarupe, Varad, et al.
Published: (2026)
Agentic Microphysics: A Manifesto for Generative AI Safety
by: Pierucci, Federico, et al.
Published: (2026)
by: Pierucci, Federico, et al.
Published: (2026)
Towards Resilience and Autonomy-based Approaches for Adolescents Online Safety
by: Park, Jinkyung, et al.
Published: (2025)
by: Park, Jinkyung, et al.
Published: (2025)
Towards Incorporating Researcher Safety into Information Integrity Research Ethics
by: Schafer, Joseph S., et al.
Published: (2023)
by: Schafer, Joseph S., et al.
Published: (2023)
Institutional AI: A Governance Framework for Distributional AGI Safety
by: Pierucci, Federico, et al.
Published: (2026)
by: Pierucci, Federico, et al.
Published: (2026)
A Possibility Frontier Approach to Diverse Talent Selection
by: Natarajan, Neil, et al.
Published: (2025)
by: Natarajan, Neil, et al.
Published: (2025)
Frontier AI developers need an internal audit function
by: Schuett, Jonas
Published: (2023)
by: Schuett, Jonas
Published: (2023)
Generative AI Policies under the Microscope: How CS Conferences Are Navigating the New Frontier in Scholarly Writing
by: Nahar, Mahjabin, et al.
Published: (2024)
by: Nahar, Mahjabin, et al.
Published: (2024)
Phare: A Safety Probe for Large Language Models
by: Jeune, Pierre Le, et al.
Published: (2025)
by: Jeune, Pierre Le, et al.
Published: (2025)
Assessing Computer Science Student Attitudes Towards AI Ethics and Policy
by: Weichert, James, et al.
Published: (2025)
by: Weichert, James, et al.
Published: (2025)
The Homogenization Problem in LLMs: Towards Meaningful Diversity in AI Safety
by: Rios-Sialer, Ian
Published: (2026)
by: Rios-Sialer, Ian
Published: (2026)
Beyond Simulations: What 20,000 Real Conversations Reveal About Mental Health AI Safety
by: Stamatis, Caitlin A., et al.
Published: (2026)
by: Stamatis, Caitlin A., et al.
Published: (2026)
When Does Regulation by Insurance Work? The Case of Frontier AI
by: Trout, Cristian
Published: (2025)
by: Trout, Cristian
Published: (2025)
Certified Safe: A Schematic for Approval Regulation of Frontier AI
by: Salvador, Cole
Published: (2024)
by: Salvador, Cole
Published: (2024)
From Principles to Rules: A Regulatory Approach for Frontier AI
by: Schuett, Jonas, et al.
Published: (2024)
by: Schuett, Jonas, et al.
Published: (2024)
Governing AI Beyond the Pretraining Frontier
by: Caputo, Nicholas A.
Published: (2025)
by: Caputo, Nicholas A.
Published: (2025)
Responsible Reporting for Frontier AI Development
by: Kolt, Noam, et al.
Published: (2024)
by: Kolt, Noam, et al.
Published: (2024)
Toward Inclusive Educational AI: Auditing Frontier LLMs through a Multiplexity Lens
by: Mushtaq, Abdullah, et al.
Published: (2025)
by: Mushtaq, Abdullah, et al.
Published: (2025)
Similar Items
-
Assurance of Frontier AI Built for National Security
by: Pistillo, Matteo, et al.
Published: (2025) -
Internal Deployment in the AI Act
by: Pistillo, Matteo
Published: (2025) -
Defending Compute Thresholds Against Legal Loopholes
by: Pistillo, Matteo, et al.
Published: (2025) -
Pre-Deployment Information Sharing: A Zoning Taxonomy for Precursory Capabilities
by: Pistillo, Matteo, et al.
Published: (2024) -
Backchaining Loss of Control Mitigations from Mission-Specific Benchmarks in National Security
by: Pistillo, Matteo, et al.
Published: (2026)