| Main Author: | Hastings-Woodhouse, Sarah |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.21082 |
Similar Items
An Approach to Technical AGI Safety and Security
by: Shah, Rohin, et al.
Published: (2025)
Institutional AI: A Governance Framework for Distributional AGI Safety
by: Pierucci, Federico, et al.
Published: (2026)
AGI, Governments, and Free Societies
by: Bullock, Justin B., et al.
Published: (2025)
Europe and the Geopolitics of AGI: The Need for a Preparedness Plan
by: Negele, Maximilian, et al.
Published: (2026)
Several Issues Regarding Data Governance in AGI
by: Hatta, Masayuki
Published: (2025)
Token Taxes: mitigating AGI's economic risks
by: Irwin, Lucas, et al.
Published: (2026)
Unsocial Intelligence: an Investigation of the Assumptions of AGI Discourse
by: Blili-Hamelin, Borhane, et al.
Published: (2024)
Misalignment or misuse? The AGI alignment tradeoff
by: Hellrigel-Holderbaum, Max, et al.
Published: (2025)
Institutional Management of Information Technology: A Centralised Approach.
by: Dockerill, John
Published: (1987)
Stop treating `AGI' as the north-star goal of AI research
by: Blili-Hamelin, Borhane, et al.
Published: (2025)
The Trajectory of Romance Scams in the U.S
by: Herrera, LD, et al.
Published: (2024)
Policy myopia as a mechanism of gradual disempowerment in Post-AGI governance, Circa 2049
by: Sahoo, Subramanyam
Published: (2026)
High vs. Low AGI: Ontology and Conceptual Taxonomy for Geopolitical Coherence
by: Max, Antonio
Published: (2025)
Efficiency vs Demand in AI Electricity: Implications for Post-AGI Scaling
by: Kim, Doyi, et al.
Published: (2026)
Against racing to AGI: Cooperation, deterrence, and catastrophic risks
by: Dung, Leonard, et al.
Published: (2025)
The Invisibility Hypothesis: Promises of AGI and the Future of the Global South
by: López, L. Julián Lechuga, et al.
Published: (2026)
From Checklists to Clusters: A Homeostatic Account of AGI Evaluation
by: Reynolds, Brett
Published: (2025)
How Far Are We From AGI: Are LLMs All We Need?
by: Feng, Tao, et al.
Published: (2024)
Towards AI-$45^{\circ}$ Law: A Roadmap to Trustworthy AGI
by: Yang, Chao, et al.
Published: (2024)
Keep the Future Human: Why and How We Should Close the Gates to AGI and Superintelligence, and What We Should Build Instead
by: Aguirre, Anthony
Published: (2023)
Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock
by: Sornette, Didier, et al.
Published: (2026)
Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI
by: Bojic, Ljubisa, et al.
Published: (2025)
Safety Co-Option and Compromised National Security: The Self-Fulfilling Prophecy of Weakened AI Risk Thresholds
by: Khlaaf, Heidy, et al.
Published: (2025)
Dual-Use AI Face Swap Apps Are Mostly Unsafe: A Systematic Safety Audit
by: Daffalla, Alaa, et al.
Published: (2026)
Toward Secure and Compliant AI: Organizational Standards and Protocols for NLP Model Lifecycle Management
by: Arora, Sunil, et al.
Published: (2025)
Jolting Technologies: Superexponential Acceleration in AI Capabilities and Implications for AGI
by: Orban, David
Published: (2025)
The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems
by: Staufer, Leon, et al.
Published: (2026)
Safety First: Psychological Safety as the Key to AI Transformation
by: Reich, Aaron, et al.
Published: (2026)
How Should AI Safety Benchmarks Benchmark Safety?
by: Yu, Cheng, et al.
Published: (2026)
Securing Agentic AI Systems -- A Multilayer Security Framework
by: Arora, Sunil, et al.
Published: (2025)
Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
by: Bakker, Isabelle, et al.
Published: (2025)
The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks
by: Mullens, Drake, et al.
Published: (2026)
Data Augmentation via Diffusion Model to Enhance AI Fairness
by: Blow, Christina Hastings, et al.
Published: (2024)
Some Simple Economics of AGI
by: Catalini, Christian, et al.
Published: (2026)
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior
by: Li, Jing-Jing, et al.
Published: (2024)
AI Safety for Everyone
by: Gyevnar, Balint, et al.
Published: (2025)
The Role of AI Safety Institutes in Contributing to International Standards for Frontier AI Safety
by: Fort, Kristina
Published: (2024)
Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs
by: Gelman, Haywood, et al.
Published: (2025)
Safety cases for frontier AI
by: Buhl, Marie Davidsen, et al.
Published: (2024)
Auditing Agent Harness Safety
by: Liu, Chengzhi, et al.
Published: (2026)