Saved in:
| Main Authors: | Grey, Markov, Segerie, Charbel-Raphaël |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.13700 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Safety by Measurement: A Systematic Literature Review of AI Safety Evaluation Methods
by: Grey, Markov, et al.
Published: (2025)
by: Grey, Markov, et al.
Published: (2025)
AI Consciousness and Existential Risk
by: VanRullen, Rufin
Published: (2025)
by: VanRullen, Rufin
Published: (2025)
The bitter lesson of misuse detection
by: Mariaccia, Hadrien, et al.
Published: (2025)
by: Mariaccia, Hadrien, et al.
Published: (2025)
Is Power-Seeking AI an Existential Risk?
by: Carlsmith, Joseph
Published: (2022)
by: Carlsmith, Joseph
Published: (2022)
Humanity in the Age of AI: Reassessing 2025's Existential-Risk Narratives
by: Louadi, Mohamed El
Published: (2025)
by: Louadi, Mohamed El
Published: (2025)
When Autonomy Breaks: The Hidden Existential Risk of AI
by: Krook, Joshua
Published: (2025)
by: Krook, Joshua
Published: (2025)
BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards
by: Dorn, Diego, et al.
Published: (2024)
by: Dorn, Diego, et al.
Published: (2024)
Two Types of AI Existential Risk: Decisive and Accumulative
by: Kasirzadeh, Atoosa
Published: (2024)
by: Kasirzadeh, Atoosa
Published: (2024)
Technical Requirements for Halting Dangerous AI Activities
by: Barnett, Peter, et al.
Published: (2025)
by: Barnett, Peter, et al.
Published: (2025)
Why do Experts Disagree on Existential Risk and P(doom)? A Survey of AI Experts
by: Field, Severin
Published: (2025)
by: Field, Severin
Published: (2025)
Anchoring AI Capabilities in Market Valuations: The Capability Realization Rate Model and Valuation Misalignment Risk
by: Fang, Xinmin, et al.
Published: (2025)
by: Fang, Xinmin, et al.
Published: (2025)
Gender Bias of LLM in Economics: An Existentialism Perspective
by: Zhong, Hui, et al.
Published: (2024)
by: Zhong, Hui, et al.
Published: (2024)
Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors
by: Krook, Joshua
Published: (2025)
by: Krook, Joshua
Published: (2025)
A+AI: Threats to Society, Remedies, and Governance
by: Byrd, Don
Published: (2024)
by: Byrd, Don
Published: (2024)
The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement
by: Markov, Igor L.
Published: (2023)
by: Markov, Igor L.
Published: (2023)
A Capability Approach to AI Ethics
by: Ratti, Emanuele, et al.
Published: (2025)
by: Ratti, Emanuele, et al.
Published: (2025)
Misrepresented Technological Solutions in Imagined Futures: The Origins and Dangers of AI Hype in the Research Community
by: Thais, Savannah
Published: (2024)
by: Thais, Savannah
Published: (2024)
Ken Utilization Layer: Hebbian Replay Within a Student's Ken for Adaptive Exercise Recommendation
by: Kuling, Grey, et al.
Published: (2025)
by: Kuling, Grey, et al.
Published: (2025)
The AI Pyramid A Conceptual Framework for Workforce Capability in the Age of AI
by: Khatri, Alok, et al.
Published: (2026)
by: Khatri, Alok, et al.
Published: (2026)
Existential Conversations with Large Language Models: Content, Community, and Culture
by: Shanahan, Murray, et al.
Published: (2024)
by: Shanahan, Murray, et al.
Published: (2024)
AI-Driven Cyber Threat Intelligence Automation
by: Shah, Shrit, et al.
Published: (2024)
by: Shah, Shrit, et al.
Published: (2024)
Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs
by: Said, Muhammad Abdullahi, et al.
Published: (2025)
by: Said, Muhammad Abdullahi, et al.
Published: (2025)
Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure
by: Dubber, Timothy, et al.
Published: (2025)
by: Dubber, Timothy, et al.
Published: (2025)
Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
by: Caputo, Nicholas
Published: (2026)
by: Caputo, Nicholas
Published: (2026)
Unveiling AI's Threats to Child Protection: Regulatory efforts to Criminalize AI-Generated CSAM and Emerging Children's Rights Violations
by: Kokolaki, Emmanouela, et al.
Published: (2025)
by: Kokolaki, Emmanouela, et al.
Published: (2025)
Assessing High-Risk AI Systems under the EU AI Act: From Legal Requirements to Technical Verification
by: Buscemi, Alessio, et al.
Published: (2025)
by: Buscemi, Alessio, et al.
Published: (2025)
Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)
by: Curry, Amanda Cercas, et al.
Published: (2024)
AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies
by: Zeng, Yi, et al.
Published: (2024)
by: Zeng, Yi, et al.
Published: (2024)
The Manipulation Problem: Conversational AI as a Threat to Epistemic Agency
by: Rosenberg, Louis
Published: (2023)
by: Rosenberg, Louis
Published: (2023)
AI-Educational Development Loop (AI-EDL): A Conceptual Framework to Bridge AI Capabilities with Classical Educational Theories
by: Yu, Ning, et al.
Published: (2025)
by: Yu, Ning, et al.
Published: (2025)
Mapping AI Risk Mitigations: Evidence Scan and Preliminary AI Risk Mitigation Taxonomy
by: Saeri, Alexander K., et al.
Published: (2025)
by: Saeri, Alexander K., et al.
Published: (2025)
Governable AI: Provable Safety Under Extreme Threat Models
by: Wang, Donglin, et al.
Published: (2025)
by: Wang, Donglin, et al.
Published: (2025)
The Hermeneutic Turn of AI: Are Machines Capable of Interpreting?
by: Demichelis, Remy
Published: (2024)
by: Demichelis, Remy
Published: (2024)
Exploring AI Capabilities in Participatory Budgeting within Smart Cities: The Case of Sao Paulo
by: Sousa, Italo Alberto, et al.
Published: (2025)
by: Sousa, Italo Alberto, et al.
Published: (2025)
PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm
by: Li, Jing-Jing, et al.
Published: (2026)
by: Li, Jing-Jing, et al.
Published: (2026)
Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment
by: Torkestani, Mohammad Saleh, et al.
Published: (2025)
by: Torkestani, Mohammad Saleh, et al.
Published: (2025)
AI Ethics and Social Norms: Exploring ChatGPT's Capabilities From What to How
by: Veisi, Omid, et al.
Published: (2025)
by: Veisi, Omid, et al.
Published: (2025)
Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation
by: Gringras, David, et al.
Published: (2026)
by: Gringras, David, et al.
Published: (2026)
AI Art is Theft: Labour, Extraction, and Exploitation, Or, On the Dangers of Stochastic Pollocks
by: Goetze, Trystan S.
Published: (2024)
by: Goetze, Trystan S.
Published: (2024)
The Biggest Risk of Embodied AI is Governance Lag
by: Liu, Shaoshan
Published: (2026)
by: Liu, Shaoshan
Published: (2026)
Similar Items
-
Safety by Measurement: A Systematic Literature Review of AI Safety Evaluation Methods
by: Grey, Markov, et al.
Published: (2025) -
AI Consciousness and Existential Risk
by: VanRullen, Rufin
Published: (2025) -
The bitter lesson of misuse detection
by: Mariaccia, Hadrien, et al.
Published: (2025) -
Is Power-Seeking AI an Existential Risk?
by: Carlsmith, Joseph
Published: (2022) -
Humanity in the Age of AI: Reassessing 2025's Existential-Risk Narratives
by: Louadi, Mohamed El
Published: (2025)