Saved in:
| Main Authors: | Campos, Simeon, Papadatos, Henry, Roger, Fabien, Touzet, Chloé, Quarks, Otter, Murray, Malcolm |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.06656 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mapping AI Benchmark Data to Quantitative Risk Estimates Through Expert Elicitation
by: Murray, Malcolm, et al.
Published: (2025)
by: Murray, Malcolm, et al.
Published: (2025)
The Role of Risk Modeling in Advanced AI Risk Management
by: Touzet, Chloé, et al.
Published: (2025)
by: Touzet, Chloé, et al.
Published: (2025)
A Methodology for Quantitative AI Risk Modeling
by: Murray, Malcolm, et al.
Published: (2025)
by: Murray, Malcolm, et al.
Published: (2025)
Evaluating AI Providers' Frontier Safety Frameworks
by: Stelling, Lily, et al.
Published: (2025)
by: Stelling, Lily, et al.
Published: (2025)
Open Problems in Frontier AI Risk Management
by: Ziosi, Marta, et al.
Published: (2026)
by: Ziosi, Marta, et al.
Published: (2026)
Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse
by: Barrett, Steve, et al.
Published: (2025)
by: Barrett, Steve, et al.
Published: (2025)
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
by: Lab, Shanghai AI, et al.
Published: (2025)
by: Lab, Shanghai AI, et al.
Published: (2025)
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
by: Liu, Dongrui, et al.
Published: (2026)
by: Liu, Dongrui, et al.
Published: (2026)
The Unified Control Framework: Establishing a Common Foundation for Enterprise AI Governance, Risk Management and Regulatory Compliance
by: Eisenberg, Ian W., et al.
Published: (2025)
by: Eisenberg, Ian W., et al.
Published: (2025)
Linear Probe Penalties Reduce LLM Sycophancy
by: Papadatos, Henry, et al.
Published: (2024)
by: Papadatos, Henry, et al.
Published: (2024)
Evaluating the Goal-Directedness of Large Language Models
by: Everitt, Tom, et al.
Published: (2025)
by: Everitt, Tom, et al.
Published: (2025)
Bridging the Communication Gap: Evaluating AI Labeling Practices for Trustworthy AI Development
by: Fischer, Raphael, et al.
Published: (2025)
by: Fischer, Raphael, et al.
Published: (2025)
Emerging Practices in Frontier AI Safety Frameworks
by: Buhl, Marie Davidsen, et al.
Published: (2025)
by: Buhl, Marie Davidsen, et al.
Published: (2025)
Between Innovation and Oversight: A Cross-Regional Study of AI Risk Management Frameworks in the EU, U.S., UK, and China
by: Al-Maamari, Amir
Published: (2025)
by: Al-Maamari, Amir
Published: (2025)
Application of the NIST AI Risk Management Framework to Surveillance Technology
by: Swaminathan, Nandhini, et al.
Published: (2024)
by: Swaminathan, Nandhini, et al.
Published: (2024)
Societal Capacity Assessment Framework: Measuring Resilience to Inform Advanced AI Risk Management
by: Gandhi, Milan, et al.
Published: (2025)
by: Gandhi, Milan, et al.
Published: (2025)
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
by: Reuel, Anka, et al.
Published: (2024)
by: Reuel, Anka, et al.
Published: (2024)
AI Risk Management Should Incorporate Both Safety and Security
by: Qi, Xiangyu, et al.
Published: (2024)
by: Qi, Xiangyu, et al.
Published: (2024)
Bridging the Data Gap in AI Reliability Research and Establishing DR-AIR, a Comprehensive Data Repository for AI Reliability
by: Zheng, Simin, et al.
Published: (2025)
by: Zheng, Simin, et al.
Published: (2025)
Risk Sources and Risk Management Measures in Support of Standards for General-Purpose AI Systems
by: Gipiškis, Rokas, et al.
Published: (2024)
by: Gipiškis, Rokas, et al.
Published: (2024)
Quantifying Trust: Financial Risk Management for Trustworthy AI Agents
by: Hua, Wenyue, et al.
Published: (2026)
by: Hua, Wenyue, et al.
Published: (2026)
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models
by: Barrett, Anthony M., et al.
Published: (2025)
by: Barrett, Anthony M., et al.
Published: (2025)
Bridging the Trust Gap: Clinician-Validated Hybrid Explainable AI for Maternal Health Risk Assessment in Bangladesh
by: Yesmin, Farjana, et al.
Published: (2026)
by: Yesmin, Farjana, et al.
Published: (2026)
Integrating AI's Carbon Footprint into Risk Management Frameworks: Strategies and Tools for Sustainable Compliance in Banking Sector
by: Tkachenko, Nataliya
Published: (2024)
by: Tkachenko, Nataliya
Published: (2024)
Beyond the Data Mesh Illusion: Designing Modern AI-augmented Lakehouses to Bridge the Gap Between Theory and Practice
by: Angélil, Oliver, et al.
Published: (2026)
by: Angélil, Oliver, et al.
Published: (2026)
Identifying the Supply Chain of AI for Trustworthiness and Risk Management in Critical Applications
by: Sheh, Raymond K., et al.
Published: (2025)
by: Sheh, Raymond K., et al.
Published: (2025)
ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI
by: Tong, Haibo, et al.
Published: (2026)
by: Tong, Haibo, et al.
Published: (2026)
All Code, No Thought: Current Language Models Struggle to Reason in Ciphered Language
by: Guo, Shiyuan, et al.
Published: (2025)
by: Guo, Shiyuan, et al.
Published: (2025)
Mind the Gap: Bridging the Divide Between AI Aspirations and the Reality of Autonomous Characterization
by: Guinan, Grace, et al.
Published: (2025)
by: Guinan, Grace, et al.
Published: (2025)
Dark Speculation: Combining Qualitative and Quantitative Understanding in Frontier AI Risk Analysis
by: Carpenter, Daniel, et al.
Published: (2025)
by: Carpenter, Daniel, et al.
Published: (2025)
Mapping Industry Practices to the EU AI Act's GPAI Code of Practice Safety and Security Measures
by: Stelling, Lily, et al.
Published: (2025)
by: Stelling, Lily, et al.
Published: (2025)
Perception Gaps in Risk, Benefit, and Value Between Experts and Public Challenge Socially Accepted AI
by: Brauner, Philipp, et al.
Published: (2024)
by: Brauner, Philipp, et al.
Published: (2024)
Bridging the Gap in the Responsible AI Divides
by: Gyevnár, Bálint, et al.
Published: (2026)
by: Gyevnár, Bálint, et al.
Published: (2026)
AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation
by: Li, Changyi, et al.
Published: (2026)
by: Li, Changyi, et al.
Published: (2026)
Current state of LLM Risks and AI Guardrails
by: Ayyamperumal, Suriya Ganesh, et al.
Published: (2024)
by: Ayyamperumal, Suriya Ganesh, et al.
Published: (2024)
Bridging the AI Trustworthiness Gap between Functions and Norms
by: Di Scala, Daan, et al.
Published: (2025)
by: Di Scala, Daan, et al.
Published: (2025)
Medchain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence
by: Liu, Jie, et al.
Published: (2024)
by: Liu, Jie, et al.
Published: (2024)
Bridging the Gap Between Scientific Laws Derived by AI Systems and Canonical Knowledge via Abductive Inference with AI-Noether
by: Srivastava, Karan, et al.
Published: (2025)
by: Srivastava, Karan, et al.
Published: (2025)
Bridging the Gap: Representation Spaces in Neuro-Symbolic AI
by: Zhang, Xin, et al.
Published: (2024)
by: Zhang, Xin, et al.
Published: (2024)
RISK: A Framework for GUI Agents in E-commerce Risk Management
by: Chen, Renqi, et al.
Published: (2025)
by: Chen, Renqi, et al.
Published: (2025)
Similar Items
-
Mapping AI Benchmark Data to Quantitative Risk Estimates Through Expert Elicitation
by: Murray, Malcolm, et al.
Published: (2025) -
The Role of Risk Modeling in Advanced AI Risk Management
by: Touzet, Chloé, et al.
Published: (2025) -
A Methodology for Quantitative AI Risk Modeling
by: Murray, Malcolm, et al.
Published: (2025) -
Evaluating AI Providers' Frontier Safety Frameworks
by: Stelling, Lily, et al.
Published: (2025) -
Open Problems in Frontier AI Risk Management
by: Ziosi, Marta, et al.
Published: (2026)