| Main Authors: | Tlaie, Alejandro, Farrell, Jimmy |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.07496 |
Similar Items
Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks
by: Tlaie, Alejandro
Published: (2024)
Expanding External Access To Frontier AI Models For Dangerous Capability Evaluations
by: Charnock, Jacob, et al.
Published: (2026)
GPAI Evaluations Standards Taskforce: Towards Effective AI Governance
by: Paskov, Patricia, et al.
Published: (2024)
A Blueprint for an EU Ecosystem of Secure, Deep and External AI Audits
by: Tlaie, Alejandro
Published: (2025)
Preliminary suggestions for rigorous GPAI model evaluations
by: Paskov, Patricia, et al.
Published: (2025)
Mapping Industry Practices to the EU AI Act's GPAI Code of Practice Safety and Security Measures
by: Stelling, Lily, et al.
Published: (2025)
Assessing confidence in frontier AI safety cases
by: Barrett, Stephen, et al.
Published: (2025)
Exploring and steering the moral compass of Large Language Models
by: Tlaie, Alejandro
Published: (2024)
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models
by: Barrett, Anthony M., et al.
Published: (2025)
Enabling Responsible, Secure and Sustainable Healthcare AI - A Strategic Framework for Clinical and Operational Impact
by: Joseph, Jimmy
Published: (2025)
A Methodology for Quantitative AI Risk Modeling
by: Murray, Malcolm, et al.
Published: (2025)
Quality Assessment of Public Summary of Training Content for GPAI models required by AI Act Article 53(1)(d)
by: Blankvoort, Dick A. H., et al.
Published: (2026)
The Role of Risk Modeling in Advanced AI Risk Management
by: Touzet, Chloé, et al.
Published: (2025)
External Evaluation of Discrimination Mitigation Efforts in Meta's Ad Delivery
by: Imana, Basileal, et al.
Published: (2025)
Federated learning, ethics, and the double black box problem in medical AI
by: Hatherley, Joshua, et al.
Published: (2025)
An External Fairness Evaluation of LinkedIn Talent Search
by: Behzad, Tina, et al.
Published: (2025)
Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse
by: Barrett, Steve, et al.
Published: (2025)
Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users
by: Kempermann, Manon, et al.
Published: (2025)
Descriptions of women are longer than that of men: An analysis of gender portrayal prompts in Stable Diffusion
by: Asadchy, Yan, et al.
Published: (2024)
Creating and Evaluating Privacy and Security Micro-Lessons for Elementary School Children
by: Gao, Lan, et al.
Published: (2025)
FORTRESS: Frontier Risk Evaluation for National Security and Public Safety
by: Knight, Christina Q., et al.
Published: (2025)
Assessing the Impact of External and Internal Factors on Emergency Department Overcrowding
by: Ahmed, Abdulaziz, et al.
Published: (2025)
Lessons from External Review of DeepMind's Scheming Inability Safety Case
by: Barrett, Stephen, et al.
Published: (2026)
From Replacement to Orchestration: A Socio-Technical Architecture for Agentic AI in Corporate R&D
by: Boussaid, Haithem, et al.
Published: (2026)
European Football Player Valuation: Integrating Financial Models and Network Theory
by: Cohen, Albert, et al.
Published: (2023)
When Can We Trust LLMs in Mental Health? Large-Scale Benchmarks for Reliable LLM Evaluation
by: Badawi, Abeer, et al.
Published: (2025)
Evaluating Organization Security: User Stories of European Union NIS2 Directive
by: Seeba, Mari, et al.
Published: (2025)
Rethinking Optimization: A Systems-Based Approach to Social Externalities
by: Nokhiz, Pegah, et al.
Published: (2025)
Intimacy as Service, Harm as Externality: Critical Perspectives on AI Companion Platform Accountability
by: Eom, Dayeon, et al.
Published: (2026)
Small Models Achieve Large Language Model Performance: Evaluating Reasoning-Enabled AI for Secure Child Welfare Research
by: Qi, Zia, et al.
Published: (2025)
Purer than pure: how purity reshapes the upstream materiality of the semiconductor industry
by: Roussilhe, Gauthier, et al.
Published: (2025)
Secure On-Premise Deployment of Open-Weights Large Language Models in Radiology: An Isolation-First Architecture with Prospective Pilot Evaluation
by: Nowak, Sebastian, et al.
Published: (2026)
Exploring the Adversarial Robustness of Face Forgery Detection with Decision-based Black-box Attacks
by: Chen, Zhaoyu, et al.
Published: (2023)
A New Exploration into Chinese Characters: from Simplification to Deeper Understanding
by: Gong, Wen G.
Published: (2025)
MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
by: Chiu, Yu Ying, et al.
Published: (2025)
Online search is more likely to lead students to validate true news than to refute false ones
by: Bouleimen, Azza, et al.
Published: (2023)
Assurance of Frontier AI Built for National Security
by: Pistillo, Matteo, et al.
Published: (2025)
Large Language Models Leverage External Knowledge to Extend Clinical Insight Beyond Language Boundaries
by: Wu, Jiageng, et al.
Published: (2023)
Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge
by: Usmanova, Aida, et al.
Published: (2024)
More than Carbon: Cradle-to-Grave environmental impacts of GenAI training on the Nvidia A100 GPU
by: Falk, Sophia, et al.
Published: (2025)