Library Catalog

Bibliographic Details
Main Authors:	Tlaie, Alejandro; Farrell, Jimmy
Format:	Preprint
Published:	2025
Subjects:	Computers and Society
Online Access:	https://arxiv.org/abs/2503.07496

Similar Items

Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks
by: Tlaie, Alejandro
Published: (2024)

Expanding External Access To Frontier AI Models For Dangerous Capability Evaluations
by: Charnock, Jacob, et al.
Published: (2026)

GPAI Evaluations Standards Taskforce: Towards Effective AI Governance
by: Paskov, Patricia, et al.
Published: (2024)

A Blueprint for an EU Ecosystem of Secure, Deep and External AI Audits
by: Tlaie, Alejandro
Published: (2025)

Preliminary suggestions for rigorous GPAI model evaluations
by: Paskov, Patricia, et al.
Published: (2025)

Mapping Industry Practices to the EU AI Act's GPAI Code of Practice Safety and Security Measures
by: Stelling, Lily, et al.
Published: (2025)

Assessing confidence in frontier AI safety cases
by: Barrett, Stephen, et al.
Published: (2025)

Exploring and steering the moral compass of Large Language Models
by: Tlaie, Alejandro
Published: (2024)

AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models
by: Barrett, Anthony M., et al.
Published: (2025)

Enabling Responsible, Secure and Sustainable Healthcare AI - A Strategic Framework for Clinical and Operational Impact
by: Joseph, Jimmy
Published: (2025)

A Methodology for Quantitative AI Risk Modeling
by: Murray, Malcolm, et al.
Published: (2025)

Quality Assessment of Public Summary of Training Content for GPAI models required by AI Act Article 53(1)(d)
by: Blankvoort, Dick A. H., et al.
Published: (2026)

The Role of Risk Modeling in Advanced AI Risk Management
by: Touzet, Chloé, et al.
Published: (2025)

External Evaluation of Discrimination Mitigation Efforts in Meta's Ad Delivery
by: Imana, Basileal, et al.
Published: (2025)

Federated learning, ethics, and the double black box problem in medical AI
by: Hatherley, Joshua, et al.
Published: (2025)

An External Fairness Evaluation of LinkedIn Talent Search
by: Behzad, Tina, et al.
Published: (2025)

Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse
by: Barrett, Steve, et al.
Published: (2025)

Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users
by: Kempermann, Manon, et al.
Published: (2025)

Descriptions of women are longer than that of men: An analysis of gender portrayal prompts in Stable Diffusion
by: Asadchy, Yan, et al.
Published: (2024)

Creating and Evaluating Privacy and Security Micro-Lessons for Elementary School Children
by: Gao, Lan, et al.
Published: (2025)

FORTRESS: Frontier Risk Evaluation for National Security and Public Safety
by: Knight, Christina Q., et al.
Published: (2025)

Assessing the Impact of External and Internal Factors on Emergency Department Overcrowding
by: Ahmed, Abdulaziz, et al.
Published: (2025)

Lessons from External Review of DeepMind's Scheming Inability Safety Case
by: Barrett, Stephen, et al.
Published: (2026)

From Replacement to Orchestration: A Socio-Technical Architecture for Agentic AI in Corporate R&D
by: Boussaid, Haithem, et al.
Published: (2026)

European Football Player Valuation: Integrating Financial Models and Network Theory
by: Cohen, Albert, et al.
Published: (2023)

When Can We Trust LLMs in Mental Health? Large-Scale Benchmarks for Reliable LLM Evaluation
by: Badawi, Abeer, et al.
Published: (2025)

Evaluating Organization Security: User Stories of European Union NIS2 Directive
by: Seeba, Mari, et al.
Published: (2025)

Rethinking Optimization: A Systems-Based Approach to Social Externalities
by: Nokhiz, Pegah, et al.
Published: (2025)

Intimacy as Service, Harm as Externality: Critical Perspectives on AI Companion Platform Accountability
by: Eom, Dayeon, et al.
Published: (2026)

Small Models Achieve Large Language Model Performance: Evaluating Reasoning-Enabled AI for Secure Child Welfare Research
by: Qi, Zia, et al.
Published: (2025)

Purer than pure: how purity reshapes the upstream materiality of the semiconductor industry
by: Roussilhe, Gauthier, et al.
Published: (2025)

Secure On-Premise Deployment of Open-Weights Large Language Models in Radiology: An Isolation-First Architecture with Prospective Pilot Evaluation
by: Nowak, Sebastian, et al.
Published: (2026)

Exploring the Adversarial Robustness of Face Forgery Detection with Decision-based Black-box Attacks
by: Chen, Zhaoyu, et al.
Published: (2023)

A New Exploration into Chinese Characters: from Simplification to Deeper Understanding
by: Gong, Wen G.
Published: (2025)

MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
by: Chiu, Yu Ying, et al.
Published: (2025)

Online search is more likely to lead students to validate true news than to refute false ones
by: Bouleimen, Azza, et al.
Published: (2023)

Assurance of Frontier AI Built for National Security
by: Pistillo, Matteo, et al.
Published: (2025)

Large Language Models Leverage External Knowledge to Extend Clinical Insight Beyond Language Boundaries
by: Wu, Jiageng, et al.
Published: (2023)

Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge
by: Usmanova, Aida, et al.
Published: (2024)

More than Carbon: Cradle-to-Grave environmental impacts of GenAI training on the Nvidia A100 GPU
by: Falk, Sophia, et al.
Published: (2025)