Tabla de Contenidos: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autor principal:	Gavaza, Victoria
Formato:	Recurso digital
Lenguaje:	inglés
Publicado:	Zenodo 2026
Materias:	AI Governance Human Judgement Language Mediated Decision Making Professional Responsibility Institutional Accountability AI Safety Decision Support Systems Risk Classification Human-AI Interaction Model Evaluation Interpretability Chain of thought monitoring Evaluation Limits
Acceso en línea:	https://doi.org/10.5281/zenodo.18475763
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Tabla de Contenidos:

<p><strong>AI laboratories invest substantial effort in evaluation, benchmarking, and safety testing to ensure system reliability and mitigate harm. These practices predominantly assess model behaviour through outputs, task performance, and controlled test conditions. This paper identifies a structural limitation of such approaches: AI-mediated judgment formation can introduce governance-relevant risk even when models pass established evaluations. Drawing on AI safety research, evaluation science, and human–AI interaction literature, the paper demonstrates that laboratory sufficiency does not imply governance sufficiency. The resulting gap is not a failure of scientific diligence, but a boundary mismatch between evaluation objects and real-world judgment-shaping interaction.</strong></p>

Ejemplares similares