:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Mazzu, James M.
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.12963
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Supertrust foundational alignment: mutual trust must replace permanent control for safe superintelligence
by: Mazzu, James M.
Published: (2024)

Factorizing formal contexts from closures of necessity operators
by: Aragón, Roberto G., et al.
Published: (2026)

Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
by: Michaelov, James A., et al.
Published: (2025)

Quantifying intrinsic causal contributions via structure preserving interventions
by: Janzing, Dominik, et al.
Published: (2020)

Kallini et al. (2024) do not compare impossible languages with constituency-based ones
by: Hunter, Tim
Published: (2024)

A note on the impossibility of conditional PAC-efficient reasoning in large language models
by: Zeng, Hao
Published: (2025)

Evaluating whether AI models would sabotage AI safety research
by: Kirk, Robert, et al.
Published: (2026)

Landscape of AI safety concerns -- A methodology to support safety assurance for AI-based autonomous systems
by: Schnitzer, Ronald, et al.
Published: (2024)

Comprehensive AI governance requires addressing non-model gains
by: Goemans, Arthur, et al.
Published: (2026)

Retrying vs Resampling in AI Control
by: Lucassen, James, et al.
Published: (2026)

A cross-regional review of AI safety regulations in the commercial aviation
by: Barr, Penny A., et al.
Published: (2025)

Playing games with knowledge: AI-Induced delusions need game theoretic interventions
by: Beaumaster, Will, et al.
Published: (2026)

Towards evaluations-based safety cases for AI scheming
by: Balesni, Mikita, et al.
Published: (2024)

Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances
by: Da Pelo, Matteo, et al.
Published: (2026)

A sketch of an AI control safety case
by: Korbak, Tomek, et al.
Published: (2025)

Affirmative safety: An approach to risk management for high-risk AI
by: Wasil, Akash R., et al.
Published: (2024)

Towards provable probabilistic safety for scalable embodied AI systems
by: He, Linxuan, et al.
Published: (2025)

Fragilidade na formação dos profissionais de saúde quanto à Língua Brasileira de Sinais: reflexo na atenção à saúde dos surdos
by: Thiago Mazzu-Nascimento
Published: (2020)

ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
by: Wu, Kevin, et al.
Published: (2024)

The impact of intrinsic rewards on exploration in Reinforcement Learning
by: Kayal, Aya, et al.
Published: (2025)

Sustainable AI Processing at the Edge
by: Ollivier, Sébastien, et al.
Published: (2022)

"Just a strange pic": Evaluating 'safety' in GenAI Image safety annotation tasks from diverse annotators' perspectives
by: Wang, Ding, et al.
Published: (2025)

Position: Ensuring mutual privacy is necessary for effective external evaluation of proprietary AI systems
by: Bucknall, Ben, et al.
Published: (2025)

On the Sustainability of AI Inferences in the Edge
by: Sobhani, Ghazal, et al.
Published: (2025)

Efficiency Will Not Lead to Sustainable Reasoning AI
by: Wiesner, Philipp, et al.
Published: (2025)

AI Sustainability in Practice Part One: Foundations for Sustainable AI Projects
by: Leslie, David, et al.
Published: (2024)

AI Sustainability in Practice Part Two: Sustainability Throughout the AI Workflow
by: Leslie, David, et al.
Published: (2024)

Super Co-alignment of Human and AI for Sustainable Symbiotic Society
by: Zeng, Yi, et al.
Published: (2025)

Strategizing against Q-learners: A Control-theoretical Approach
by: Arslantas, Yuksel, et al.
Published: (2024)

Quality Assessment of Public Summary of Training Content for GPAI models required by AI Act Article 53(1)(d)
by: Blankvoort, Dick A. H., et al.
Published: (2026)

The Environmental Impact of AI Servers and Sustainable Solutions
by: Patel, Aadi, et al.
Published: (2025)

A theoretical guarantee for SyncRank
by: Rao, Yang
Published: (2025)

AI Application in Anti-Money Laundering for Sustainable and Transparent Financial Systems
by: Nie, Chuanhao, et al.
Published: (2025)

SustainDC: Benchmarking for Sustainable Data Center Control
by: Naug, Avisek, et al.
Published: (2024)

The 2025 OpenAI Preparedness Framework does not guarantee any AI risk mitigation practices: a proof-of-concept for affordance analyses of AI safety policies
by: Coggins, Sam, et al.
Published: (2025)

Information-theoretic analysis of world models in optimal reward maximizers
by: Harwood, Alfred, et al.
Published: (2026)

An alignment safety case sketch based on debate
by: Buhl, Marie Davidsen, et al.
Published: (2025)

The potential functions of an international institution for AI safety. Insights from adjacent policy areas and recent trends
by: De Castris, A. Leone, et al.
Published: (2024)

The receptron is a nonlinear threshold logic gate with intrinsic multi-dimensional selective capabilities for analog inputs
by: Paroli, B., et al.
Published: (2025)

Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably
by: Kang, Enoch Hyunwook
Published: (2026)