Saved in:
| Main Author: | Mazzu, James M. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.12963 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Supertrust foundational alignment: mutual trust must replace permanent control for safe superintelligence
by: Mazzu, James M.
Published: (2024)
by: Mazzu, James M.
Published: (2024)
Factorizing formal contexts from closures of necessity operators
by: Aragón, Roberto G., et al.
Published: (2026)
by: Aragón, Roberto G., et al.
Published: (2026)
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
by: Michaelov, James A., et al.
Published: (2025)
by: Michaelov, James A., et al.
Published: (2025)
Quantifying intrinsic causal contributions via structure preserving interventions
by: Janzing, Dominik, et al.
Published: (2020)
by: Janzing, Dominik, et al.
Published: (2020)
Kallini et al. (2024) do not compare impossible languages with constituency-based ones
by: Hunter, Tim
Published: (2024)
by: Hunter, Tim
Published: (2024)
A note on the impossibility of conditional PAC-efficient reasoning in large language models
by: Zeng, Hao
Published: (2025)
by: Zeng, Hao
Published: (2025)
Evaluating whether AI models would sabotage AI safety research
by: Kirk, Robert, et al.
Published: (2026)
by: Kirk, Robert, et al.
Published: (2026)
Landscape of AI safety concerns -- A methodology to support safety assurance for AI-based autonomous systems
by: Schnitzer, Ronald, et al.
Published: (2024)
by: Schnitzer, Ronald, et al.
Published: (2024)
Comprehensive AI governance requires addressing non-model gains
by: Goemans, Arthur, et al.
Published: (2026)
by: Goemans, Arthur, et al.
Published: (2026)
Retrying vs Resampling in AI Control
by: Lucassen, James, et al.
Published: (2026)
by: Lucassen, James, et al.
Published: (2026)
A cross-regional review of AI safety regulations in the commercial aviation
by: Barr, Penny A., et al.
Published: (2025)
by: Barr, Penny A., et al.
Published: (2025)
Playing games with knowledge: AI-Induced delusions need game theoretic interventions
by: Beaumaster, Will, et al.
Published: (2026)
by: Beaumaster, Will, et al.
Published: (2026)
Towards evaluations-based safety cases for AI scheming
by: Balesni, Mikita, et al.
Published: (2024)
by: Balesni, Mikita, et al.
Published: (2024)
Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances
by: Da Pelo, Matteo, et al.
Published: (2026)
by: Da Pelo, Matteo, et al.
Published: (2026)
A sketch of an AI control safety case
by: Korbak, Tomek, et al.
Published: (2025)
by: Korbak, Tomek, et al.
Published: (2025)
Affirmative safety: An approach to risk management for high-risk AI
by: Wasil, Akash R., et al.
Published: (2024)
by: Wasil, Akash R., et al.
Published: (2024)
Towards provable probabilistic safety for scalable embodied AI systems
by: He, Linxuan, et al.
Published: (2025)
by: He, Linxuan, et al.
Published: (2025)
Fragilidade na formação dos profissionais de saúde quanto à Língua Brasileira de Sinais: reflexo na atenção à saúde dos surdos
by: Thiago Mazzu-Nascimento
Published: (2020)
by: Thiago Mazzu-Nascimento
Published: (2020)
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
by: Wu, Kevin, et al.
Published: (2024)
by: Wu, Kevin, et al.
Published: (2024)
The impact of intrinsic rewards on exploration in Reinforcement Learning
by: Kayal, Aya, et al.
Published: (2025)
by: Kayal, Aya, et al.
Published: (2025)
Sustainable AI Processing at the Edge
by: Ollivier, Sébastien, et al.
Published: (2022)
by: Ollivier, Sébastien, et al.
Published: (2022)
"Just a strange pic": Evaluating 'safety' in GenAI Image safety annotation tasks from diverse annotators' perspectives
by: Wang, Ding, et al.
Published: (2025)
by: Wang, Ding, et al.
Published: (2025)
Position: Ensuring mutual privacy is necessary for effective external evaluation of proprietary AI systems
by: Bucknall, Ben, et al.
Published: (2025)
by: Bucknall, Ben, et al.
Published: (2025)
On the Sustainability of AI Inferences in the Edge
by: Sobhani, Ghazal, et al.
Published: (2025)
by: Sobhani, Ghazal, et al.
Published: (2025)
Efficiency Will Not Lead to Sustainable Reasoning AI
by: Wiesner, Philipp, et al.
Published: (2025)
by: Wiesner, Philipp, et al.
Published: (2025)
AI Sustainability in Practice Part One: Foundations for Sustainable AI Projects
by: Leslie, David, et al.
Published: (2024)
by: Leslie, David, et al.
Published: (2024)
AI Sustainability in Practice Part Two: Sustainability Throughout the AI Workflow
by: Leslie, David, et al.
Published: (2024)
by: Leslie, David, et al.
Published: (2024)
Super Co-alignment of Human and AI for Sustainable Symbiotic Society
by: Zeng, Yi, et al.
Published: (2025)
by: Zeng, Yi, et al.
Published: (2025)
Strategizing against Q-learners: A Control-theoretical Approach
by: Arslantas, Yuksel, et al.
Published: (2024)
by: Arslantas, Yuksel, et al.
Published: (2024)
Quality Assessment of Public Summary of Training Content for GPAI models required by AI Act Article 53(1)(d)
by: Blankvoort, Dick A. H., et al.
Published: (2026)
by: Blankvoort, Dick A. H., et al.
Published: (2026)
The Environmental Impact of AI Servers and Sustainable Solutions
by: Patel, Aadi, et al.
Published: (2025)
by: Patel, Aadi, et al.
Published: (2025)
A theoretical guarantee for SyncRank
by: Rao, Yang
Published: (2025)
by: Rao, Yang
Published: (2025)
AI Application in Anti-Money Laundering for Sustainable and Transparent Financial Systems
by: Nie, Chuanhao, et al.
Published: (2025)
by: Nie, Chuanhao, et al.
Published: (2025)
SustainDC: Benchmarking for Sustainable Data Center Control
by: Naug, Avisek, et al.
Published: (2024)
by: Naug, Avisek, et al.
Published: (2024)
The 2025 OpenAI Preparedness Framework does not guarantee any AI risk mitigation practices: a proof-of-concept for affordance analyses of AI safety policies
by: Coggins, Sam, et al.
Published: (2025)
by: Coggins, Sam, et al.
Published: (2025)
Information-theoretic analysis of world models in optimal reward maximizers
by: Harwood, Alfred, et al.
Published: (2026)
by: Harwood, Alfred, et al.
Published: (2026)
An alignment safety case sketch based on debate
by: Buhl, Marie Davidsen, et al.
Published: (2025)
by: Buhl, Marie Davidsen, et al.
Published: (2025)
The potential functions of an international institution for AI safety. Insights from adjacent policy areas and recent trends
by: De Castris, A. Leone, et al.
Published: (2024)
by: De Castris, A. Leone, et al.
Published: (2024)
The receptron is a nonlinear threshold logic gate with intrinsic multi-dimensional selective capabilities for analog inputs
by: Paroli, B., et al.
Published: (2025)
by: Paroli, B., et al.
Published: (2025)
Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably
by: Kang, Enoch Hyunwook
Published: (2026)
by: Kang, Enoch Hyunwook
Published: (2026)
Similar Items
-
Supertrust foundational alignment: mutual trust must replace permanent control for safe superintelligence
by: Mazzu, James M.
Published: (2024) -
Factorizing formal contexts from closures of necessity operators
by: Aragón, Roberto G., et al.
Published: (2026) -
Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable events
by: Michaelov, James A., et al.
Published: (2025) -
Quantifying intrinsic causal contributions via structure preserving interventions
by: Janzing, Dominik, et al.
Published: (2020) -
Kallini et al. (2024) do not compare impossible languages with constituency-based ones
by: Hunter, Tim
Published: (2024)