Library Catalog

Bibliographic Details
Main Author:	Vallstrom, Daniel
Format:	Preprint
Published:	2024
Subjects:	Physics and Society; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.03685

Similar Items

Measuring What AI Systems Might Do: Towards A Measurement Science in AI
by: Voudouris, Konstantinos, et al.
Published: (2026)

A mathematical theory of evolution for self-designing AIs
by: Harris, Kenneth D
Published: (2026)

Bringing AI Participation Down to Scale: A Comment on Open AI's Democratic Inputs to AI Project
by: Moats, David, et al.
Published: (2024)

Compression, The Fermi Paradox and Artificial Super-Intelligence
by: Bennett, Michael Timothy
Published: (2021)

Q-Learning-Driven Adaptive Rewiring for Cooperative Control in Heterogeneous Networks
by: Weng, Yi-Ning, et al.
Published: (2025)

The Anatomy Spread of Online Opinion Polarization: The Pivotal Role of Super-Spreaders in Social Networks
by: Kawahata, Yasuko
Published: (2023)

On the Diminishing Returns of Width for Continual Learning
by: Guha, Etash, et al.
Published: (2024)

Note: Evolutionary Game Theory Focus Informational Health: The Cocktail Party Effect Through Werewolfgame under Incomplete Information and ESS Search Method Using Expected Gains of Repeated Dilemmas
by: Kawahata, Yasuko
Published: (2024)

Runtime Monitoring and Enforcement of Conditional Fairness in Generative AIs
by: Cheng, Chih-Hong, et al.
Published: (2024)

The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)

The AI Double Standard: Humans Judge All AIs for the Actions of One
by: Manoli, Aikaterina, et al.
Published: (2024)

What AIs are not Learning (and Why)
by: Stefik, Mark
Published: (2024)

Where AI Assurance Might Go Wrong: Initial lessons from engineering of critical systems
by: Bloomfield, Robin, et al.
Published: (2025)

How to Count AIs: Individuation and Liability for AI Agents
by: Arbel, Yonathan, et al.
Published: (2026)

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
by: Mazeika, Mantas, et al.
Published: (2025)

Decentralized Traffic Flow Optimization Through Intrinsic Motivation
by: Papala, Himaja, et al.
Published: (2025)

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)

The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)

The Psychology of Learning from Machines: Anthropomorphic AI and the Paradox of Automation in Education
by: Qadir, Junaid, et al.
Published: (2026)

ShaRP: Explaining Rankings and Preferences with Shapley Values
by: Pliatsika, Venetia, et al.
Published: (2024)

Beyond Ethics: How Inclusive Innovation Drives Economic Returns in Medical AI
by: Unnikrishnan, Balagopal, et al.
Published: (2025)

The Adoption Paradox for Veterinary Professionals in China: High Use of Artificial Intelligence Despite Low Familiarity
by: Li, Shumin, et al.
Published: (2025)

Social Media Informatics for Sustainable Cities and Societies: An Overview of the Applications, associated Challenges, and Potential Solutions
by: Khan, Jebran, et al.
Published: (2024)

Traits of a Leader: User Influence Level Prediction through Sociolinguistic Modeling
by: Katerenchuk, Denys, et al.
Published: (2025)

Towards an AI Observatory for the Nuclear Sector: A tool for anticipatory governance
by: Verma, Aditi, et al.
Published: (2025)

A Survey of Physics-Informed AI for Complex Urban Systems
by: Xu, En, et al.
Published: (2025)

Working with Large Language Models to Enhance Messaging Effectiveness for Vaccine Confidence
by: Gullison, Lucinda, et al.
Published: (2025)

Forecasting Open-Weight AI Model Growth on HuggingFace
by: Bhandari, Kushal Raj, et al.
Published: (2025)

The Cognitive Kardashev Scale: Quantifying the Material Envelope of Civilisational Computation
by: Sharma, Sachin
Published: (2026)

AIs and Humans with Agency
by: Mumford, David
Published: (2026)

The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown
by: Zhu, William Yicheng, et al.
Published: (2026)

When AIs Judge AIs: The Rise of Agent-as-a-Judge Evaluation for LLMs
by: Yu, Fangyi
Published: (2025)

Bootstrapping Developmental AIs: From Simple Competences to Intelligent Human-Compatible AIs
by: Stefik, Mark, et al.
Published: (2023)

Intrinsic Barriers to Explaining Deep Foundation Models
by: Tan, Zhen, et al.
Published: (2025)

Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction
by: Jiang, Mei, et al.
Published: (2025)

What Is AI Safety? What Do We Want It to Be?
by: Harding, Jacqueline, et al.
Published: (2025)

Implicit Bias-Like Patterns in Reasoning Models
by: Lee, Messi H. J., et al.
Published: (2025)

Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs
by: Esposito, Naomi, et al.
Published: (2026)

Explaining How Quantization Disparately Skews a Model
by: Bellam, Abhimanyu, et al.
Published: (2025)

Crafting Desirable Climate Trajectories with RL Explored Socio-Environmental Simulations
by: Rudd-Jones, James, et al.
Published: (2024)