| Main Author: | Vallstrom, Daniel |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03685 |
Similar Items
Measuring What AI Systems Might Do: Towards A Measurement Science in AI
by: Voudouris, Konstantinos, et al.
Published: (2026)
A mathematical theory of evolution for self-designing AIs
by: Harris, Kenneth D
Published: (2026)
Bringing AI Participation Down to Scale: A Comment on OpenAI's Democratic Inputs to AI Project
by: Moats, David, et al.
Published: (2024)
Compression, The Fermi Paradox and Artificial Super-Intelligence
by: Bennett, Michael Timothy
Published: (2021)
Q-Learning-Driven Adaptive Rewiring for Cooperative Control in Heterogeneous Networks
by: Weng, Yi-Ning, et al.
Published: (2025)
The Anatomy Spread of Online Opinion Polarization: The Pivotal Role of Super-Spreaders in Social Networks
by: Kawahata, Yasuko
Published: (2023)
On the Diminishing Returns of Width for Continual Learning
by: Guha, Etash, et al.
Published: (2024)
Note: Evolutionary Game Theory Focus Informational Health: The Cocktail Party Effect Through Werewolfgame under Incomplete Information and ESS Search Method Using Expected Gains of Repeated Dilemmas
by: Kawahata, Yasuko
Published: (2024)
Runtime Monitoring and Enforcement of Conditional Fairness in Generative AIs
by: Cheng, Chih-Hong, et al.
Published: (2024)
The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)
The AI Double Standard: Humans Judge All AIs for the Actions of One
by: Manoli, Aikaterina, et al.
Published: (2024)
What AIs are not Learning (and Why)
by: Stefik, Mark
Published: (2024)
Where AI Assurance Might Go Wrong: Initial lessons from engineering of critical systems
by: Bloomfield, Robin, et al.
Published: (2025)
How to Count AIs: Individuation and Liability for AI Agents
by: Arbel, Yonathan, et al.
Published: (2026)
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
by: Mazeika, Mantas, et al.
Published: (2025)
Decentralized Traffic Flow Optimization Through Intrinsic Motivation
by: Papala, Himaja, et al.
Published: (2025)
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)
The Psychology of Learning from Machines: Anthropomorphic AI and the Paradox of Automation in Education
by: Qadir, Junaid, et al.
Published: (2026)
ShaRP: Explaining Rankings and Preferences with Shapley Values
by: Pliatsika, Venetia, et al.
Published: (2024)
Beyond Ethics: How Inclusive Innovation Drives Economic Returns in Medical AI
by: Unnikrishnan, Balagopal, et al.
Published: (2025)
The Adoption Paradox for Veterinary Professionals in China: High Use of Artificial Intelligence Despite Low Familiarity
by: Li, Shumin, et al.
Published: (2025)
Social Media Informatics for Sustainable Cities and Societies: An Overview of the Applications, associated Challenges, and Potential Solutions
by: Khan, Jebran, et al.
Published: (2024)
Traits of a Leader: User Influence Level Prediction through Sociolinguistic Modeling
by: Katerenchuk, Denys, et al.
Published: (2025)
Towards an AI Observatory for the Nuclear Sector: A tool for anticipatory governance
by: Verma, Aditi, et al.
Published: (2025)
A Survey of Physics-Informed AI for Complex Urban Systems
by: Xu, En, et al.
Published: (2025)
Working with Large Language Models to Enhance Messaging Effectiveness for Vaccine Confidence
by: Gullison, Lucinda, et al.
Published: (2025)
Forecasting Open-Weight AI Model Growth on HuggingFace
by: Bhandari, Kushal Raj, et al.
Published: (2025)
The Cognitive Kardashev Scale: Quantifying the Material Envelope of Civilisational Computation
by: Sharma, Sachin
Published: (2026)
AIs and Humans with Agency
by: Mumford, David
Published: (2026)
The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown
by: Zhu, William Yicheng, et al.
Published: (2026)
When AIs Judge AIs: The Rise of Agent-as-a-Judge Evaluation for LLMs
by: Yu, Fangyi
Published: (2025)
Bootstrapping Developmental AIs: From Simple Competences to Intelligent Human-Compatible AIs
by: Stefik, Mark, et al.
Published: (2023)
Intrinsic Barriers to Explaining Deep Foundation Models
by: Tan, Zhen, et al.
Published: (2025)
Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction
by: Jiang, Mei, et al.
Published: (2025)
What Is AI Safety? What Do We Want It to Be?
by: Harding, Jacqueline, et al.
Published: (2025)
Implicit Bias-Like Patterns in Reasoning Models
by: Lee, Messi H. J., et al.
Published: (2025)
Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs
by: Esposito, Naomi, et al.
Published: (2026)
Explaining How Quantization Disparately Skews a Model
by: Bellam, Abhimanyu, et al.
Published: (2025)
Crafting Desirable Climate Trajectories with RL Explored Socio-Environmental Simulations
by: Rudd-Jones, James, et al.
Published: (2024)