:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Bailey, Richard M.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence I.2.0
Online Access:	https://arxiv.org/abs/2510.15772
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Graceful task adaptation with a bi-hemispheric RL agent
by: Nicholas, Grant, et al.
Published: (2024)

ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models
by: Dumitru, Razvan-Gabriel, et al.
Published: (2025)

Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
by: Ged, François, et al.
Published: (2023)

The algorithmic muse and the public domain: Why copyrights legal philosophy precludes protection for generative AI outputs
by: Elmahjub, Ezieddin
Published: (2025)

How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
by: Singh, Siddarth, et al.
Published: (2023)

Efficiently Quantifying Individual Agent Importance in Cooperative MARL
by: Mahjoub, Omayma, et al.
Published: (2023)

Sensemaking in Novel Environments: How Human Cognition Can Inform Artificial Agents
by: Patterson, Robert E., et al.
Published: (2025)

Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
by: Arnesen, Samuel, et al.
Published: (2024)

From Language Models to Practical Self-Improving Computer Agents
by: Sheng, Alex
Published: (2024)

An Axiomatic Approach to General Intelligence: SANC(E3) -- Self-organizing Active Network of Concepts with Energy E3
by: Kwon, Daesuk, et al.
Published: (2026)

Interpolative Decoding: Exploring the Spectrum of Personality Traits in LLMs
by: Yeh, Eric, et al.
Published: (2025)

Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs
by: Kilov, Daniel, et al.
Published: (2025)

Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning?
by: Araya, Roberto
Published: (2025)

HCAST: Human-Calibrated Autonomy Software Tasks
by: Rein, David, et al.
Published: (2025)

Mutagenesis screen to map the functions of parameters of Large Language Models
by: Hu, Yue, et al.
Published: (2024)

Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World
by: Prasad, Hari, et al.
Published: (2024)

Machine Learning and Theory Ladenness -- A Phenomenological Account
by: Termine, Alberto, et al.
Published: (2024)

A Case-Based Persistent Memory for a Large Language Model
by: Watson, Ian
Published: (2023)

Reward is not enough: can we liberate AI from the reinforcement learning paradigm?
by: Glukhov, Vacslav
Published: (2022)

From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments
by: Luo, Lijing, et al.
Published: (2026)

Complete Implementation of WXF Chinese Chess Rules
by: Tan, Daniel, et al.
Published: (2024)

Developing trustworthy AI applications with foundation models
by: Mock, Michael, et al.
Published: (2024)

How VADER is your AI? Towards a definition of artificial intelligence systems appropriate for regulation
by: Bezerra, Leonardo C. T., et al.
Published: (2024)

ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)

Position Paper: Bounded Alignment: What (Not) To Expect From AGI Agents
by: Minai, Ali A.
Published: (2025)

Beyond Mimicry: Preference Coherence in LLMs
by: Mikaelson, Luhan, et al.
Published: (2025)

EduQate: Generating Adaptive Curricula through RMABs in Education Settings
by: Tio, Sidney, et al.
Published: (2024)

Intervention Complexity as a Canonical Reward and a Measure of Intelligence
by: McCane, Brendan
Published: (2026)

Benchmarking AI for low-resource contexts: Thinking beyond leaderboards
by: Pant, Aakash, et al.
Published: (2026)

Right-to-Act: A Pre-Execution Non-Compensatory Decision Protocol for AI Systems
by: Lavi, Gadi
Published: (2026)

Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval
by: Bayarri-Planas, Jordi, et al.
Published: (2024)

Fanar: An Arabic-Centric Multimodal Generative AI Platform
by: Fanar Team, et al.
Published: (2025)

The dynamics of belief: continuously monitoring and visualising complex systems
by: Beggs, Edwin J., et al.
Published: (2022)

The Station: An Open-World Environment for AI-Driven Discovery
by: Chung, Stephen, et al.
Published: (2025)

How Data Quality Affects Machine Learning Models for Credit Risk Assessment
by: Maurino, Andrea
Published: (2025)

A domain-specific language for describing machine learning datasets
by: Giner-Miguelez, Joan, et al.
Published: (2022)

ETOM: A Five-Level Benchmark for Evaluating Tool Orchestration within the MCP Ecosystem
by: Dong, Jia-Kai, et al.
Published: (2025)

Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
by: Wu, Dekun, et al.
Published: (2023)

Augmenting deep neural networks with symbolic knowledge: Towards trustworthy and interpretable AI for education
by: Hooshyar, Danial, et al.
Published: (2023)

Measuring proximity to standard planes during fetal brain ultrasound scanning
by: Di Vece, Chiara, et al.
Published: (2024)