Saved in:
| Main Author: | Bailey, Richard M. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.15772 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Graceful task adaptation with a bi-hemispheric RL agent
by: Nicholas, Grant, et al.
Published: (2024)
by: Nicholas, Grant, et al.
Published: (2024)
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models
by: Dumitru, Razvan-Gabriel, et al.
Published: (2025)
by: Dumitru, Razvan-Gabriel, et al.
Published: (2025)
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
by: Ged, François, et al.
Published: (2023)
by: Ged, François, et al.
Published: (2023)
The algorithmic muse and the public domain: Why copyrights legal philosophy precludes protection for generative AI outputs
by: Elmahjub, Ezieddin
Published: (2025)
by: Elmahjub, Ezieddin
Published: (2025)
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
by: Singh, Siddarth, et al.
Published: (2023)
by: Singh, Siddarth, et al.
Published: (2023)
Efficiently Quantifying Individual Agent Importance in Cooperative MARL
by: Mahjoub, Omayma, et al.
Published: (2023)
by: Mahjoub, Omayma, et al.
Published: (2023)
Sensemaking in Novel Environments: How Human Cognition Can Inform Artificial Agents
by: Patterson, Robert E., et al.
Published: (2025)
by: Patterson, Robert E., et al.
Published: (2025)
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
by: Arnesen, Samuel, et al.
Published: (2024)
by: Arnesen, Samuel, et al.
Published: (2024)
From Language Models to Practical Self-Improving Computer Agents
by: Sheng, Alex
Published: (2024)
by: Sheng, Alex
Published: (2024)
An Axiomatic Approach to General Intelligence: SANC(E3) -- Self-organizing Active Network of Concepts with Energy E3
by: Kwon, Daesuk, et al.
Published: (2026)
by: Kwon, Daesuk, et al.
Published: (2026)
Interpolative Decoding: Exploring the Spectrum of Personality Traits in LLMs
by: Yeh, Eric, et al.
Published: (2025)
by: Yeh, Eric, et al.
Published: (2025)
Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs
by: Kilov, Daniel, et al.
Published: (2025)
by: Kilov, Daniel, et al.
Published: (2025)
Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning?
by: Araya, Roberto
Published: (2025)
by: Araya, Roberto
Published: (2025)
HCAST: Human-Calibrated Autonomy Software Tasks
by: Rein, David, et al.
Published: (2025)
by: Rein, David, et al.
Published: (2025)
Mutagenesis screen to map the functions of parameters of Large Language Models
by: Hu, Yue, et al.
Published: (2024)
by: Hu, Yue, et al.
Published: (2024)
Appraisal-Guided Proximal Policy Optimization: Modeling Psychological Disorders in Dynamic Grid World
by: Prasad, Hari, et al.
Published: (2024)
by: Prasad, Hari, et al.
Published: (2024)
Machine Learning and Theory Ladenness -- A Phenomenological Account
by: Termine, Alberto, et al.
Published: (2024)
by: Termine, Alberto, et al.
Published: (2024)
A Case-Based Persistent Memory for a Large Language Model
by: Watson, Ian
Published: (2023)
by: Watson, Ian
Published: (2023)
Reward is not enough: can we liberate AI from the reinforcement learning paradigm?
by: Glukhov, Vacslav
Published: (2022)
by: Glukhov, Vacslav
Published: (2022)
From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments
by: Luo, Lijing, et al.
Published: (2026)
by: Luo, Lijing, et al.
Published: (2026)
Complete Implementation of WXF Chinese Chess Rules
by: Tan, Daniel, et al.
Published: (2024)
by: Tan, Daniel, et al.
Published: (2024)
Developing trustworthy AI applications with foundation models
by: Mock, Michael, et al.
Published: (2024)
by: Mock, Michael, et al.
Published: (2024)
How VADER is your AI? Towards a definition of artificial intelligence systems appropriate for regulation
by: Bezerra, Leonardo C. T., et al.
Published: (2024)
by: Bezerra, Leonardo C. T., et al.
Published: (2024)
ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)
by: Kim, Sejin, et al.
Published: (2025)
Position Paper: Bounded Alignment: What (Not) To Expect From AGI Agents
by: Minai, Ali A.
Published: (2025)
by: Minai, Ali A.
Published: (2025)
Beyond Mimicry: Preference Coherence in LLMs
by: Mikaelson, Luhan, et al.
Published: (2025)
by: Mikaelson, Luhan, et al.
Published: (2025)
EduQate: Generating Adaptive Curricula through RMABs in Education Settings
by: Tio, Sidney, et al.
Published: (2024)
by: Tio, Sidney, et al.
Published: (2024)
Intervention Complexity as a Canonical Reward and a Measure of Intelligence
by: McCane, Brendan
Published: (2026)
by: McCane, Brendan
Published: (2026)
Benchmarking AI for low-resource contexts: Thinking beyond leaderboards
by: Pant, Aakash, et al.
Published: (2026)
by: Pant, Aakash, et al.
Published: (2026)
Right-to-Act: A Pre-Execution Non-Compensatory Decision Protocol for AI Systems
by: Lavi, Gadi
Published: (2026)
by: Lavi, Gadi
Published: (2026)
Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval
by: Bayarri-Planas, Jordi, et al.
Published: (2024)
by: Bayarri-Planas, Jordi, et al.
Published: (2024)
Fanar: An Arabic-Centric Multimodal Generative AI Platform
by: Fanar Team, et al.
Published: (2025)
by: Fanar Team, et al.
Published: (2025)
The dynamics of belief: continuously monitoring and visualising complex systems
by: Beggs, Edwin J., et al.
Published: (2022)
by: Beggs, Edwin J., et al.
Published: (2022)
The Station: An Open-World Environment for AI-Driven Discovery
by: Chung, Stephen, et al.
Published: (2025)
by: Chung, Stephen, et al.
Published: (2025)
How Data Quality Affects Machine Learning Models for Credit Risk Assessment
by: Maurino, Andrea
Published: (2025)
by: Maurino, Andrea
Published: (2025)
A domain-specific language for describing machine learning datasets
by: Giner-Miguelez, Joan, et al.
Published: (2022)
by: Giner-Miguelez, Joan, et al.
Published: (2022)
ETOM: A Five-Level Benchmark for Evaluating Tool Orchestration within the MCP Ecosystem
by: Dong, Jia-Kai, et al.
Published: (2025)
by: Dong, Jia-Kai, et al.
Published: (2025)
Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
by: Wu, Dekun, et al.
Published: (2023)
by: Wu, Dekun, et al.
Published: (2023)
Augmenting deep neural networks with symbolic knowledge: Towards trustworthy and interpretable AI for education
by: Hooshyar, Danial, et al.
Published: (2023)
by: Hooshyar, Danial, et al.
Published: (2023)
Measuring proximity to standard planes during fetal brain ultrasound scanning
by: Di Vece, Chiara, et al.
Published: (2024)
by: Di Vece, Chiara, et al.
Published: (2024)
Similar Items
-
Graceful task adaptation with a bi-hemispheric RL agent
by: Nicholas, Grant, et al.
Published: (2024) -
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models
by: Dumitru, Razvan-Gabriel, et al.
Published: (2025) -
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
by: Ged, François, et al.
Published: (2023) -
The algorithmic muse and the public domain: Why copyrights legal philosophy precludes protection for generative AI outputs
by: Elmahjub, Ezieddin
Published: (2025) -
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
by: Singh, Siddarth, et al.
Published: (2023)