:: Library Catalog

Bibliographic Details
Main Authors:	Rane, Sunayana; Lake, Brenden M.; Griffiths, Thomas L.
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.21683

Similar Items

Concept Alignment
by: Rane, Sunayana, et al.
Published: (2024)

The Reasonable Person Standard for AI
by: Rane, Sunayana
Published: (2024)

CoLLEGe: Concept Embedding Generation for Large Language Models
by: Teehan, Ryan, et al.
Published: (2024)

Whither symbols in the era of advanced neural networks?
by: Griffiths, Thomas L., et al.
Published: (2025)

Are they human? Detecting large language models by probing human memory constraints
by: Schug, Simon, et al.
Published: (2026)

Overcoming classic challenges for artificial neural networks by providing incentives and practice
by: Irie, Kazuki, et al.
Published: (2024)

Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
by: Kumar, Sreejan, et al.
Published: (2024)

SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts
by: Yueh-Han, Chen, et al.
Published: (2025)

An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)

Compositional learning of functions in humans and machines
by: Zhou, Yanli, et al.
Published: (2024)

Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025)

Do Large Language Models Reason Causally Like Us? Even Better?
by: Dettki, Hanna M., et al.
Published: (2025)

Goals as Reward-Producing Programs
by: Davidson, Guy, et al.
Published: (2024)

H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark
by: LeGris, Solim, et al.
Published: (2024)

Distilling Symbolic Priors for Concept Learning into Neural Networks
by: Marinescu, Ioana, et al.
Published: (2024)

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
by: Yang, Fan, et al.
Published: (2024)

Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem
by: Campbell, Declan, et al.
Published: (2024)

BRITE: A Benchmark for Reliable and Interpretable T2V Evaluation on Implausible Scenarios
by: Tilak, Advait, et al.
Published: (2026)

Convolutional Neural Networks Can (Meta-)Learn the Same-Different Relation
by: Gupta, Max, et al.
Published: (2025)

Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)

TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility
by: Motamed, Saman, et al.
Published: (2025)

Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)

Cat-DPO: Category-Adaptive Safety Alignment
by: Yang, Tiankai, et al.
Published: (2026)

Optimizing Path Planning using Deep Reinforcement Learning for UGVs in Precision Agriculture
by: Patade, Laukik, et al.
Published: (2026)

Toward Efficient Exploration by Large Language Model Agents
by: Arumugam, Dilip, et al.
Published: (2025)

Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations
by: Tartaglini, Alexa R., et al.
Published: (2023)

Probing the Probes: Methods and Metrics for Concept Alignment
by: Lysnæs-Larsen, Jacob, et al.
Published: (2025)

Conformal Prediction as Bayesian Quadrature
by: Snell, Jake C., et al.
Published: (2025)

Incoherent Probability Judgments in Large Language Models
by: Zhu, Jian-Qiao, et al.
Published: (2024)

A Rational Analysis of the Effects of Sycophantic AI
by: Batista, Rafael M., et al.
Published: (2026)

Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)

GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)

Learning Human-like Representations to Enable Learning Human Values
by: Wynn, Andrea, et al.
Published: (2023)

Prototype-Grounded Concept Models for Verifiable Concept Alignment
by: Colamonaco, Stefano, et al.
Published: (2026)

Using Reinforcement Learning to Train Large Language Models to Explain Human Decisions
by: Zhu, Jian-Qiao, et al.
Published: (2025)

Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
by: Prabhakar, Akshara, et al.
Published: (2024)

Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints
by: Zhu, Jian-Qiao, et al.
Published: (2025)

Steering Risk Preferences in Large Language Models by Aligning Behavioral and Neural Representations
by: Zhu, Jian-Qiao, et al.
Published: (2025)

Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo
by: Zhu, Jian-Qiao, et al.
Published: (2024)

Process Matters more than Output for Distinguishing Humans from Machines
by: Rmus, Milena, et al.
Published: (2026)