| Main Authors: | Rane, Sunayana, Lake, Brenden M., Griffiths, Thomas L. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.21683 |
Similar Items
Concept Alignment
by: Rane, Sunayana, et al.
Published: (2024)
The Reasonable Person Standard for AI
by: Rane, Sunayana
Published: (2024)
CoLLEGe: Concept Embedding Generation for Large Language Models
by: Teehan, Ryan, et al.
Published: (2024)
Whither symbols in the era of advanced neural networks?
by: Griffiths, Thomas L., et al.
Published: (2025)
Are they human? Detecting large language models by probing human memory constraints
by: Schug, Simon, et al.
Published: (2026)
Overcoming classic challenges for artificial neural networks by providing incentives and practice
by: Irie, Kazuki, et al.
Published: (2024)
Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
by: Kumar, Sreejan, et al.
Published: (2024)
SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts
by: Yueh-Han, Chen, et al.
Published: (2025)
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)
Compositional learning of functions in humans and machines
by: Zhou, Yanli, et al.
Published: (2024)
Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025)
Do Large Language Models Reason Causally Like Us? Even Better?
by: Dettki, Hanna M., et al.
Published: (2025)
Goals as Reward-Producing Programs
by: Davidson, Guy, et al.
Published: (2024)
H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark
by: LeGris, Solim, et al.
Published: (2024)
Distilling Symbolic Priors for Concept Learning into Neural Networks
by: Marinescu, Ioana, et al.
Published: (2024)
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
by: Yang, Fan, et al.
Published: (2024)
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem
by: Campbell, Declan, et al.
Published: (2024)
BRITE: A Benchmark for Reliable and Interpretable T2V Evaluation on Implausible Scenarios
by: Tilak, Advait, et al.
Published: (2026)
Convolutional Neural Networks Can (Meta-)Learn the Same-Different Relation
by: Gupta, Max, et al.
Published: (2025)
Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)
TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility
by: Motamed, Saman, et al.
Published: (2025)
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)
Cat-DPO: Category-Adaptive Safety Alignment
by: Yang, Tiankai, et al.
Published: (2026)
Optimizing Path Planning using Deep Reinforcement Learning for UGVs in Precision Agriculture
by: Patade, Laukik, et al.
Published: (2026)
Toward Efficient Exploration by Large Language Model Agents
by: Arumugam, Dilip, et al.
Published: (2025)
Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations
by: Tartaglini, Alexa R., et al.
Published: (2023)
Probing the Probes: Methods and Metrics for Concept Alignment
by: Lysnæs-Larsen, Jacob, et al.
Published: (2025)
Conformal Prediction as Bayesian Quadrature
by: Snell, Jake C., et al.
Published: (2025)
Incoherent Probability Judgments in Large Language Models
by: Zhu, Jian-Qiao, et al.
Published: (2024)
A Rational Analysis of the Effects of Sycophantic AI
by: Batista, Rafael M., et al.
Published: (2026)
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)
GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)
Learning Human-like Representations to Enable Learning Human Values
by: Wynn, Andrea, et al.
Published: (2023)
Prototype-Grounded Concept Models for Verifiable Concept Alignment
by: Colamonaco, Stefano, et al.
Published: (2026)
Using Reinforcement Learning to Train Large Language Models to Explain Human Decisions
by: Zhu, Jian-Qiao, et al.
Published: (2025)
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
by: Prabhakar, Akshara, et al.
Published: (2024)
Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints
by: Zhu, Jian-Qiao, et al.
Published: (2025)
Steering Risk Preferences in Large Language Models by Aligning Behavioral and Neural Representations
by: Zhu, Jian-Qiao, et al.
Published: (2025)
Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo
by: Zhu, Jian-Qiao, et al.
Published: (2024)
Process Matters more than Output for Distinguishing Humans from Machines
by: Rmus, Milena, et al.
Published: (2026)