| Main authors: | Yang, Tiancheng, Schonlau, Matthias, Sucholutsky, Ilia |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2605.30087 |
Similar items
The Hammock Plot: Where Categorical and Numerical Data Relax Together
by: Schonlau, Matthias, et al.
Published: (2025)
Learning Human-like Representations to Enable Learning Human Values
by: Wynn, Andrea, et al.
Published: (2023)
Revisiting Rogers' Paradox in the Context of Human-AI Interaction
by: Collins, Katherine M., et al.
Published: (2025)
What is a Number, That a Large Language Model May Know It?
by: Marjieh, Raja, et al.
Published: (2025)
AIRHILT: A Human-in-the-Loop Testbed for Multimodal Conflict Detection in Aviation
by: Garib, Omar, et al.
Published: (2025)
Do Large Language Models Mentalize When They Teach?
by: Harootonian, Sevan K., et al.
Published: (2026)
Analyzing the Roles of Language and Vision in Learning from Limited Data
by: Chen, Allison, et al.
Published: (2024)
Comprehensive Comparison of RAG Methods Across Multi-Domain Conversational QA
by: Alushi, Klejda, et al.
Published: (2026)
According to Me: Long-Term Personalized Referential Memory QA
by: Mei, Jingbiao, et al.
Published: (2026)
Concept Alignment
by: Rane, Sunayana, et al.
Published: (2024)
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
by: Dong, Yijiang River, et al.
Published: (2025)
Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships
by: Oktar, Kerem, et al.
Published: (2025)
AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
by: Huang, Tiancheng, et al.
Published: (2025)
Large Language Models Assume People are More Rational than We Really are
by: Liu, Ryan, et al.
Published: (2024)
A Rational Analysis of the Speech-to-Song Illusion
by: Marjieh, Raja, et al.
Published: (2024)
Response-Aware User Memory Selection for LLM Personalization
by: Fisher, Jillian, et al.
Published: (2026)
Towards Formalizing Spuriousness of Biased Datasets Using Partial Information Decomposition
by: Halder, Barproda, et al.
Published: (2024)
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse
by: Liu, Ryan, et al.
Published: (2024)
Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification
by: Yang, Guang, et al.
Published: (2023)
Field strength-dependent performance variability in deep learning-based analysis of magnetic resonance imaging
by: Qadir, Muhammad Ibtsaam, et al.
Published: (2025)
Memory-QA: Answering Recall Questions Based on Multimodal Memories
by: Jiang, Hongda, et al.
Published: (2025)
Using LLMs to Advance the Cognitive Science of Collectives
by: Sucholutsky, Ilia, et al.
Published: (2025)
LifeBench: A Benchmark for Long-Horizon Multi-Source Memory
by: Cheng, Zihao, et al.
Published: (2026)
Medical Model Synthesis Architectures: A Case Study
by: Collins, Katherine M., et al.
Published: (2026)
On Benchmarking Human-Like Intelligence in Machines
by: Ying, Lance, et al.
Published: (2025)
Learning with Language-Guided State Abstractions
by: Peng, Andi, et al.
Published: (2024)
Why Human Guidance Matters in Collaborative Vibe Coding
by: Hu, Haoyu, et al.
Published: (2026)
Beyond Playtesting: A Generative Multi-Agent Simulation System for Massively Multiplayer Online Games
by: Zhang, Ran, et al.
Published: (2025)
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
by: He, Zhonghao, et al.
Published: (2024)
MultiDx: A Multi-Source Knowledge Integration Framework towards Diagnostic Reasoning
by: Deng, Yimin, et al.
Published: (2026)
Improving the Efficiency of Language Agent Teams with Adaptive Task Graphs
by: Mieczkowski, Elizabeth, et al.
Published: (2026)
Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
by: Dai, Gaole, et al.
Published: (2024)
Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations
by: Jin, Jiho, et al.
Published: (2025)
Preference-Conditioned Language-Guided Abstraction
by: Peng, Andi, et al.
Published: (2024)
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
by: Li, Chuhan, et al.
Published: (2024)
HiQA: A Hierarchical Contextual Augmentation RAG for Multi-Documents QA
by: Chen, Xinyue, et al.
Published: (2024)
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs
by: Cattan, Arie, et al.
Published: (2025)
ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA
by: Zhao, Xinjie, et al.
Published: (2025)
Advancing MAPF Toward the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)
by: Yan, Jingtian, et al.
Published: (2025)
Synthetic Data-Driven Prompt Tuning for Financial QA over Tables and Documents
by: Yu, Yaoning, et al.
Published: (2025)