:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Yang, Tiancheng, Schonlau, Matthias, Sucholutsky, Ilia
Formato:	Preprint
Publicado:	2026
Materias:	Artificial Intelligence
Acceso en línea:	https://arxiv.org/abs/2605.30087
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

The Hammock Plot: Where Categorical and Numerical Data Relax Together
por: Schonlau, Matthias, et al.
Publicado: (2025)

Learning Human-like Representations to Enable Learning Human Values
por: Wynn, Andrea, et al.
Publicado: (2023)

Revisiting Rogers' Paradox in the Context of Human-AI Interaction
por: Collins, Katherine M., et al.
Publicado: (2025)

What is a Number, That a Large Language Model May Know It?
por: Marjieh, Raja, et al.
Publicado: (2025)

AIRHILT: A Human-in-the-Loop Testbed for Multimodal Conflict Detection in Aviation
por: Garib, Omar, et al.
Publicado: (2025)

Do Large Language Models Mentalize When They Teach?
por: Harootonian, Sevan K., et al.
Publicado: (2026)

Analyzing the Roles of Language and Vision in Learning from Limited Data
por: Chen, Allison, et al.
Publicado: (2024)

Comprehensive Comparison of RAG Methods Across Multi-Domain Conversational QA
por: Alushi, Klejda, et al.
Publicado: (2026)

According to Me: Long-Term Personalized Referential Memory QA
por: Mei, Jingbiao, et al.
Publicado: (2026)

Concept Alignment
por: Rane, Sunayana, et al.
Publicado: (2024)

When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
por: Dong, Yijiang River, et al.
Publicado: (2025)

Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships
por: Oktar, Kerem, et al.
Publicado: (2025)

AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
por: Huang, Tiancheng, et al.
Publicado: (2025)

Large Language Models Assume People are More Rational than We Really are
por: Liu, Ryan, et al.
Publicado: (2024)

A Rational Analysis of the Speech-to-Song Illusion
por: Marjieh, Raja, et al.
Publicado: (2024)

Response-Aware User Memory Selection for LLM Personalization
por: Fisher, Jillian, et al.
Publicado: (2026)

Towards Formalizing Spuriousness of Biased Datasets Using Partial Information Decomposition
por: Halder, Barproda, et al.
Publicado: (2024)

Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse
por: Liu, Ryan, et al.
Publicado: (2024)

Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification
por: Yang, Guang, et al.
Publicado: (2023)

Field strength-dependent performance variability in deep learning-based analysis of magnetic resonance imaging
por: Qadir, Muhammad Ibtsaam, et al.
Publicado: (2025)

Memory-QA: Answering Recall Questions Based on Multimodal Memories
por: Jiang, Hongda, et al.
Publicado: (2025)

Using LLMs to Advance the Cognitive Science of Collectives
por: Sucholutsky, Ilia, et al.
Publicado: (2025)

LifeBench: A Benchmark for Long-Horizon Multi-Source Memory
por: Cheng, Zihao, et al.
Publicado: (2026)

Medical Model Synthesis Architectures: A Case Study
por: Collins, Katherine M., et al.
Publicado: (2026)

On Benchmarking Human-Like Intelligence in Machines
por: Ying, Lance, et al.
Publicado: (2025)

Learning with Language-Guided State Abstractions
por: Peng, Andi, et al.
Publicado: (2024)

Why Human Guidance Matters in Collaborative Vibe Coding
por: Hu, Haoyu, et al.
Publicado: (2026)

Beyond Playtesting: A Generative Multi-Agent Simulation System for Massively Multiplayer Online Games
por: Zhang, Ran, et al.
Publicado: (2025)

Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
por: He, Zhonghao, et al.
Publicado: (2024)

MultiDx: A Multi-Source Knowledge Integration Framework towards Diagnostic Reasoning
por: Deng, Yimin, et al.
Publicado: (2026)

Improving the Efficiency of Language Agent Teams with Adaptive Task Graphs
por: Mieczkowski, Elizabeth, et al.
Publicado: (2026)

Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
por: Dai, Gaole, et al.
Publicado: (2024)

Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations
por: Jin, Jiho, et al.
Publicado: (2025)

Preference-Conditioned Language-Guided Abstraction
por: Peng, Andi, et al.
Publicado: (2024)

M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
por: Li, Chuhan, et al.
Publicado: (2024)

HiQA: A Hierarchical Contextual Augmentation RAG for Multi-Documents QA
por: Chen, Xinyue, et al.
Publicado: (2024)

DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs
por: Cattan, Arie, et al.
Publicado: (2025)

ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA
por: Zhao, Xinjie, et al.
Publicado: (2025)

Advancing MAPF Toward the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)
por: Yan, Jingtian, et al.
Publicado: (2025)

Synthetic Data-Driven Prompt Tuning for Financial QA over Tables and Documents
por: Yu, Yaoning, et al.
Publicado: (2025)