:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Vaduguru, Saujas, Hua, Yilun, Artzi, Yoav, Fried, Daniel
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.24023
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Post-training for Efficient Communication via Convention Formation
by: Hua, Yilun, et al.
Published: (2025)

Analyzing Information Sharing and Coordination in Multi-Agent Planning
by: Ou, Tianyue, et al.
Published: (2025)

Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs
by: Yerukola, Akhila, et al.
Published: (2024)

Generating Pragmatic Examples to Train Neural Program Synthesizers
by: Vaduguru, Saujas, et al.
Published: (2023)

Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs
by: Hua, Yilun, et al.
Published: (2024)

Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation
by: İnan, Mert, et al.
Published: (2025)

Amortizing Pragmatic Program Synthesis with Rankings
by: Pu, Yewen, et al.
Published: (2024)

Amortizing Pragmatic Program Synthesis with Rankings
by: Pu, Yewen, et al.
Published: (2023)

mrCAD: Multimodal Refinement of Computer-aided Designs
by: McCarthy, William P., et al.
Published: (2025)

Lost in Backpropagation: The LM Head is a Gradient Bottleneck
by: Godey, Nathan, et al.
Published: (2026)

No Mean Feat: Simple, Strong Baselines for Context Compression
by: Feldman, Yair, et al.
Published: (2025)

CoGen: Learning from Feedback with Coupled Comprehension and Generation
by: Gul, Mustafa Omer, et al.
Published: (2024)

Breadcrumbs Reasoning: Memory-Efficient Reasoning with Compression Beacons
by: Monea, Giovanni, et al.
Published: (2025)

A Surprising Failure? Multimodal LLMs and the NLVR Challenge
by: Wu, Anne, et al.
Published: (2024)

A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
by: Kojima, Noriyuki, et al.
Published: (2023)

LLMs Are In-Context Bandit Reinforcement Learners
by: Monea, Giovanni, et al.
Published: (2024)

Ad hoc conventions generalize to new referents
by: Ji, Anya, et al.
Published: (2025)

IncDSI: Incrementally Updatable Document Retrieval
by: Kishore, Varsha, et al.
Published: (2023)

Knot So Simple: A Minimalistic Environment for Spatial Reasoning
by: Chen, Zizhao, et al.
Published: (2025)

Retrospective Learning from Interactions
by: Chen, Zizhao, et al.
Published: (2024)

Cancer Type, Stage and Prognosis Assessment from Pathology Reports using LLMs
by: Saluja, Rachit, et al.
Published: (2025)

Evaluating Large Language Model Biases in Persona-Steered Generation
by: Liu, Andy, et al.
Published: (2024)

Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison-Eliciting Posts They Fail to Detect
by: Zhao, Hua, et al.
Published: (2026)

Hint-Augmented Re-ranking: Efficient Product Search using LLM-Based Query Decomposition
by: Zhu, Yilun, et al.
Published: (2025)

Inducing Programmatic Skills for Agentic Tasks
by: Wang, Zora Zhiruo, et al.
Published: (2025)

Agent Workflow Memory
by: Wang, Zora Zhiruo, et al.
Published: (2024)

Measuring Fine-Grained Negotiation Tactics of Humans and LLMs in Diplomacy
by: Li, Wenkai, et al.
Published: (2025)

Multi-Agent Computer Use
by: Koh, Jing Yu, et al.
Published: (2026)

Evaluating the Utility of Grounding Documents with Reference-Free LLM-based Metrics
by: Hua, Yilun, et al.
Published: (2026)

Synergy-of-Thoughts: Eliciting Efficient Reasoning in Hybrid Language Models
by: Shang, Yu, et al.
Published: (2024)

Bayesian Preference Elicitation with Language Models
by: Handa, Kunal, et al.
Published: (2024)

Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models
by: Zhao, Zekai, et al.
Published: (2025)

ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?
by: Waghjale, Siddhant, et al.
Published: (2024)

Pre-training Limited Memory Language Models with Internal and External Knowledge
by: Zhao, Linxi, et al.
Published: (2025)

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling
by: Capstick, Alexander, et al.
Published: (2024)

Data Augmentation for Code Translation with Comparable Corpora and Multiple References
by: Xie, Yiqing, et al.
Published: (2023)

CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells
by: Naik, Atharva, et al.
Published: (2024)

Z1: Efficient Test-time Scaling with Code
by: Yu, Zhaojian, et al.
Published: (2025)

Unsupervised Elicitation of Language Models
by: Wen, Jiaxin, et al.
Published: (2025)

Paraphrase Types Elicit Prompt Engineering Capabilities
by: Wahle, Jan Philip, et al.
Published: (2024)