Saved in:
| Main Authors: | Vaduguru, Saujas, Hua, Yilun, Artzi, Yoav, Fried, Daniel |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.24023 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Post-training for Efficient Communication via Convention Formation
by: Hua, Yilun, et al.
Published: (2025)
by: Hua, Yilun, et al.
Published: (2025)
Analyzing Information Sharing and Coordination in Multi-Agent Planning
by: Ou, Tianyue, et al.
Published: (2025)
by: Ou, Tianyue, et al.
Published: (2025)
Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs
by: Yerukola, Akhila, et al.
Published: (2024)
by: Yerukola, Akhila, et al.
Published: (2024)
Generating Pragmatic Examples to Train Neural Program Synthesizers
by: Vaduguru, Saujas, et al.
Published: (2023)
by: Vaduguru, Saujas, et al.
Published: (2023)
Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs
by: Hua, Yilun, et al.
Published: (2024)
by: Hua, Yilun, et al.
Published: (2024)
Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation
by: İnan, Mert, et al.
Published: (2025)
by: İnan, Mert, et al.
Published: (2025)
Amortizing Pragmatic Program Synthesis with Rankings
by: Pu, Yewen, et al.
Published: (2024)
by: Pu, Yewen, et al.
Published: (2024)
Amortizing Pragmatic Program Synthesis with Rankings
by: Pu, Yewen, et al.
Published: (2023)
by: Pu, Yewen, et al.
Published: (2023)
mrCAD: Multimodal Refinement of Computer-aided Designs
by: McCarthy, William P., et al.
Published: (2025)
by: McCarthy, William P., et al.
Published: (2025)
Lost in Backpropagation: The LM Head is a Gradient Bottleneck
by: Godey, Nathan, et al.
Published: (2026)
by: Godey, Nathan, et al.
Published: (2026)
No Mean Feat: Simple, Strong Baselines for Context Compression
by: Feldman, Yair, et al.
Published: (2025)
by: Feldman, Yair, et al.
Published: (2025)
CoGen: Learning from Feedback with Coupled Comprehension and Generation
by: Gul, Mustafa Omer, et al.
Published: (2024)
by: Gul, Mustafa Omer, et al.
Published: (2024)
Breadcrumbs Reasoning: Memory-Efficient Reasoning with Compression Beacons
by: Monea, Giovanni, et al.
Published: (2025)
by: Monea, Giovanni, et al.
Published: (2025)
A Surprising Failure? Multimodal LLMs and the NLVR Challenge
by: Wu, Anne, et al.
Published: (2024)
by: Wu, Anne, et al.
Published: (2024)
A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
by: Kojima, Noriyuki, et al.
Published: (2023)
by: Kojima, Noriyuki, et al.
Published: (2023)
LLMs Are In-Context Bandit Reinforcement Learners
by: Monea, Giovanni, et al.
Published: (2024)
by: Monea, Giovanni, et al.
Published: (2024)
Ad hoc conventions generalize to new referents
by: Ji, Anya, et al.
Published: (2025)
by: Ji, Anya, et al.
Published: (2025)
IncDSI: Incrementally Updatable Document Retrieval
by: Kishore, Varsha, et al.
Published: (2023)
by: Kishore, Varsha, et al.
Published: (2023)
Knot So Simple: A Minimalistic Environment for Spatial Reasoning
by: Chen, Zizhao, et al.
Published: (2025)
by: Chen, Zizhao, et al.
Published: (2025)
Retrospective Learning from Interactions
by: Chen, Zizhao, et al.
Published: (2024)
by: Chen, Zizhao, et al.
Published: (2024)
Cancer Type, Stage and Prognosis Assessment from Pathology Reports using LLMs
by: Saluja, Rachit, et al.
Published: (2025)
by: Saluja, Rachit, et al.
Published: (2025)
Evaluating Large Language Model Biases in Persona-Steered Generation
by: Liu, Andy, et al.
Published: (2024)
by: Liu, Andy, et al.
Published: (2024)
Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison-Eliciting Posts They Fail to Detect
by: Zhao, Hua, et al.
Published: (2026)
by: Zhao, Hua, et al.
Published: (2026)
Hint-Augmented Re-ranking: Efficient Product Search using LLM-Based Query Decomposition
by: Zhu, Yilun, et al.
Published: (2025)
by: Zhu, Yilun, et al.
Published: (2025)
Inducing Programmatic Skills for Agentic Tasks
by: Wang, Zora Zhiruo, et al.
Published: (2025)
by: Wang, Zora Zhiruo, et al.
Published: (2025)
Agent Workflow Memory
by: Wang, Zora Zhiruo, et al.
Published: (2024)
by: Wang, Zora Zhiruo, et al.
Published: (2024)
Measuring Fine-Grained Negotiation Tactics of Humans and LLMs in Diplomacy
by: Li, Wenkai, et al.
Published: (2025)
by: Li, Wenkai, et al.
Published: (2025)
Multi-Agent Computer Use
by: Koh, Jing Yu, et al.
Published: (2026)
by: Koh, Jing Yu, et al.
Published: (2026)
Evaluating the Utility of Grounding Documents with Reference-Free LLM-based Metrics
by: Hua, Yilun, et al.
Published: (2026)
by: Hua, Yilun, et al.
Published: (2026)
Synergy-of-Thoughts: Eliciting Efficient Reasoning in Hybrid Language Models
by: Shang, Yu, et al.
Published: (2024)
by: Shang, Yu, et al.
Published: (2024)
Bayesian Preference Elicitation with Language Models
by: Handa, Kunal, et al.
Published: (2024)
by: Handa, Kunal, et al.
Published: (2024)
Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models
by: Zhao, Zekai, et al.
Published: (2025)
by: Zhao, Zekai, et al.
Published: (2025)
ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?
by: Waghjale, Siddhant, et al.
Published: (2024)
by: Waghjale, Siddhant, et al.
Published: (2024)
Pre-training Limited Memory Language Models with Internal and External Knowledge
by: Zhao, Linxi, et al.
Published: (2025)
by: Zhao, Linxi, et al.
Published: (2025)
AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling
by: Capstick, Alexander, et al.
Published: (2024)
by: Capstick, Alexander, et al.
Published: (2024)
Data Augmentation for Code Translation with Comparable Corpora and Multiple References
by: Xie, Yiqing, et al.
Published: (2023)
by: Xie, Yiqing, et al.
Published: (2023)
CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells
by: Naik, Atharva, et al.
Published: (2024)
by: Naik, Atharva, et al.
Published: (2024)
Z1: Efficient Test-time Scaling with Code
by: Yu, Zhaojian, et al.
Published: (2025)
by: Yu, Zhaojian, et al.
Published: (2025)
Unsupervised Elicitation of Language Models
by: Wen, Jiaxin, et al.
Published: (2025)
by: Wen, Jiaxin, et al.
Published: (2025)
Paraphrase Types Elicit Prompt Engineering Capabilities
by: Wahle, Jan Philip, et al.
Published: (2024)
by: Wahle, Jan Philip, et al.
Published: (2024)
Similar Items
-
Post-training for Efficient Communication via Convention Formation
by: Hua, Yilun, et al.
Published: (2025) -
Analyzing Information Sharing and Coordination in Multi-Agent Planning
by: Ou, Tianyue, et al.
Published: (2025) -
Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs
by: Yerukola, Akhila, et al.
Published: (2024) -
Generating Pragmatic Examples to Train Neural Program Synthesizers
by: Vaduguru, Saujas, et al.
Published: (2023) -
Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs
by: Hua, Yilun, et al.
Published: (2024)