| Main Authors: | Sivakumaran, Nithin, Yu, Shoubin, Lee, Hyunji, Zhang, Yue, Payani, Ali, Bansal, Mohit, Stengel-Eskin, Elias |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.16154 |
Similar Items
DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning
by: Sivakumaran, Nithin, et al.
Published: (2025)
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models
by: Stengel-Eskin, Elias, et al.
Published: (2024)
Teaching Models to Balance Resisting and Accepting Persuasion
by: Stengel-Eskin, Elias, et al.
Published: (2024)
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
by: Patil, Vaidehi, et al.
Published: (2025)
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns
by: Xiao, Hanqi, et al.
Published: (2025)
Multimodal Fact-Level Attribution for Verifiable Reasoning
by: Wan, David, et al.
Published: (2026)
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
by: Wan, David, et al.
Published: (2025)
Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind
by: Xiao, Hanqi, et al.
Published: (2026)
The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration
by: Patil, Vaidehi, et al.
Published: (2025)
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
by: Prasad, Archiki, et al.
Published: (2023)
Soft Self-Consistency Improves Language Model Agents
by: Wang, Han, et al.
Published: (2024)
Language Models Identify Ambiguities and Exploit Loopholes
by: Choi, Jio, et al.
Published: (2025)
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
by: Wang, Ziyang, et al.
Published: (2024)
Multi-Attribute Steering of Language Models via Targeted Intervention
by: Nguyen, Duy, et al.
Published: (2025)
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
by: Stengel-Eskin, Elias, et al.
Published: (2024)
GenerationPrograms: Fine-grained Attribution with Executable Programs
by: Wan, David, et al.
Published: (2025)
CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
by: Pothiraj, Atin, et al.
Published: (2025)
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
by: Khan, Zaid, et al.
Published: (2025)
Are language models rational? The case of coherence norms and belief revision
by: Hofweber, Thomas, et al.
Published: (2024)
Retrieval-Augmented Generation with Conflicting Evidence
by: Wang, Han, et al.
Published: (2025)
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
by: Khan, Zaid, et al.
Published: (2024)
Skill-Based Mixture-of-Experts: Adaptive Routing for Heterogeneous Reasoning via Inferred Skills
by: Chen, Justin Chih-Yao, et al.
Published: (2025)
AVSD: Adaptive-View Self-Distillation by Balancing Consensus and Teacher-Specific Privileged Signals
by: Nguyen, Duy, et al.
Published: (2026)
RotBench: Evaluating Multimodal Large Language Models on Identifying Image Rotation
by: Niu, Tianyi, et al.
Published: (2025)
GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
by: Nguyen, Duy, et al.
Published: (2025)
Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty
by: Singh, Joykirat, et al.
Published: (2026)
MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments
by: Wang, Han, et al.
Published: (2026)
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
by: Wang, Han, et al.
Published: (2024)
Gistify! Codebase-Level Understanding via Runtime Execution
by: Lee, Hyunji, et al.
Published: (2025)
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
by: Wan, David, et al.
Published: (2024)
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression
by: Xiao, Hanqi, et al.
Published: (2025)
Conflict-Resolving and Sharpness-Aware Minimization for Generalized Knowledge Editing with Multiple Updates
by: Nguyen, Duy, et al.
Published: (2026)
Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?
by: Hase, Peter, et al.
Published: (2024)
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
by: Yu, Shoubin, et al.
Published: (2024)
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding
by: Ananthram, Amith, et al.
Published: (2024)
One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration
by: Khan, Zaid, et al.
Published: (2025)
System-1.x: Learning to Balance Fast and Slow Planning with Language Models
by: Saha, Swarnadeep, et al.
Published: (2024)
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
by: Chen, Justin Chih-Yao, et al.
Published: (2024)
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
by: Nguyen, Duy, et al.
Published: (2024)
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
by: Yu, Shoubin, et al.
Published: (2025)