:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Wanjia, Yuksekgonul, Mert, Wu, Shirley, Zou, James
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.04780
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Inefficiencies of Meta Agents for Agent Design
by: El, Batu, et al.
Published: (2025)

Cost-of-Pass: An Economic Framework for Evaluating Language Models
by: Erol, Mehmet Hamza, et al.
Published: (2025)

Diversity of Thought Improves Reasoning Abilities of LLMs
by: Naik, Ranjita, et al.
Published: (2023)

ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs
by: Zhao, Wanjia, et al.
Published: (2026)

TextGrad: Automatic "Differentiation" via Text
by: Yuksekgonul, Mert, et al.
Published: (2024)

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis
by: Bianchi, Federico, et al.
Published: (2024)

On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
by: Zou, Deyu, et al.
Published: (2026)

GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters
by: Zhao, Wanjia, et al.
Published: (2025)

Can LLM feedback enhance review quality? A randomized study of 20K reviews at ICLR 2025
by: Thakkar, Nitya, et al.
Published: (2025)

Sparse Reward Subsystem in Large Language Models
by: Xu, Guowei, et al.
Published: (2026)

Learning to Discover at Test Time
by: Yuksekgonul, Mert, et al.
Published: (2026)

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
by: Wang, Haoyu, et al.
Published: (2024)

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning
by: Shi, Dachuan, et al.
Published: (2026)

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
by: Yuksekgonul, Mert, et al.
Published: (2023)

The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
by: Sun, Yutao, et al.
Published: (2025)

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
by: Xu, Ran, et al.
Published: (2025)

Multi-agent Self-triage System with Medical Flowcharts
by: Liu, Yujia, et al.
Published: (2025)

Self-Supervised Bootstrapping of Action-Predictive Embodied Reasoning
by: Ganai, Milan, et al.
Published: (2026)

Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
by: Gao, Hang, et al.
Published: (2025)

Attention Bootstrapping for Multi-Modal Test-Time Adaptation
by: Zhao, Yusheng, et al.
Published: (2025)

LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback
by: Banerjee, Tanushree, et al.
Published: (2024)

metaTextGrad: Automatically optimizing language model optimizers
by: Xu, Guowei, et al.
Published: (2025)

Reasoning Curriculum: Bootstrapping Broad LLM Reasoning from Math
by: Pang, Bo, et al.
Published: (2025)

AgentPSO: Evolving Agent Reasoning Skill via Multi-agent Particle Swarm Optimization
by: Hwang, Hyunmin, et al.
Published: (2026)

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
by: Yuan, Huining, et al.
Published: (2025)

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
by: Ulmer, Dennis, et al.
Published: (2024)

QuantAgents: Towards Multi-agent Financial System via Simulated Trading
by: Li, Xiangyu, et al.
Published: (2025)

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
by: Motwani, Sumeet Ramesh, et al.
Published: (2025)

Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling
by: Huang, Zijie, et al.
Published: (2024)

Boosting LLM Reasoning via Spontaneous Self-Correction
by: Zhao, Xutong, et al.
Published: (2025)

BOOST: Bootstrapping Strategy-Driven Reasoning Programs for Program-Guided Fact-Checking
by: Hu, Qisheng, et al.
Published: (2025)

ReasonOps: Operator Segmentation for LLM Reasoning Traces
by: Lee, Daniel, et al.
Published: (2026)

MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
by: Guo, Zikang, et al.
Published: (2025)

Single-agent or Multi-agent Systems? Why Not Both?
by: Gao, Mingyan, et al.
Published: (2025)

PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks
by: Chang, Matthew, et al.
Published: (2024)

MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning
by: Xi, Ningyuan, et al.
Published: (2024)

DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning
by: Zou, Zhuoyang, et al.
Published: (2026)

Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
by: Yang, Jinrong, et al.
Published: (2025)

Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework
by: Wan, Xiangpeng, et al.
Published: (2024)

Visual Attention Reasoning via Hierarchical Search and Self-Verification
by: Cai, Wei, et al.
Published: (2025)