| Main Authors: | Zhao, Wanjia; Yuksekgonul, Mert; Wu, Shirley; Zou, James |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.04780 |
Similar Items
Inefficiencies of Meta Agents for Agent Design
by: El, Batu, et al.
Published: (2025)
Cost-of-Pass: An Economic Framework for Evaluating Language Models
by: Erol, Mehmet Hamza, et al.
Published: (2025)
Diversity of Thought Improves Reasoning Abilities of LLMs
by: Naik, Ranjita, et al.
Published: (2023)
ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs
by: Zhao, Wanjia, et al.
Published: (2026)
TextGrad: Automatic "Differentiation" via Text
by: Yuksekgonul, Mert, et al.
Published: (2024)
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis
by: Bianchi, Federico, et al.
Published: (2024)
On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
by: Zou, Deyu, et al.
Published: (2026)
GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters
by: Zhao, Wanjia, et al.
Published: (2025)
Can LLM feedback enhance review quality? A randomized study of 20K reviews at ICLR 2025
by: Thakkar, Nitya, et al.
Published: (2025)
Sparse Reward Subsystem in Large Language Models
by: Xu, Guowei, et al.
Published: (2026)
Learning to Discover at Test Time
by: Yuksekgonul, Mert, et al.
Published: (2026)
Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
by: Wang, Haoyu, et al.
Published: (2024)
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning
by: Shi, Dachuan, et al.
Published: (2026)
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
by: Yuksekgonul, Mert, et al.
Published: (2023)
The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
by: Sun, Yutao, et al.
Published: (2025)
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
by: Xu, Ran, et al.
Published: (2025)
Multi-agent Self-triage System with Medical Flowcharts
by: Liu, Yujia, et al.
Published: (2025)
Self-Supervised Bootstrapping of Action-Predictive Embodied Reasoning
by: Ganai, Milan, et al.
Published: (2026)
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
by: Gao, Hang, et al.
Published: (2025)
Attention Bootstrapping for Multi-Modal Test-Time Adaptation
by: Zhao, Yusheng, et al.
Published: (2025)
LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback
by: Banerjee, Tanushree, et al.
Published: (2024)
metaTextGrad: Automatically optimizing language model optimizers
by: Xu, Guowei, et al.
Published: (2025)
Reasoning Curriculum: Bootstrapping Broad LLM Reasoning from Math
by: Pang, Bo, et al.
Published: (2025)
AgentPSO: Evolving Agent Reasoning Skill via Multi-agent Particle Swarm Optimization
by: Hwang, Hyunmin, et al.
Published: (2026)
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
by: Yuan, Huining, et al.
Published: (2025)
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
by: Ulmer, Dennis, et al.
Published: (2024)
QuantAgents: Towards Multi-agent Financial System via Simulated Trading
by: Li, Xiangyu, et al.
Published: (2025)
h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning
by: Motwani, Sumeet Ramesh, et al.
Published: (2025)
Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling
by: Huang, Zijie, et al.
Published: (2024)
Boosting LLM Reasoning via Spontaneous Self-Correction
by: Zhao, Xutong, et al.
Published: (2025)
BOOST: Bootstrapping Strategy-Driven Reasoning Programs for Program-Guided Fact-Checking
by: Hu, Qisheng, et al.
Published: (2025)
ReasonOps: Operator Segmentation for LLM Reasoning Traces
by: Lee, Daniel, et al.
Published: (2026)
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
by: Guo, Zikang, et al.
Published: (2025)
Single-agent or Multi-agent Systems? Why Not Both?
by: Gao, Mingyan, et al.
Published: (2025)
PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks
by: Chang, Matthew, et al.
Published: (2024)
MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning
by: Xi, Ningyuan, et al.
Published: (2024)
DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning
by: Zou, Zhuoyang, et al.
Published: (2026)
Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
by: Yang, Jinrong, et al.
Published: (2025)
Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework
by: Wan, Xiangpeng, et al.
Published: (2024)
Visual Attention Reasoning via Hierarchical Search and Self-Verification
by: Cai, Wei, et al.
Published: (2025)