:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chung, Jui-Hui, Lin, Hongzhou, Jiang, Lai, Tang, Shange, Jin, Chi
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.08388
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving
by: Lin, Yong, et al.
Published: (2025)

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
by: Lin, Yong, et al.
Published: (2025)

Goedel-Code-Prover: Hierarchical Proof Search for Open State-of-the-Art Code Verification
by: Li, Zenan, et al.
Published: (2026)

Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities
by: Zhao, Haoyu, et al.
Published: (2025)

Is Elo Rating Reliable? A Study Under Model Misspecification
by: Tang, Shange, et al.
Published: (2025)

REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning
by: Shen, Ziju, et al.
Published: (2025)

SorryDB: Can AI Provers Complete Real-World Lean Theorems?
by: Letson, Austin, et al.
Published: (2026)

Principled Out-of-Distribution Generalization via Simplicity
by: Ge, Jiawei, et al.
Published: (2025)

Benign Overfitting in Out-of-Distribution Generalization of Linear Models
by: Tang, Shange, et al.
Published: (2024)

Inference-Time Diversity in RL-Trained Lean Theorem Provers: A Diagnostic Study
by: Burton, Zachary
Published: (2026)

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
by: Wang, Jianing, et al.
Published: (2026)

DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent
by: Zhang, Youyuan, et al.
Published: (2026)

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics
by: Liu, Junqi, et al.
Published: (2026)

Theorem Prover as a Judge for Synthetic Data Generation
by: Leang, Joshua Ong Jun, et al.
Published: (2025)

SoK: Agentic Skills -- Beyond Tool Use in LLM Agents
by: Jiang, Yanna, et al.
Published: (2026)

TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
by: He, Pengfei, et al.
Published: (2025)

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
by: Jiang, Dongfu, et al.
Published: (2025)

6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management
by: Chen, Jiao, et al.
Published: (2026)

ToolRM: Towards Agentic Tool-Use Reward Modeling
by: Li, Renhao, et al.
Published: (2025)

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning
by: Xu, Siyuan, et al.
Published: (2026)

Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents
by: Zhang, Kaituo, et al.
Published: (2026)

MCPVerse: An Expansive, Real-World Benchmark for Agentic Tool Use
by: Lei, Fei, et al.
Published: (2025)

Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs
by: Baba, Kaito, et al.
Published: (2025)

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use
by: Lin, Minhua, et al.
Published: (2026)

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)

LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)

A Comprehensive Survey of the Lean 4 Theorem Prover: Architecture, Applications, and Advances
by: Tang, Xichen
Published: (2025)

ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
by: Deng, Mengjie, et al.
Published: (2025)

On the Goedel's formula
by: Ferreira, Jailton C.
Published: (2001)

Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics
by: Breen, Benjamin, et al.
Published: (2025)

Lean-auto: An Interface between Lean 4 and Automated Theorem Provers
by: Qian, Yicheng, et al.
Published: (2025)

UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
by: Liang, Yijuan, et al.
Published: (2026)

On the Day They Experience: Awakening Self-Sovereign Experiential AI Agents
by: Hu, Botao Amber, et al.
Published: (2025)

Understanding Tool-Augmented Agents for Lean Formalization: A Factorial Analysis
by: Zhang, Ke, et al.
Published: (2026)

Teaching "Foundations of Mathematics" with the Lean Theorem Prover
by: Bottoni, Mattia Luciano, et al.
Published: (2025)

VitalAgent: A Tool-Augmented Agent for Reactive and Proactive Physiological Monitoring over Wearable Health Data
by: Zhu, Di, et al.
Published: (2026)

Clarifying Before Reasoning: A Coq Prover with Structural Context
by: Lu, Yanzhen, et al.
Published: (2025)

Budget-Aware Tool-Use Enables Effective Agent Scaling
by: Liu, Tengxiao, et al.
Published: (2025)

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models
by: Yan, Shilin, et al.
Published: (2026)

ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation
by: Liu, Yinuo, et al.
Published: (2026)