| Main Authors: | Chung, Jui-Hui, Lin, Hongzhou, Jiang, Lai, Tang, Shange, Jin, Chi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08388 |
Similar Items
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving
by: Lin, Yong, et al.
Published: (2025)
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
by: Lin, Yong, et al.
Published: (2025)
Goedel-Code-Prover: Hierarchical Proof Search for Open State-of-the-Art Code Verification
by: Li, Zenan, et al.
Published: (2026)
Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities
by: Zhao, Haoyu, et al.
Published: (2025)
Is Elo Rating Reliable? A Study Under Model Misspecification
by: Tang, Shange, et al.
Published: (2025)
REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning
by: Shen, Ziju, et al.
Published: (2025)
SorryDB: Can AI Provers Complete Real-World Lean Theorems?
by: Letson, Austin, et al.
Published: (2026)
Principled Out-of-Distribution Generalization via Simplicity
by: Ge, Jiawei, et al.
Published: (2025)
Benign Overfitting in Out-of-Distribution Generalization of Linear Models
by: Tang, Shange, et al.
Published: (2024)
Inference-Time Diversity in RL-Trained Lean Theorem Provers: A Diagnostic Study
by: Burton, Zachary
Published: (2026)
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
by: Wang, Jianing, et al.
Published: (2026)
DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent
by: Zhang, Youyuan, et al.
Published: (2026)
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics
by: Liu, Junqi, et al.
Published: (2026)
Theorem Prover as a Judge for Synthetic Data Generation
by: Leang, Joshua Ong Jun, et al.
Published: (2025)
SoK: Agentic Skills -- Beyond Tool Use in LLM Agents
by: Jiang, Yanna, et al.
Published: (2026)
TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
by: He, Pengfei, et al.
Published: (2025)
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
by: Jiang, Dongfu, et al.
Published: (2025)
6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management
by: Chen, Jiao, et al.
Published: (2026)
ToolRM: Towards Agentic Tool-Use Reward Modeling
by: Li, Renhao, et al.
Published: (2025)
Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning
by: Xu, Siyuan, et al.
Published: (2026)
Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents
by: Zhang, Kaituo, et al.
Published: (2026)
MCPVerse: An Expansive, Real-World Benchmark for Agentic Tool Use
by: Lei, Fei, et al.
Published: (2025)
Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs
by: Baba, Kaito, et al.
Published: (2025)
How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use
by: Lin, Minhua, et al.
Published: (2026)
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)
LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)
A Comprehensive Survey of the Lean 4 Theorem Prover: Architecture, Applications, and Advances
by: Tang, Xichen
Published: (2025)
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
by: Deng, Mengjie, et al.
Published: (2025)
On the Goedel's formula
by: Ferreira, Jailton C.
Published: (2001)
Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics
by: Breen, Benjamin, et al.
Published: (2025)
Lean-auto: An Interface between Lean 4 and Automated Theorem Provers
by: Qian, Yicheng, et al.
Published: (2025)
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
by: Liang, Yijuan, et al.
Published: (2026)
On the Day They Experience: Awakening Self-Sovereign Experiential AI Agents
by: Hu, Botao Amber, et al.
Published: (2025)
Understanding Tool-Augmented Agents for Lean Formalization: A Factorial Analysis
by: Zhang, Ke, et al.
Published: (2026)
Teaching "Foundations of Mathematics" with the Lean Theorem Prover
by: Bottoni, Mattia Luciano, et al.
Published: (2025)
VitalAgent: A Tool-Augmented Agent for Reactive and Proactive Physiological Monitoring over Wearable Health Data
by: Zhu, Di, et al.
Published: (2026)
Clarifying Before Reasoning: A Coq Prover with Structural Context
by: Lu, Yanzhen, et al.
Published: (2025)
Budget-Aware Tool-Use Enables Effective Agent Scaling
by: Liu, Tengxiao, et al.
Published: (2025)
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models
by: Yan, Shilin, et al.
Published: (2026)
ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation
by: Liu, Yinuo, et al.
Published: (2026)