:: Library Catalog

Bibliographic Details
Main Authors:	Yu, Xiaodong; Zhou, Ben; Cheng, Hao; Roth, Dan
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.19056

Similar Items

ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
by: Yu, Xiaodong, et al.
Published: (2023)

SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
by: He, Hangfeng, et al.
Published: (2023)

Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?
by: Li, Bangzheng, et al.
Published: (2023)

VAR-MATH: Probing True Mathematical Reasoning in LLMs via Symbolic Multi-Instance Benchmarks
by: Yao, Jian, et al.
Published: (2025)

An Investigation of Robustness of LLMs in Mathematical Reasoning: Benchmarking with Mathematically-Equivalent Transformation of Advanced Mathematical Problems
by: Hao, Yuren, et al.
Published: (2025)

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
by: Mirzadeh, Iman, et al.
Published: (2024)

Advanced Weakly-Supervised Formula Exploration for Neuro-Symbolic Mathematical Reasoning
by: Wu, Yuxuan, et al.
Published: (2025)

Beyond Accuracy: Evaluating Strategy Diversity in LLM Mathematical Reasoning
by: Yang, Xia, et al.
Published: (2026)

Neuro-Symbolic Data Generation for Math Reasoning
by: Li, Zenan, et al.
Published: (2024)

Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning
by: Lu, Leo, et al.
Published: (2025)

From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks
by: Stephan, Andreas, et al.
Published: (2024)

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation
by: Lu, Shuo, et al.
Published: (2026)

CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
by: Gao, Zijun, et al.
Published: (2025)

A Survey of Multimodal Mathematical Reasoning: From Perception, Alignment to Reasoning
by: Yang, Tianyu, et al.
Published: (2026)

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models
by: Li, Ming, et al.
Published: (2025)

No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
by: Rajgaria, Abhishek, et al.
Published: (2025)

Compositional Neuro-Symbolic Reasoning
by: Das, Anugyan, et al.
Published: (2026)

RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
by: Xu, Xinnuo, et al.
Published: (2025)

Adaptive Selection of Symbolic Languages for Improving LLM Logical Reasoning
by: Wang, Xiangyu, et al.
Published: (2025)

Weaver: Interweaving SQL and LLM for Table Reasoning
by: Khoja, Rohit, et al.
Published: (2025)

Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
by: Zhang, Xuan, et al.
Published: (2025)

A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
by: Liu, Licheng, et al.
Published: (2025)

Making Mathematical Reasoning Adaptive
by: Lai, Zhejian, et al.
Published: (2025)

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
by: Zhao, Yuze, et al.
Published: (2026)

RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing
by: Mejri, Mohamed, et al.
Published: (2024)

Evaluating Robustness of Reward Models for Mathematical Reasoning
by: Kim, Sunghwan, et al.
Published: (2024)

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering
by: Zhang, Qingru, et al.
Published: (2024)

A Survey on Large Language Models for Mathematical Reasoning
by: Wang, Peng-Yuan, et al.
Published: (2025)

CoTEvol: Self-Evolving Chain-of-Thoughts for Data Synthesis in Mathematical Reasoning
by: Wang, Zhuo, et al.
Published: (2026)

Language Models as Inductive Reasoners
by: Yang, Zonglin, et al.
Published: (2022)

AXIOM: A Trust-First Neuro-Symbolic Execution Architecture for Verifiable Mathematical Reasoning
by: Bruno, Alessio
Published: (2026)

ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis
by: Naik, Atharva, et al.
Published: (2026)

Challenging Mathematical Problems Designed to Evaluate Advanced AI Reasoning
by: Tan, Kwan Hong
Published: (2025)

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
by: Glazer, Elliot, et al.
Published: (2024)

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
by: Lu, Pan, et al.
Published: (2023)

Efficient Rectification of Neuro-Symbolic Reasoning Inconsistencies by Abductive Reflection
by: Hu, Wen-Chao, et al.
Published: (2024)

Unmasking Reasoning Processes: A Process-aware Benchmark for Evaluating Structural Mathematical Reasoning in LLMs
by: Zheng, Xiang, et al.
Published: (2026)

A Neuro-Symbolic Framework for Reasoning under Perceptual Uncertainty: Bridging Continuous Perception and Discrete Symbolic Planning
by: Wu, Jiahao, et al.
Published: (2025)

Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning
by: Faiz, Mohd Anwar Jamal
Published: (2025)

Evaluating Strategic Reasoning in Forecasting Agents
by: Liptay, Tom, et al.
Published: (2026)