| Main Authors: | Yu, Xiaodong, Zhou, Ben, Cheng, Hao, Roth, Dan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.19056 |
Similar Items
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
by: Yu, Xiaodong, et al.
Published: (2023)
SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
by: He, Hangfeng, et al.
Published: (2023)
Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?
by: Li, Bangzheng, et al.
Published: (2023)
VAR-MATH: Probing True Mathematical Reasoning in LLMS via Symbolic Multi-Instance Benchmarks
by: Yao, Jian, et al.
Published: (2025)
An Investigation of Robustness of LLMs in Mathematical Reasoning: Benchmarking with Mathematically-Equivalent Transformation of Advanced Mathematical Problems
by: Hao, Yuren, et al.
Published: (2025)
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
by: Mirzadeh, Iman, et al.
Published: (2024)
Advanced Weakly-Supervised Formula Exploration for Neuro-Symbolic Mathematical Reasoning
by: Wu, Yuxuan, et al.
Published: (2025)
Beyond Accuracy: Evaluating Strategy Diversity in LLM Mathematical Reasoning
by: Yang, Xia, et al.
Published: (2026)
Neuro-Symbolic Data Generation for Math Reasoning
by: Li, Zenan, et al.
Published: (2024)
Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning
by: Lu, Leo, et al.
Published: (2025)
From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks
by: Stephan, Andreas, et al.
Published: (2024)
Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation
by: Lu, Shuo, et al.
Published: (2026)
CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
by: Gao, Zijun, et al.
Published: (2025)
A Survey of Multimodal Mathematical Reasoning: From Perception, Alignment to Reasoning
by: Yang, Tianyu, et al.
Published: (2026)
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models
by: Li, Ming, et al.
Published: (2025)
No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
by: Rajgaria, Abhishek, et al.
Published: (2025)
Compositional Neuro-Symbolic Reasoning
by: Das, Anugyan, et al.
Published: (2026)
RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
by: Xu, Xinnuo, et al.
Published: (2025)
Adaptive Selection of Symbolic Languages for Improving LLM Logical Reasoning
by: Wang, Xiangyu, et al.
Published: (2025)
Weaver: Interweaving SQL and LLM for Table Reasoning
by: Khoja, Rohit, et al.
Published: (2025)
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
by: Zhang, Xuan, et al.
Published: (2025)
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
by: Liu, Licheng, et al.
Published: (2025)
Making Mathematical Reasoning Adaptive
by: Lai, Zhejian, et al.
Published: (2025)
What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
by: Zhao, Yuze, et al.
Published: (2026)
RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing
by: Mejri, Mohamed, et al.
Published: (2024)
Evaluating Robustness of Reward Models for Mathematical Reasoning
by: Kim, Sunghwan, et al.
Published: (2024)
Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering
by: Zhang, Qingru, et al.
Published: (2024)
A Survey on Large Language Models for Mathematical Reasoning
by: Wang, Peng-Yuan, et al.
Published: (2025)
CoTEvol: Self-Evolving Chain-of-Thoughts for Data Synthesis in Mathematical Reasoning
by: Wang, Zhuo, et al.
Published: (2026)
Language Models as Inductive Reasoners
by: Yang, Zonglin, et al.
Published: (2022)
AXIOM: A Trust-First Neuro-Symbolic Execution Architecture for Verifiable Mathematical Reasoning
by: Bruno, Alessio
Published: (2026)
ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis
by: Naik, Atharva, et al.
Published: (2026)
Challenging Mathematical Problems Designed to Evaluate Advanced AI Reasoning
by: Tan, Kwan Hong
Published: (2025)
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
by: Glazer, Elliot, et al.
Published: (2024)
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
by: Lu, Pan, et al.
Published: (2023)
Efficient Rectification of Neuro-Symbolic Reasoning Inconsistencies by Abductive Reflection
by: Hu, Wen-Chao, et al.
Published: (2024)
Unmasking Reasoning Processes: A Process-aware Benchmark for Evaluating Structural Mathematical Reasoning in LLMs
by: Zheng, Xiang, et al.
Published: (2026)
A Neuro-Symbolic Framework for Reasoning under Perceptual Uncertainty: Bridging Continuous Perception and Discrete Symbolic Planning
by: Wu, Jiahao, et al.
Published: (2025)
Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning
by: Faiz, Mohd Anwar Jamal
Published: (2025)
Evaluating Strategic Reasoning in Forecasting Agents
by: Liptay, Tom, et al.
Published: (2026)