| Main Authors: | Parashar, Jayant, Bhandarkar, Suchendra M. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2606.00532 |
Similar Items
KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?
by: Saha, Soumadeep, et al.
Published: (2025)
PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model
by: Zhang, Sinin, et al.
Published: (2026)
InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
by: Zhang, Bo-Wen, et al.
Published: (2024)
Evaluating Relational Reasoning in LLMs with REL
by: Fesser, Lukas, et al.
Published: (2026)
HIP Network: Historical Information Passing Network for Extrapolation Reasoning on Temporal Knowledge Graph
by: He, Yongquan, et al.
Published: (2024)
Active Context Compression: Autonomous Memory Management in LLM Agents
by: Verma, Nikhil
Published: (2026)
SCULPT: Constraint-Guided Pruned MCTS that Carves Efficient Paths for Mathematical Reasoning
by: Fang, Qitong, et al.
Published: (2026)
Temporal Knowledge Question Answering via Abstract Reasoning Induction
by: Chen, Ziyang, et al.
Published: (2023)
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning
by: Chang, Edward Y., et al.
Published: (2025)
Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval
by: Bayarri-Planas, Jordi, et al.
Published: (2024)
Thinking Machines: Mathematical Reasoning in the Age of LLMs
by: Asperti, Andrea, et al.
Published: (2025)
RAudit: A Blind Auditing Protocol for Large Language Model Reasoning
by: Chang, Edward Y., et al.
Published: (2026)
From Extraction to Synthesis: Entangled Heuristics for Agent-Augmented Strategic Reasoning
by: Ghisellini, Renato, et al.
Published: (2025)
RADD: Retrieval-Augmented Discrete Diffusion for Multi-Modal Knowledge Graph Completion
by: Niu, Guanglin, et al.
Published: (2026)
Enhancing Mental Health Counseling Support in Bangladesh using Culturally-Grounded Knowledge
by: Hasan, Md Arid, et al.
Published: (2026)
A Survey of Task-Oriented Knowledge Graph Reasoning: Status, Applications, and Prospects
by: Niu, Guanglin, et al.
Published: (2025)
MapAgent: A Hierarchical Agent for Geospatial Reasoning with Dynamic Map Tool Integration
by: Hasan, Md Hasebul, et al.
Published: (2025)
A Fuzzy Logic Prompting Framework for Large Language Models in Adaptive and Uncertain Tasks
by: Figueiredo, Vanessa
Published: (2025)
The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning
by: Chang, Edward Y., et al.
Published: (2025)
Planning vs Reasoning: Ablations to Test Capabilities of LoRA layers
by: Redkar, Neel
Published: (2024)
Automatic Prompt Optimization for Knowledge Graph Construction: Insights from an Empirical Study
by: Mihindukulasooriya, Nandana, et al.
Published: (2025)
Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment
by: Su, Ruoxi, et al.
Published: (2026)
An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs
by: Rai, Daking, et al.
Published: (2024)
AI Predicts AGI: Leveraging AGI Forecasting and Peer Review to Explore LLMs' Complex Reasoning Capabilities
by: Davide, Fabrizio, et al.
Published: (2024)
ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation
by: Taveekitworachai, Pittawat, et al.
Published: (2024)
CausalT5K: Diagnosing and Informing Refusal for Trustworthy Causal Reasoning of Skepticism, Sycophancy, Detection-Correction, and Rung Collapse
by: Geng, Longling, et al.
Published: (2026)
BitCal-TTS: Bit-Calibrated Test-Time Scaling for Quantized Reasoning Models
by: Patarlapalli, Sai Babu, et al.
Published: (2026)
Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework
by: Preuveneers, Jack, et al.
Published: (2025)
Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models
by: Kurtic, Eldar, et al.
Published: (2024)
UA-Legal-Bench: A Benchmark for Evaluating Large Language Models on Ukrainian Legal Reasoning
by: Ovcharov, Volodymyr
Published: (2026)
A Benchmark for Audio Reasoning Capabilities of Multimodal Large Language Models
by: Christop, Iwona, et al.
Published: (2026)
Red Teaming for Large Language Models At Scale: Tackling Hallucinations on Mathematics Tasks
by: Buszydlik, Aleksander, et al.
Published: (2023)
Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs
by: Paulsen, Norman
Published: (2025)
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)
LLM-Based SQL Generation: Prompting, Self-Refinement, and Adaptive Weighted Majority Voting
by: Yang, Yu-Jie, et al.
Published: (2026)
From Guessing to Asking: An Approach to Resolving the Persona Knowledge Gap in LLMs during Multi-Turn Conversations
by: Baskar, Sarvesh, et al.
Published: (2025)
Beyond Greenfield: The D3 Framework for AI-Driven Productivity in Brownfield Engineering
by: Sharma, Krishna Kumaar
Published: (2025)
Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12
by: del Campo, Iñaki, et al.
Published: (2026)