Saved in:
| Main Authors: | Pawar, Pranav, Jain, Dhwaj, Gupta, Varun, Dedhia, Kaustav, Kale, Dashrath, Dhekane, Sudhir |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.07238 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Thou Shalt Not Prompt: Zero-Shot Human Activity Recognition in Smart Homes via Language Modeling of Sensor Data & Activities
by: Dhekane, Sourish Gunesh, et al.
Published: (2025)
by: Dhekane, Sourish Gunesh, et al.
Published: (2025)
Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models
by: Pawar, Pranav, et al.
Published: (2025)
by: Pawar, Pranav, et al.
Published: (2025)
Learning to Trust the Crowd: A Multi-Model Consensus Reasoning Engine for Large Language Models
by: Kallem, Pranav
Published: (2026)
by: Kallem, Pranav
Published: (2026)
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models
by: Islam, Shayekh Bin, et al.
Published: (2024)
by: Islam, Shayekh Bin, et al.
Published: (2024)
Model Ensembling for Constrained Optimization
by: Globus-Harris, Ira, et al.
Published: (2024)
by: Globus-Harris, Ira, et al.
Published: (2024)
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
by: Mirzadeh, Iman, et al.
Published: (2024)
by: Mirzadeh, Iman, et al.
Published: (2024)
CAMA: Enhancing Mathematical Reasoning in Large Language Models with Causal Knowledge
by: Zan, Lei, et al.
Published: (2025)
by: Zan, Lei, et al.
Published: (2025)
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models
by: Yu, Zhouliang, et al.
Published: (2025)
by: Yu, Zhouliang, et al.
Published: (2025)
Multi-Source Knowledge-Based Hybrid Neural Framework for Time Series Representation Learning
by: Sakhinana, Sagar Srinivas, et al.
Published: (2024)
by: Sakhinana, Sagar Srinivas, et al.
Published: (2024)
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
by: Shao, Zhihong, et al.
Published: (2024)
by: Shao, Zhihong, et al.
Published: (2024)
I-RAVEN-X: Benchmarking Generalization and Robustness of Analogical and Mathematical Reasoning in Large Language and Reasoning Models
by: Camposampiero, Giacomo, et al.
Published: (2025)
by: Camposampiero, Giacomo, et al.
Published: (2025)
Stepwise Self-Consistent Mathematical Reasoning with Large Language Models
by: Zhao, Zilong, et al.
Published: (2024)
by: Zhao, Zilong, et al.
Published: (2024)
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
by: Jiang, Weisen, et al.
Published: (2023)
by: Jiang, Weisen, et al.
Published: (2023)
Scaling Strategy, Not Compute: A Stand-Alone, Open-Source StarCraft II Benchmark for Accessible Reinforcement Learning Research
by: Panda, Sourav, et al.
Published: (2026)
by: Panda, Sourav, et al.
Published: (2026)
Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning
by: Jain, Shreyansh, et al.
Published: (2025)
by: Jain, Shreyansh, et al.
Published: (2025)
Deploying Open-Source Large Language Models: A performance Analysis
by: Bendi-Ouis, Yannis, et al.
Published: (2024)
by: Bendi-Ouis, Yannis, et al.
Published: (2024)
AttackQA: Development and Adoption of a Dataset for Assisting Cybersecurity Operations using Fine-tuned and Open-Source LLMs
by: Krishna, Varun Badrinath
Published: (2024)
by: Krishna, Varun Badrinath
Published: (2024)
PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models
by: Yu, Ye, et al.
Published: (2025)
by: Yu, Ye, et al.
Published: (2025)
LOLA -- An Open-Source Massively Multilingual Large Language Model
by: Srivastava, Nikit, et al.
Published: (2024)
by: Srivastava, Nikit, et al.
Published: (2024)
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
by: Singh, Joykirat, et al.
Published: (2025)
by: Singh, Joykirat, et al.
Published: (2025)
LEPO: Latent Reasoning Policy Optimization for Large Language Models
by: Zhou, Yuyan, et al.
Published: (2026)
by: Zhou, Yuyan, et al.
Published: (2026)
Can Large Language Models Reason and Optimize Under Constraints?
by: Bernier, Fabien, et al.
Published: (2026)
by: Bernier, Fabien, et al.
Published: (2026)
Adaptive Acquisition Selection for Bayesian Optimization with Large Language Models
by: Ngo, Giang, et al.
Published: (2026)
by: Ngo, Giang, et al.
Published: (2026)
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare
by: Acikgoz, Emre Can, et al.
Published: (2024)
by: Acikgoz, Emre Can, et al.
Published: (2024)
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models
by: Sui, Yuan, et al.
Published: (2025)
by: Sui, Yuan, et al.
Published: (2025)
PARAMANU-GANITA: Can Small Math Language Models Rival with Large Language Models on Mathematical Reasoning?
by: Niyogi, Mitodru, et al.
Published: (2024)
by: Niyogi, Mitodru, et al.
Published: (2024)
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models
by: Li, Ming, et al.
Published: (2025)
by: Li, Ming, et al.
Published: (2025)
Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
by: Aswani, Krishna, et al.
Published: (2024)
by: Aswani, Krishna, et al.
Published: (2024)
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
by: Li, Xiaoyuan, et al.
Published: (2024)
by: Li, Xiaoyuan, et al.
Published: (2024)
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models
by: Zhao, Xueliang, et al.
Published: (2025)
by: Zhao, Xueliang, et al.
Published: (2025)
Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models
by: Qiu, Yifu, et al.
Published: (2025)
by: Qiu, Yifu, et al.
Published: (2025)
MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning
by: Lin, Yunze
Published: (2025)
by: Lin, Yunze
Published: (2025)
Sociodemographic Bias in Language Models: A Survey and Forward Path
by: Gupta, Vipul, et al.
Published: (2023)
by: Gupta, Vipul, et al.
Published: (2023)
DISPO: Enhancing Training Efficiency and Stability in Reinforcement Learning for Large Language Model Mathematical Reasoning
by: Karaman, Batuhan K., et al.
Published: (2026)
by: Karaman, Batuhan K., et al.
Published: (2026)
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
by: Zeng, Liang, et al.
Published: (2024)
by: Zeng, Liang, et al.
Published: (2024)
CogAtom: From Cognitive Atoms to Olympiad-level Mathematical Reasoning in Large Language Models
by: Chen, Zhuofan, et al.
Published: (2025)
by: Chen, Zhuofan, et al.
Published: (2025)
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
by: Luo, Haipeng, et al.
Published: (2023)
by: Luo, Haipeng, et al.
Published: (2023)
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
by: Luo, Haipeng, et al.
Published: (2025)
by: Luo, Haipeng, et al.
Published: (2025)
Scaling Properties of Continuous Diffusion Spoken Language Models
by: Ramapuram, Jason, et al.
Published: (2026)
by: Ramapuram, Jason, et al.
Published: (2026)
Can Large Reasoning Models Improve Accuracy on Mathematical Tasks Using Flawed Thinking?
by: Amjith, Saraswathy, et al.
Published: (2025)
by: Amjith, Saraswathy, et al.
Published: (2025)
Similar Items
-
Thou Shalt Not Prompt: Zero-Shot Human Activity Recognition in Smart Homes via Language Modeling of Sensor Data & Activities
by: Dhekane, Sourish Gunesh, et al.
Published: (2025) -
Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models
by: Pawar, Pranav, et al.
Published: (2025) -
Learning to Trust the Crowd: A Multi-Model Consensus Reasoning Engine for Large Language Models
by: Kallem, Pranav
Published: (2026) -
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models
by: Islam, Shayekh Bin, et al.
Published: (2024) -
Model Ensembling for Constrained Optimization
by: Globus-Harris, Ira, et al.
Published: (2024)