Saved in:
| Main Authors: | Belkhiter, Yannis, Tirupathi, Seshu, Zizzo, Giulio, Kelleher, John D. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.14332 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TRACES: Tagging Reasoning Steps for Adaptive Cost-Efficient Early-Stopping
by: Belkhiter, Yannis, et al.
Published: (2026)
by: Belkhiter, Yannis, et al.
Published: (2026)
Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling and Agentic Models
by: Belkhiter, Yannis, et al.
Published: (2026)
by: Belkhiter, Yannis, et al.
Published: (2026)
Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets
by: Belkhiter, Yannis, et al.
Published: (2025)
by: Belkhiter, Yannis, et al.
Published: (2025)
Dynamic Features Adaptation in Networking: Toward Flexible training and Explainable inference
by: Belkhiter, Yannis, et al.
Published: (2025)
by: Belkhiter, Yannis, et al.
Published: (2025)
HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment
by: Belkhiter, Yannis, et al.
Published: (2024)
by: Belkhiter, Yannis, et al.
Published: (2024)
Interpreting LLM-as-a-Judge Policies via Verifiable Global Explanations
by: Gajcin, Jasmina, et al.
Published: (2025)
by: Gajcin, Jasmina, et al.
Published: (2025)
Knowledge-Augmented Reasoning for EUAIA Compliance and Adversarial Robustness of LLMs
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)
Domain Adaptation for Time series Transformers using One-step fine-tuning
by: Khanal, Subina, et al.
Published: (2024)
by: Khanal, Subina, et al.
Published: (2024)
GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models
by: Tirupathi, Seshu, et al.
Published: (2025)
by: Tirupathi, Seshu, et al.
Published: (2025)
Do LLMs Really Think Step-by-step In Implicit Reasoning?
by: Yu, Yijiong
Published: (2024)
by: Yu, Yijiong
Published: (2024)
Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models
by: Wang, Teng, et al.
Published: (2025)
by: Wang, Teng, et al.
Published: (2025)
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
by: Hao, Shibo, et al.
Published: (2024)
by: Hao, Shibo, et al.
Published: (2024)
Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness
by: Peng, Xiao, et al.
Published: (2024)
by: Peng, Xiao, et al.
Published: (2024)
Exploring LLM Reasoning Through Controlled Prompt Variations
by: Chatziveroglou, Giannis, et al.
Published: (2025)
by: Chatziveroglou, Giannis, et al.
Published: (2025)
The Impact of Reasoning Step Length on Large Language Models
by: Jin, Mingyu, et al.
Published: (2024)
by: Jin, Mingyu, et al.
Published: (2024)
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models
by: Patel, Nisarg, et al.
Published: (2024)
by: Patel, Nisarg, et al.
Published: (2024)
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
by: Liu, Yuliang, et al.
Published: (2025)
by: Liu, Yuliang, et al.
Published: (2025)
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
by: Qiao, Ziqing, et al.
Published: (2025)
by: Qiao, Ziqing, et al.
Published: (2025)
Large Language Model for Discrete Optimization Problems: Evaluation and Step-by-step Reasoning
by: Qian, Tianhao, et al.
Published: (2026)
by: Qian, Tianhao, et al.
Published: (2026)
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
by: Lee, Hojae, et al.
Published: (2024)
by: Lee, Hojae, et al.
Published: (2024)
ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification
by: Fan, Ziqing, et al.
Published: (2025)
by: Fan, Ziqing, et al.
Published: (2025)
Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models
by: Gu, Xiaojie, et al.
Published: (2026)
by: Gu, Xiaojie, et al.
Published: (2026)
Towards More Accurate US Presidential Election via Multi-step Reasoning with Large Language Models
by: Yu, Chenxiao, et al.
Published: (2024)
by: Yu, Chenxiao, et al.
Published: (2024)
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics
by: Wei, Ting-Ruen, et al.
Published: (2025)
by: Wei, Ting-Ruen, et al.
Published: (2025)
Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models
by: Zheng, Zi'ou, et al.
Published: (2024)
by: Zheng, Zi'ou, et al.
Published: (2024)
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization
by: Ji, Kaixuan, et al.
Published: (2024)
by: Ji, Kaixuan, et al.
Published: (2024)
STEPER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models
by: Lee, Kyumin, et al.
Published: (2025)
by: Lee, Kyumin, et al.
Published: (2025)
Fine-Tuned Language Models for Domain-Specific Summarization and Tagging
by: Wang, Jun, et al.
Published: (2025)
by: Wang, Jun, et al.
Published: (2025)
Towards Retrieval Augmented Generation over Large Video Libraries
by: Tevissen, Yannis, et al.
Published: (2024)
by: Tevissen, Yannis, et al.
Published: (2024)
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
by: Yang, Xiao-Wen, et al.
Published: (2025)
by: Yang, Xiao-Wen, et al.
Published: (2025)
Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts
by: Ranaldi, Leonardo, et al.
Published: (2023)
by: Ranaldi, Leonardo, et al.
Published: (2023)
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task
by: Yan, Yuchen, et al.
Published: (2025)
by: Yan, Yuchen, et al.
Published: (2025)
Knowledge Tagging with Large Language Model based Multi-Agent System
by: Li, Hang, et al.
Published: (2024)
by: Li, Hang, et al.
Published: (2024)
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
by: Zhang, Beichen, et al.
Published: (2025)
by: Zhang, Beichen, et al.
Published: (2025)
Reasoning Towards Fairness: Mitigating Bias in Language Models through Reasoning-Guided Fine-Tuning
by: Kabra, Sanchit, et al.
Published: (2025)
by: Kabra, Sanchit, et al.
Published: (2025)
Multi-Step Reasoning with Large Language Models, a Survey
by: Plaat, Aske, et al.
Published: (2024)
by: Plaat, Aske, et al.
Published: (2024)
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
by: Xu, Fengli, et al.
Published: (2025)
by: Xu, Fengli, et al.
Published: (2025)
Toward Mechanistic Explanation of Deductive Reasoning in Language Models
by: Maltoni, Davide, et al.
Published: (2025)
by: Maltoni, Davide, et al.
Published: (2025)
Timo: Towards Better Temporal Reasoning for Language Models
by: Su, Zhaochen, et al.
Published: (2024)
by: Su, Zhaochen, et al.
Published: (2024)
Opening the Black Box: A Survey on the Mechanisms of Multi-Step Reasoning in Large Language Models
by: Pan, Liangming, et al.
Published: (2026)
by: Pan, Liangming, et al.
Published: (2026)
Similar Items
-
TRACES: Tagging Reasoning Steps for Adaptive Cost-Efficient Early-Stopping
by: Belkhiter, Yannis, et al.
Published: (2026) -
Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling and Agentic Models
by: Belkhiter, Yannis, et al.
Published: (2026) -
Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets
by: Belkhiter, Yannis, et al.
Published: (2025) -
Dynamic Features Adaptation in Networking: Toward Flexible training and Explainable inference
by: Belkhiter, Yannis, et al.
Published: (2025) -
HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment
by: Belkhiter, Yannis, et al.
Published: (2024)