| Main Authors: | Jiang, Yuxuan; Ferraro, Francis |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03555 |
Similar Items
Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
by: Jiang, Yuxuan, et al.
Published: (2026)
Experiments or Outcomes? Probing Scientific Feasibility in Large Language Models
by: Mohammadi, Seyedali, et al.
Published: (2026)
DecomposeRL: Learning to Ask Useful, Informative, and Diverse Questions for Semi-Supervised, Traceable Claim Verification
by: Dipta, Shubhashis Roy, et al.
Published: (2026)
WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions
by: Mohammadi, Seyedali, et al.
Published: (2024)
Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
by: Xu, Ningning, et al.
Published: (2025)
The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment
by: Yu, Shasha, et al.
Published: (2026)
Context Structure Reshapes the Representational Geometry of Language Models
by: Hosseini, Eghbal A., et al.
Published: (2026)
Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising
by: Tavanaei, Amir, et al.
Published: (2024)
Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning
by: Zhang, Shaokun, et al.
Published: (2025)
Investigation of Factorized Optical Flows as Mid-Level Representations
by: Yang, Hsuan-Kung, et al.
Published: (2022)
AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents
by: Fan, Shengda, et al.
Published: (2026)
Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository
by: Deshpande, Ajinkya, et al.
Published: (2024)
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs
by: Huang, Yuxuan
Published: (2023)
Classifying German Language Proficiency Levels Using Large Language Models
by: Ahlers, Elias-Leander, et al.
Published: (2025)
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
by: Amirizaniani, Maryam, et al.
Published: (2024)
SDCD: Structure-Disrupted Contrastive Decoding for Mitigating Hallucinations in Large Vision-Language Models
by: Xia, Yuxuan, et al.
Published: (2026)
Advanced Weakly-Supervised Formula Exploration for Neuro-Symbolic Mathematical Reasoning
by: Wu, Yuxuan, et al.
Published: (2025)
AutoTool: Efficient Tool Selection for Large Language Model Agents
by: Jia, Jingyi, et al.
Published: (2025)
Orca: Enhancing Role-Playing Abilities of Large Language Models by Integrating Personality Traits
by: Huang, Yuxuan
Published: (2024)
Spontaneous Giving and Calculated Greed in Language Models
by: Li, Yuxuan, et al.
Published: (2025)
Mid-Training with Self-Generated Data Improves Reinforcement Learning in Language Models
by: RRV, Aswin, et al.
Published: (2026)
ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling
by: Lin, Jianghao, et al.
Published: (2025)
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
by: Ye, Junjie, et al.
Published: (2024)
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
by: Wang, Yuxuan, et al.
Published: (2025)
LASER: Language Model Regression for Semi-Structured Workflow Resource and Runtime Estimation
by: Yin, Yuxuan, et al.
Published: (2025)
Assessing The Potential Of Mid-Sized Language Models For Clinical QA
by: Bolton, Elliot, et al.
Published: (2024)
PRISM: A Transformer-based Language Model of Structured Clinical Event Data
by: Levine, Lionel, et al.
Published: (2025)
VIVID-Med: LLM-Supervised Structured Pretraining for Deployable Medical ViTs
by: Wang, Xiyao, et al.
Published: (2026)
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
by: Luo, Yuxuan, et al.
Published: (2024)
AGENT: An Aerial Vehicle Generation and Design Tool Using Large Language Models
by: Samplawski, Colin, et al.
Published: (2025)
LLAssist: Simple Tools for Automating Literature Review Using Large Language Models
by: Haryanto, Christoforus Yoga
Published: (2024)
Cross-Language Bias Examination in Large Language Models
by: Liang, Yuxuan, et al.
Published: (2025)
RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models
by: Chen, Yuxuan, et al.
Published: (2025)
DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
by: He, Yang, et al.
Published: (2026)
FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response
by: Shichman, Mollie, et al.
Published: (2025)
Are Large Language Models Useful for Time Series Data Analysis?
by: Tang, Francis, et al.
Published: (2024)
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph
by: Liu, Xukun, et al.
Published: (2024)
Thoth: Mid-Training Bridges LLMs to Time Series Understanding
by: Lin, Jiafeng, et al.
Published: (2026)
SCRIBE: Structured Chain Reasoning for Interactive Behaviour Explanations using Tool Calling
by: Fawzi, Fares, et al.
Published: (2025)
Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs
by: Jiang, Yuxuan, et al.
Published: (2024)