:: Library Catalog

Bibliographic Details
Main Authors:	Feng, Wanyong; Tran, Peter; Sireci, Stephen; Lan, Andrew
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2503.08551

Similar Items

From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics
by: Lee, Jaewook, et al.
Published: (2025)

MCQ Difficulty Prediction via Modeling Learner Heterogeneity Using Data-Driven Cognitive Profiling
by: Krishnan, Dhriti, et al.
Published: (2026)

RADAR: Reasoning-Ability and Difficulty-Aware Routing for Reasoning LLMs
by: Fernandez, Nigel, et al.
Published: (2025)

Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys
by: Ye, Zikun, et al.
Published: (2026)

Metric assessment protocol in the context of answer fluctuation on MCQ tasks
by: Goliakova, Ekaterina, et al.
Published: (2025)

AutoMCQ -- Automatically Generate Code Comprehension Questions using GenAI
by: Goodfellow, Martin, et al.
Published: (2025)

LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ
by: Allard, Marc-Antoine, et al.
Published: (2024)

HS-STaR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation
by: Xiong, Feng, et al.
Published: (2025)

ABench-Physics: Benchmarking Physical Reasoning in LLMs via High-Difficulty and Dynamic Physics Problems
by: Zhang, Yiming, et al.
Published: (2025)

Optimizing Reasoning Efficiency through Prompt Difficulty Prediction
by: Zhao, Bo, et al.
Published: (2025)

Interpretable Difficulty-Aware Knowledge Tracing in Tutor-Student Dialogues
by: Huang, Shuyan, et al.
Published: (2026)

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
by: Huang, Shijue, et al.
Published: (2025)

DIFFUMA: High-Fidelity Spatio-Temporal Video Prediction via Dual-Path Mamba and Diffusion Enhancement
by: Xie, Xinyu, et al.
Published: (2025)

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction
by: Li, Ming, et al.
Published: (2025)

Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?
by: Qu, Yun, et al.
Published: (2025)

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
by: Kong, Deyang, et al.
Published: (2025)

Zero-shot Graph Reasoning via Retrieval Augmented Framework with LLMs
by: Li, Hanqing, et al.
Published: (2025)

QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation
by: Li, Jiazheng, et al.
Published: (2025)

NTRL: Encounter Generation via Reinforcement Learning for Dynamic Difficulty Adjustment in Dungeons and Dragons
by: Romeo, Carlo, et al.
Published: (2025)

Tailored Teaching with Balanced Difficulty: Elevating Reasoning in Multimodal Chain-of-Thought via Prompt Curriculum
by: Yang, Xinglong, et al.
Published: (2025)

VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL
by: Hu, Zengjie, et al.
Published: (2025)

Temporal Sampling for Forgotten Reasoning in LLMs
by: Li, Yuetai, et al.
Published: (2025)

Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning
by: Wan, Qian, et al.
Published: (2026)

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
by: Wang, Xukai, et al.
Published: (2025)

Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting
by: Wu, Yifan, et al.
Published: (2025)

Difficulty Estimation and Simplification of French Text Using LLMs
by: Jamet, Henri, et al.
Published: (2024)

Perception Graph for Cognitive Attack Reasoning in Augmented Reality
by: Chen, Rongqian, et al.
Published: (2025)

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
by: Bae, Sanghwan, et al.
Published: (2025)

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
by: Bansal, Hritik, et al.
Published: (2024)

DART: Difficulty-Adaptive Reasoning Truncation for Efficient Large Language Models
by: Zhang, Ruofan, et al.
Published: (2025)

KAG-Thinker: Interactive Thinking and Deep Reasoning in LLMs via Knowledge-Augmented Generation
by: Zhang, Dalong, et al.
Published: (2025)

The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation
by: Lan, Yifan, et al.
Published: (2026)

Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
by: Wang, Xinglin, et al.
Published: (2024)

RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)

Reinforcing Numerical Reasoning in LLMs for Tabular Prediction via Structural Priors
by: Cai, Pengxiang, et al.
Published: (2025)

DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
by: Shen, Yi, et al.
Published: (2025)

Beyond MCQ: An Open-Ended Arabic Cultural QA Benchmark with Dialect Variants
by: Bhatti, Hunzalah Hassan, et al.
Published: (2025)

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage
by: Gao, Haowen, et al.
Published: (2026)

Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty
by: Cho, Yeseul, et al.
Published: (2025)

From Reasoning to Generalization: Knowledge-Augmented LLMs for ARC Benchmark
by: Lei, Chao, et al.
Published: (2025)