:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Piao, Shengmin, Park, Sanghyun
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2412.08024
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SpiralThinker: Latent Reasoning through an Iterative Process with Text-Latent Interleaving
by: Piao, Shengmin, et al.
Published: (2025)

GeneralThinker: Domain-General Reasoning through Likelihood-Guided Answer-Conditioned Optimization
by: Piao, Shengmin, et al.
Published: (2026)

LitE-SQL: A Lightweight and Efficient Text-to-SQL Framework with Vector-based Schema Linking and Execution-Guided Self-Correction
by: Piao, Shengmin, et al.
Published: (2025)

C2F-Thinker: Coarse-to-Fine Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2026)

Enhancing Long-Chain Reasoning Distillation through Error-Aware Self-Reflection
by: Wu, Zhuoyang, et al.
Published: (2025)

Learning to Retrieve and Reason on Knowledge Graph through Active Self-Reflection
by: Zhang, Han, et al.
Published: (2025)

Self-Knowledge Distillation for Learning Ambiguity
by: Park, Hancheol, et al.
Published: (2024)

SituatedThinker: Grounding LLM Reasoning with Real-World through Situated Thinking
by: Liu, Junnan, et al.
Published: (2025)

Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
by: Wang, Hao, et al.
Published: (2026)

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
by: Xu, Sen, et al.
Published: (2025)

ProxyThinker: Test-Time Guidance through Small Visual Reasoners
by: Xiao, Zilin, et al.
Published: (2025)

KAG-Thinker: Interactive Thinking and Deep Reasoning in LLMs via Knowledge-Augmented Generation
by: Zhang, Dalong, et al.
Published: (2025)

ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation
by: Ji, Yuelyu, et al.
Published: (2024)

Taming the Thinker: Conditional Entropy Shaping for Adaptive LLM Reasoning
by: Wei, Shuyu, et al.
Published: (2026)

ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning across Domains
by: Zhao, Ziqi, et al.
Published: (2026)

TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
by: Wang, Danqing, et al.
Published: (2024)

MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
by: Chen, Justin Chih-Yao, et al.
Published: (2024)

Propulsion: Steering LLM with Tiny Fine-Tuning
by: Kowsher, Md, et al.
Published: (2024)

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
by: Lee, Kyungjae, et al.
Published: (2024)

Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
by: Green, Tommaso, et al.
Published: (2025)

The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
by: Aghajohari, Milad, et al.
Published: (2025)

Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
by: Zhu, Jiajun, et al.
Published: (2025)

Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
by: Lv, Qitan, et al.
Published: (2024)

Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning
by: Yan, Hanqi, et al.
Published: (2024)

LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
by: Zheng, Yuanlei, et al.
Published: (2026)

Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving
by: Park, Sanghyun, et al.
Published: (2025)

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)

CFMS: A Coarse-to-Fine Multimodal Synthesis Framework for Enhanced Tabular Reasoning
by: Huang, Qixian, et al.
Published: (2026)

Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models
by: Xing, Wang, et al.
Published: (2026)

LightThinker++: From Reasoning Compression to Memory Management
by: Zhu, Yuqi, et al.
Published: (2026)

MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization
by: Xin, Haidong, et al.
Published: (2026)

Crosslingual On-Policy Self-Distillation for Multilingual Reasoning
by: Liu, Yihong, et al.
Published: (2026)

SmartThinker: Learning to Compress and Preserve Reasoning by Step-Level Length Control
by: He, Xingyang, et al.
Published: (2025)

Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning
by: Yang, Xuewei, et al.
Published: (2026)

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)

Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
by: Park, Sihyun
Published: (2025)

$\textit{SKIntern}$: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models
by: Liao, Huanxuan, et al.
Published: (2024)

WebThinker: Empowering Large Reasoning Models with Deep Research Capability
by: Li, Xiaoxi, et al.
Published: (2025)

Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning
by: Shemla, Yuval, et al.
Published: (2026)