| Main Authors: | Piao, Shengmin; Park, Sanghyun |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.08024 |
Similar Items
SpiralThinker: Latent Reasoning through an Iterative Process with Text-Latent Interleaving
by: Piao, Shengmin, et al.
Published: (2025)
GeneralThinker: Domain-General Reasoning through Likelihood-Guided Answer-Conditioned Optimization
by: Piao, Shengmin, et al.
Published: (2026)
LitE-SQL: A Lightweight and Efficient Text-to-SQL Framework with Vector-based Schema Linking and Execution-Guided Self-Correction
by: Piao, Shengmin, et al.
Published: (2025)
C2F-Thinker: Coarse-to-Fine Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis
by: Luo, Miaosen, et al.
Published: (2026)
Enhancing Long-Chain Reasoning Distillation through Error-Aware Self-Reflection
by: Wu, Zhuoyang, et al.
Published: (2025)
Learning to Retrieve and Reason on Knowledge Graph through Active Self-Reflection
by: Zhang, Han, et al.
Published: (2025)
Self-Knowledge Distillation for Learning Ambiguity
by: Park, Hancheol, et al.
Published: (2024)
SituatedThinker: Grounding LLM Reasoning with Real-World through Situated Thinking
by: Liu, Junnan, et al.
Published: (2025)
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
by: Wang, Hao, et al.
Published: (2026)
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
by: Xu, Sen, et al.
Published: (2025)
ProxyThinker: Test-Time Guidance through Small Visual Reasoners
by: Xiao, Zilin, et al.
Published: (2025)
KAG-Thinker: Interactive Thinking and Deep Reasoning in LLMs via Knowledge-Augmented Generation
by: Zhang, Dalong, et al.
Published: (2025)
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation
by: Ji, Yuelyu, et al.
Published: (2024)
Taming the Thinker: Conditional Entropy Shaping for Adaptive LLM Reasoning
by: Wei, Shuyu, et al.
Published: (2026)
ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning across Domains
by: Zhao, Ziqi, et al.
Published: (2026)
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
by: Wang, Danqing, et al.
Published: (2024)
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
by: Chen, Justin Chih-Yao, et al.
Published: (2024)
Propulsion: Steering LLM with Tiny Fine-Tuning
by: Kowsher, Md, et al.
Published: (2024)
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
by: Lee, Kyungjae, et al.
Published: (2024)
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
by: Green, Tommaso, et al.
Published: (2025)
The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
by: Aghajohari, Milad, et al.
Published: (2025)
Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
by: Zhu, Jiajun, et al.
Published: (2025)
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
by: Lv, Qitan, et al.
Published: (2024)
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning
by: Yan, Hanqi, et al.
Published: (2024)
LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)
Doc-V*: Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
by: Zheng, Yuanlei, et al.
Published: (2026)
Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving
by: Park, Sanghyun, et al.
Published: (2025)
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)
CFMS: A Coarse-to-Fine Multimodal Synthesis Framework for Enhanced Tabular Reasoning
by: Huang, Qixian, et al.
Published: (2026)
Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models
by: Xing, Wang, et al.
Published: (2026)
LightThinker++: From Reasoning Compression to Memory Management
by: Zhu, Yuqi, et al.
Published: (2026)
MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization
by: Xin, Haidong, et al.
Published: (2026)
Crosslingual On-Policy Self-Distillation for Multilingual Reasoning
by: Liu, Yihong, et al.
Published: (2026)
SmartThinker: Learning to Compress and Preserve Reasoning by Step-Level Length Control
by: He, Xingyang, et al.
Published: (2025)
Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning
by: Yang, Xuewei, et al.
Published: (2026)
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)
Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations
by: Park, Sihyun
Published: (2025)
SKIntern: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models
by: Liao, Huanxuan, et al.
Published: (2024)
WebThinker: Empowering Large Reasoning Models with Deep Research Capability
by: Li, Xiaoxi, et al.
Published: (2025)
Internalizing Tool Knowledge in Small Language Models via QLoRA Fine-Tuning
by: Shemla, Yuval, et al.
Published: (2026)