:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhou, Zhi, Miao, Sirui, Duan, Xiangyu, Yang, Hao, Zhang, Min
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2507.19741
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
by: Wang, Hao, et al.
Published: (2026)

Quantification of Large Language Model Distillation
by: Lee, Sunbowen, et al.
Published: (2025)

Automated Item Neutralization for Non-Cognitive Scales: A Large Language Model Approach to Reducing Social-Desirability Bias
by: Wu, Sirui, et al.
Published: (2025)

BoRP: Bootstrapped Regression Probing for Scalable and Human-Aligned LLM Evaluation
by: Sun, Peng, et al.
Published: (2026)

LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation
by: Wang, Keheng, et al.
Published: (2024)

FRAME: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
by: Zhang, Xuemiao, et al.
Published: (2025)

Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models
by: Gu, Yanggan, et al.
Published: (2025)

Improving Latent Reasoning in LLMs via Soft Concept Mixing
by: Wang, Kang, et al.
Published: (2025)

AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration
by: Huang, Minjiang, et al.
Published: (2025)

MiniPLM: Knowledge Distillation for Pre-Training Language Models
by: Gu, Yuxian, et al.
Published: (2024)

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
by: Xu, Liangyu, et al.
Published: (2025)

Enhancing LLMs via High-Knowledge Data Selection
by: Duan, Feiyu, et al.
Published: (2025)

Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data
by: Zhang, Xuemiao, et al.
Published: (2025)

RuPLaR : Efficient Latent Compression of LLM Reasoning Chains with Rule-Based Priors From Multi-Step to One-Step
by: Luo, Xiaocheng, et al.
Published: (2026)

CKD-EHR:Clinical Knowledge Distillation for Electronic Health Records
by: Wang, Junke, et al.
Published: (2025)

In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning
by: Duan, Yifei, et al.
Published: (2024)

Self-Distilled RLVR
by: Yang, Chenxu, et al.
Published: (2026)

RM-Distiller: Exploiting Generative LLM for Reward Model Distillation
by: Zhou, Hongli, et al.
Published: (2026)

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation
by: Liu, Kaiyuan, et al.
Published: (2025)

Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation
by: Huang, Chenyang, et al.
Published: (2025)

Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors
by: Wen, Pengcheng, et al.
Published: (2026)

Put Teacher in Student's Shoes: Cross-Distillation for Ultra-compact Model Compression Framework
by: Wang, Maolin, et al.
Published: (2025)

Distilling Rule-based Knowledge into Large Language Models
by: Yang, Wenkai, et al.
Published: (2023)

AdaSwitch: Balancing Exploration and Guidance in Knowledge Distillation via Adaptive Switching
by: Peng, Jingyu, et al.
Published: (2025)

First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1
by: Wu, Xiangyu, et al.
Published: (2024)

COLA-GEC: A Bidirectional Framework for Enhancing Grammatical Acceptability and Error Correction
by: Yang, Xiangyu, et al.
Published: (2025)

EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
by: Zhang, Hao, et al.
Published: (2026)

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs
by: Wen, Pengcheng, et al.
Published: (2025)

Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness
by: Chen, Sirui, et al.
Published: (2025)

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
by: Fan, Yue, et al.
Published: (2024)

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
by: Wang, Yuanyi, et al.
Published: (2025)

Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation
by: Jia, Pengyue, et al.
Published: (2024)

Unveil: Unified Visual-Textual Integration and Distillation for Multi-modal Document Retrieval
by: Sun, Hao, et al.
Published: (2026)

Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models
by: Rao, Jun, et al.
Published: (2024)

Near-Policy: Accelerating On-Policy Distillation via Asynchronous Generation and Selective Packing
by: Rang, Miao, et al.
Published: (2026)

Redundancy Principles for MLLMs Benchmarks
by: Zhang, Zicheng, et al.
Published: (2025)

Slow Tuning and Low-Entropy Masking for Safe Chain-of-Thought Distillation
by: Ma, Ziyang, et al.
Published: (2025)

Growing Through Experience: Scaling Episodic Grounding in Language Models
by: Zhang, Chunhui, et al.
Published: (2025)

FinAnchor: Aligned Multi-Model Representations for Financial Prediction
by: He, Zirui, et al.
Published: (2026)

Keypoint-based Progressive Chain-of-Thought Distillation for LLMs
by: Feng, Kaituo, et al.
Published: (2024)