:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yu, Tong, Cheng, Lei, Khalitov, Ruslan, Olsson, Erland Brandser, Yang, Zhirong
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2405.08538
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Predicting the Order of Upcoming Tokens Improves Language Modeling
by: Zuhri, Zayd M. K., et al.
Published: (2025)

Annealing Self-Distillation Rectification Improves Adversarial Training
by: Wu, Yu-Yu, et al.
Published: (2023)

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation
by: Shen, Guobin, et al.
Published: (2026)

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
by: Zuhri, Zayd M. K., et al.
Published: (2025)

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)

DNA Sequence Classification with Compressors
by: Ozan, Şükrü
Published: (2024)

Privacy Risks in Time Series Forecasting: User- and Record-Level Membership Inference
by: Johansson, Nicolas, et al.
Published: (2025)

Improving Generative Adversarial Networks with Self-Distillation
by: Nowinowski, Antoni, et al.
Published: (2026)

Deep Semantic Inference over the Air: An Efficient Task-Oriented Communication System
by: Wang, Chenyang, et al.
Published: (2025)

State Diversity Matters in Offline Behavior Distillation
by: Lei, Shiye, et al.
Published: (2025)

Protein Language Model Embeddings Improve Generalization of Implicit Transfer Operators
by: Antoniadis, Panagiotis, et al.
Published: (2026)

Self-Distilled RLVR
by: Yang, Chenxu, et al.
Published: (2026)

Can Large Reasoning Models Self-Train?
by: Shafayat, Sheikh, et al.
Published: (2025)

Improving Constrained Language Generation via Self-Distilled Twisted Sequential Monte Carlo
by: Kim, Sooyeon, et al.
Published: (2025)

In Search of Lost DNA Sequence Pretraining
by: Tang, Zhijiang, et al.
Published: (2026)

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery
by: Xin, Meng, et al.
Published: (2026)

HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling
by: Yang, Hexiong, et al.
Published: (2025)

Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning
by: Yang, Xuewei, et al.
Published: (2026)

UrbanAI 2025 Challenge: Linear vs Transformer Models for Long-Horizon Exogenous Temperature Forecasting
by: Gokhman, Ruslan
Published: (2025)

Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
by: Qiao, Lifeng, et al.
Published: (2024)

Online Variational Sequential Monte Carlo
by: Mastrototaro, Alessandro, et al.
Published: (2023)

Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
by: Duan, Xintong, et al.
Published: (2025)

Language Models for Controllable DNA Sequence Design
by: Su, Xingyu, et al.
Published: (2025)

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)

Dirichlet Flow Matching with Applications to DNA Sequence Design
by: Stark, Hannes, et al.
Published: (2024)

Learning Critically: Selective Self Distillation in Federated Learning on Non-IID Data
by: He, Yuting, et al.
Published: (2025)

Reducing the Safety Tax in LLM Safety Alignment with On-Policy Self-Distillation
by: Fu, Yu, et al.
Published: (2026)

RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design
by: Kshirsagar, Meghana, et al.
Published: (2026)

Score $\times$ Decoder: A Unified View of Unsupervised Inference-Time Scaling for Hallucination Mitigation
by: Cheng, Yun-Chen, et al.
Published: (2026)

Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference
by: Cheng, Wei, et al.
Published: (2024)

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
by: Bercovich, Akhiad, et al.
Published: (2024)

HollowFlow: Efficient Sample Likelihood Evaluation using Hollow Message Passing
by: Gloy, Johann Flemming, et al.
Published: (2025)

Membership Inference Attacks on Sequence Models
by: Rossi, Lorenzo, et al.
Published: (2025)

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
by: Shen, Guobin, et al.
Published: (2026)

DiReDi: Distillation and Reverse Distillation for AIoT Applications
by: Sun, Chen, et al.
Published: (2024)

JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation
by: Cheng, Tiancong, et al.
Published: (2025)

TIDE: Temporal Incremental Draft Engine for Self-Improving LLM Inference
by: Park, Jiyoung, et al.
Published: (2026)

Controlling dynamics of stochastic systems with deep reinforcement learning
by: Mukhamadiarov, Ruslan
Published: (2025)

Adversarial Dual On-Policy Distillation from Expressive Teacher
by: Wan, Zhenglin, et al.
Published: (2026)