| Main Authors: | Yu, Tong; Cheng, Lei; Khalitov, Ruslan; Olsson, Erland Brandser; Yang, Zhirong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.08538 |
Similar Items
Predicting the Order of Upcoming Tokens Improves Language Modeling
by: Zuhri, Zayd M. K., et al.
Published: (2025)
Annealing Self-Distillation Rectification Improves Adversarial Training
by: Wu, Yu-Yu, et al.
Published: (2023)
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)
From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation
by: Shen, Guobin, et al.
Published: (2026)
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
by: Zuhri, Zayd M. K., et al.
Published: (2025)
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)
DNA Sequence Classification with Compressors
by: Ozan, Şükrü
Published: (2024)
Privacy Risks in Time Series Forecasting: User- and Record-Level Membership Inference
by: Johansson, Nicolas, et al.
Published: (2025)
Improving Generative Adversarial Networks with Self-Distillation
by: Nowinowski, Antoni, et al.
Published: (2026)
Deep Semantic Inference over the Air: An Efficient Task-Oriented Communication System
by: Wang, Chenyang, et al.
Published: (2025)
State Diversity Matters in Offline Behavior Distillation
by: Lei, Shiye, et al.
Published: (2025)
Protein Language Model Embeddings Improve Generalization of Implicit Transfer Operators
by: Antoniadis, Panagiotis, et al.
Published: (2026)
Self-Distilled RLVR
by: Yang, Chenxu, et al.
Published: (2026)
Can Large Reasoning Models Self-Train?
by: Shafayat, Sheikh, et al.
Published: (2025)
Improving Constrained Language Generation via Self-Distilled Twisted Sequential Monte Carlo
by: Kim, Sooyeon, et al.
Published: (2025)
In Search of Lost DNA Sequence Pretraining
by: Tang, Zhijiang, et al.
Published: (2026)
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery
by: Xin, Meng, et al.
Published: (2026)
HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling
by: Yang, Hexiong, et al.
Published: (2025)
Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning
by: Yang, Xuewei, et al.
Published: (2026)
UrbanAI 2025 Challenge: Linear vs Transformer Models for Long-Horizon Exogenous Temperature Forecasting
by: Gokhman, Ruslan
Published: (2025)
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
by: Qiao, Lifeng, et al.
Published: (2024)
Online Variational Sequential Monte Carlo
by: Mastrototaro, Alessandro, et al.
Published: (2023)
Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
by: Duan, Xintong, et al.
Published: (2025)
Language Models for Controllable DNA Sequence Design
by: Su, Xingyu, et al.
Published: (2025)
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)
Dirichlet Flow Matching with Applications to DNA Sequence Design
by: Stark, Hannes, et al.
Published: (2024)
Learning Critically: Selective Self Distillation in Federated Learning on Non-IID Data
by: He, Yuting, et al.
Published: (2025)
Reducing the Safety Tax in LLM Safety Alignment with On-Policy Self-Distillation
by: Fu, Yu, et al.
Published: (2026)
RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design
by: Kshirsagar, Meghana, et al.
Published: (2026)
Score $\times$ Decoder: A Unified View of Unsupervised Inference-Time Scaling for Hallucination Mitigation
by: Cheng, Yun-Chen, et al.
Published: (2026)
Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference
by: Cheng, Wei, et al.
Published: (2024)
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
by: Bercovich, Akhiad, et al.
Published: (2024)
HollowFlow: Efficient Sample Likelihood Evaluation using Hollow Message Passing
by: Gloy, Johann Flemming, et al.
Published: (2025)
Membership Inference Attacks on Sequence Models
by: Rossi, Lorenzo, et al.
Published: (2025)
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
by: Shen, Guobin, et al.
Published: (2026)
DiReDi: Distillation and Reverse Distillation for AIoT Applications
by: Sun, Chen, et al.
Published: (2024)
JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation
by: Cheng, Tiancong, et al.
Published: (2025)
TIDE: Temporal Incremental Draft Engine for Self-Improving LLM Inference
by: Park, Jiyoung, et al.
Published: (2026)
Controlling dynamics of stochastic systems with deep reinforcement learning
by: Mukhamadiarov, Ruslan
Published: (2025)
Adversarial Dual On-Policy Distillation from Expressive Teacher
by: Wan, Zhenglin, et al.
Published: (2026)