Farhat, S., & Chen, D. (2024). On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models.
Chicago Style (17th ed.) Citation: Farhat, Sean, and Deming Chen. On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models. 2024.
MLA (9th ed.) Citation: Farhat, Sean, and Deming Chen. On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models. 2024.