Saved in:
Bibliographic Details
Main Authors: Zhou, Binglin, Zhong, Linhao, Chen, Wentao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.13007
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909113057804288
author Zhou, Binglin
Zhong, Linhao
Chen, Wentao
author_facet Zhou, Binglin
Zhong, Linhao
Chen, Wentao
contents Dataset distillation, a pragmatic approach in machine learning, aims to create a smaller synthetic dataset from a larger existing dataset. However, existing distillation methods primarily adopt a model-based paradigm, where the synthetic dataset inherits model-specific biases, limiting its generalizability to alternative models. In response to this constraint, we propose a novel methodology termed "model pool". This approach involves selecting models from a diverse model pool based on a specific probability distribution during the data distillation process. Additionally, we integrate our model pool with the established knowledge distillation approach and apply knowledge distillation to the test process of the distilled dataset. Our experimental results validate the effectiveness of the model pool approach across a range of existing models while testing, demonstrating superior performance compared to existing methodologies.
format Preprint
id arxiv_https___arxiv_org_abs_2402_13007
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Improve Cross-Architecture Generalization on Dataset Distillation
Zhou, Binglin
Zhong, Linhao
Chen, Wentao
Machine Learning
Computer Vision and Pattern Recognition
Dataset distillation, a pragmatic approach in machine learning, aims to create a smaller synthetic dataset from a larger existing dataset. However, existing distillation methods primarily adopt a model-based paradigm, where the synthetic dataset inherits model-specific biases, limiting its generalizability to alternative models. In response to this constraint, we propose a novel methodology termed "model pool". This approach involves selecting models from a diverse model pool based on a specific probability distribution during the data distillation process. Additionally, we integrate our model pool with the established knowledge distillation approach and apply knowledge distillation to the test process of the distilled dataset. Our experimental results validate the effectiveness of the model pool approach across a range of existing models while testing, demonstrating superior performance compared to existing methodologies.
title Improve Cross-Architecture Generalization on Dataset Distillation
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.13007