Enregistré dans:
Détails bibliographiques
Auteurs principaux: Xie, Jiacheng, Zhang, Ziyang, Poudel, Biplab, Guo, Congyu, Yu, Yang, An, Guanghui, Tang, Xiaoting, Zhao, Lening, Xu, Chunhui, Xu, Dong
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2508.14932
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866918128216178688
author Xie, Jiacheng
Zhang, Ziyang
Poudel, Biplab
Guo, Congyu
Yu, Yang
An, Guanghui
Tang, Xiaoting
Zhao, Lening
Xu, Chunhui
Xu, Dong
author_facet Xie, Jiacheng
Zhang, Ziyang
Poudel, Biplab
Guo, Congyu
Yu, Yang
An, Guanghui
Tang, Xiaoting
Zhao, Lening
Xu, Chunhui
Xu, Dong
contents Tongue imaging serves as a valuable diagnostic tool, particularly in Traditional Chinese Medicine (TCM). The quality of tongue surface segmentation significantly affects the accuracy of tongue image classification and subsequent diagnosis in intelligent tongue diagnosis systems. However, existing research on tongue image segmentation faces notable limitations, and there is a lack of robust and user-friendly segmentation tools. This paper proposes a tongue image segmentation model (TOM) based on multi-teacher knowledge distillation. By incorporating a novel diffusion-based data augmentation method, we enhanced the generalization ability of the segmentation model while reducing its parameter size. Notably, after reducing the parameter count by 96.6% compared to the teacher models, the student model still achieves an impressive segmentation performance of 95.22% mIoU. Furthermore, we packaged and deployed the trained model as both an online and offline segmentation tool (available at https://itongue.cn/), allowing TCM practitioners and researchers to use it without any programming experience. We also present a case study on TCM constitution classification using segmented tongue patches. Experimental results demonstrate that training with tongue patches yields higher classification performance and better interpretability than original tongue images. To our knowledge, this is the first open-source and freely available tongue image segmentation tool.
format Preprint
id arxiv_https___arxiv_org_abs_2508_14932
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation
Xie, Jiacheng
Zhang, Ziyang
Poudel, Biplab
Guo, Congyu
Yu, Yang
An, Guanghui
Tang, Xiaoting
Zhao, Lening
Xu, Chunhui
Xu, Dong
Image and Video Processing
Artificial Intelligence
Quantitative Methods
Tongue imaging serves as a valuable diagnostic tool, particularly in Traditional Chinese Medicine (TCM). The quality of tongue surface segmentation significantly affects the accuracy of tongue image classification and subsequent diagnosis in intelligent tongue diagnosis systems. However, existing research on tongue image segmentation faces notable limitations, and there is a lack of robust and user-friendly segmentation tools. This paper proposes a tongue image segmentation model (TOM) based on multi-teacher knowledge distillation. By incorporating a novel diffusion-based data augmentation method, we enhanced the generalization ability of the segmentation model while reducing its parameter size. Notably, after reducing the parameter count by 96.6% compared to the teacher models, the student model still achieves an impressive segmentation performance of 95.22% mIoU. Furthermore, we packaged and deployed the trained model as both an online and offline segmentation tool (available at https://itongue.cn/), allowing TCM practitioners and researchers to use it without any programming experience. We also present a case study on TCM constitution classification using segmented tongue patches. Experimental results demonstrate that training with tongue patches yields higher classification performance and better interpretability than original tongue images. To our knowledge, this is the first open-source and freely available tongue image segmentation tool.
title TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation
topic Image and Video Processing
Artificial Intelligence
Quantitative Methods
url https://arxiv.org/abs/2508.14932