Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Zhao, Xinxin, Jiang, Jian, Tian, Yan, Wu, Liqin, Xu, Zhaocheng, Yang, Teddy, Zou, Yunuo, Wang, Xun
Format:	Preprint
Publié:	2026
Sujets:	Computer Vision and Pattern Recognition
Accès en ligne:	https://arxiv.org/abs/2602.21712
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866917294023639040
author	Zhao, Xinxin Jiang, Jian Tian, Yan Wu, Liqin Xu, Zhaocheng Yang, Teddy Zou, Yunuo Wang, Xun
author_facet	Zhao, Xinxin Jiang, Jian Tian, Yan Wu, Liqin Xu, Zhaocheng Yang, Teddy Zou, Yunuo Wang, Xun
contents	Tooth image segmentation is a cornerstone of dental digitization. However, traditional image encoders relying on fixed-resolution feature maps often lead to discontinuous segmentation and poor discrimination between target regions and background, due to insufficient modeling of environmental and global context. Moreover, transformer-based self-attention introduces substantial computational overhead because of its quadratic complexity (O(n^2)), making it inefficient for high-resolution dental images. To address these challenges, we introduce a three-stage encoder with hierarchical feature representation to capture scale-adaptive information in dental images. By jointly leveraging low-level details and high-level semantics through cross-scale feature fusion, the model effectively preserves fine structural information while maintaining strong contextual awareness. Furthermore, a bidirectional sequence modeling strategy is incorporated to enhance global spatial context understanding without incurring high computational cost. We validate our method on two dental datasets, with experimental results demonstrating its superiority over existing approaches. On the OralVision dataset, our model achieves a 1.1% improvement in mean intersection over union (mIoU).
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_21712
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Innovative Tooth Segmentation Using Hierarchical Features and Bidirectional Sequence Modeling Zhao, Xinxin Jiang, Jian Tian, Yan Wu, Liqin Xu, Zhaocheng Yang, Teddy Zou, Yunuo Wang, Xun Computer Vision and Pattern Recognition Tooth image segmentation is a cornerstone of dental digitization. However, traditional image encoders relying on fixed-resolution feature maps often lead to discontinuous segmentation and poor discrimination between target regions and background, due to insufficient modeling of environmental and global context. Moreover, transformer-based self-attention introduces substantial computational overhead because of its quadratic complexity (O(n^2)), making it inefficient for high-resolution dental images. To address these challenges, we introduce a three-stage encoder with hierarchical feature representation to capture scale-adaptive information in dental images. By jointly leveraging low-level details and high-level semantics through cross-scale feature fusion, the model effectively preserves fine structural information while maintaining strong contextual awareness. Furthermore, a bidirectional sequence modeling strategy is incorporated to enhance global spatial context understanding without incurring high computational cost. We validate our method on two dental datasets, with experimental results demonstrating its superiority over existing approaches. On the OralVision dataset, our model achieves a 1.1% improvement in mean intersection over union (mIoU).
title	Innovative Tooth Segmentation Using Hierarchical Features and Bidirectional Sequence Modeling
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.21712

Documents similaires