Enregistré dans:
Détails bibliographiques
Auteurs principaux: Zhao, Xinxin, Jiang, Jian, Tian, Yan, Wu, Liqin, Xu, Zhaocheng, Yang, Teddy, Zou, Yunuo, Wang, Xun
Format: Preprint
Publié: 2026
Sujets:
Accès en ligne:https://arxiv.org/abs/2602.21712
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866917294023639040
author Zhao, Xinxin
Jiang, Jian
Tian, Yan
Wu, Liqin
Xu, Zhaocheng
Yang, Teddy
Zou, Yunuo
Wang, Xun
author_facet Zhao, Xinxin
Jiang, Jian
Tian, Yan
Wu, Liqin
Xu, Zhaocheng
Yang, Teddy
Zou, Yunuo
Wang, Xun
contents Tooth image segmentation is a cornerstone of dental digitization. However, traditional image encoders relying on fixed-resolution feature maps often lead to discontinuous segmentation and poor discrimination between target regions and background, due to insufficient modeling of environmental and global context. Moreover, transformer-based self-attention introduces substantial computational overhead because of its quadratic complexity (O(n^2)), making it inefficient for high-resolution dental images. To address these challenges, we introduce a three-stage encoder with hierarchical feature representation to capture scale-adaptive information in dental images. By jointly leveraging low-level details and high-level semantics through cross-scale feature fusion, the model effectively preserves fine structural information while maintaining strong contextual awareness. Furthermore, a bidirectional sequence modeling strategy is incorporated to enhance global spatial context understanding without incurring high computational cost. We validate our method on two dental datasets, with experimental results demonstrating its superiority over existing approaches. On the OralVision dataset, our model achieves a 1.1% improvement in mean intersection over union (mIoU).
format Preprint
id arxiv_https___arxiv_org_abs_2602_21712
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Innovative Tooth Segmentation Using Hierarchical Features and Bidirectional Sequence Modeling
Zhao, Xinxin
Jiang, Jian
Tian, Yan
Wu, Liqin
Xu, Zhaocheng
Yang, Teddy
Zou, Yunuo
Wang, Xun
Computer Vision and Pattern Recognition
Tooth image segmentation is a cornerstone of dental digitization. However, traditional image encoders relying on fixed-resolution feature maps often lead to discontinuous segmentation and poor discrimination between target regions and background, due to insufficient modeling of environmental and global context. Moreover, transformer-based self-attention introduces substantial computational overhead because of its quadratic complexity (O(n^2)), making it inefficient for high-resolution dental images. To address these challenges, we introduce a three-stage encoder with hierarchical feature representation to capture scale-adaptive information in dental images. By jointly leveraging low-level details and high-level semantics through cross-scale feature fusion, the model effectively preserves fine structural information while maintaining strong contextual awareness. Furthermore, a bidirectional sequence modeling strategy is incorporated to enhance global spatial context understanding without incurring high computational cost. We validate our method on two dental datasets, with experimental results demonstrating its superiority over existing approaches. On the OralVision dataset, our model achieves a 1.1% improvement in mean intersection over union (mIoU).
title Innovative Tooth Segmentation Using Hierarchical Features and Bidirectional Sequence Modeling
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.21712