Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Tong, Xin, Peng, Shi, Tian, Baojie, Guo, Yufei, Huang, Xuhui, Ma, Zhe
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2502.17766
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909509519147008
author Tong, Xin
Peng, Shi
Tian, Baojie
Guo, Yufei
Huang, Xuhui
Ma, Zhe
author_facet Tong, Xin
Peng, Shi
Tian, Baojie
Guo, Yufei
Huang, Xuhui
Ma, Zhe
contents Classical Transformer-based line segment detection methods have delivered impressive results. However, we observe that some accurately detected line segments are assigned low confidence scores during prediction, causing them to be ranked lower and potentially suppressed. Additionally, these models often require prolonged training periods to achieve strong performance, largely due to the necessity of bipartite matching. In this paper, we introduce RANK-LETR, a novel Transformer-based line segment detection method. Our approach leverages learnable geometric information to refine the ranking of predicted line segments by enhancing the confidence scores of high-quality predictions in a posterior verification step. We also propose a new line segment proposal method, wherein the feature point nearest to the centroid of the line segment directly predicts the location, significantly improving training efficiency and stability. Moreover, we introduce a line segment ranking loss to stabilize rankings during training, thereby enhancing the generalization capability of the model. Experimental results demonstrate that our method outperforms other Transformer-based and CNN-based approaches in prediction accuracy while requiring fewer training epochs than previous Transformer-based models.
format Preprint
id arxiv_https___arxiv_org_abs_2502_17766
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
Tong, Xin
Peng, Shi
Tian, Baojie
Guo, Yufei
Huang, Xuhui
Ma, Zhe
Computer Vision and Pattern Recognition
Classical Transformer-based line segment detection methods have delivered impressive results. However, we observe that some accurately detected line segments are assigned low confidence scores during prediction, causing them to be ranked lower and potentially suppressed. Additionally, these models often require prolonged training periods to achieve strong performance, largely due to the necessity of bipartite matching. In this paper, we introduce RANK-LETR, a novel Transformer-based line segment detection method. Our approach leverages learnable geometric information to refine the ranking of predicted line segments by enhancing the confidence scores of high-quality predictions in a posterior verification step. We also propose a new line segment proposal method, wherein the feature point nearest to the centroid of the line segment directly predicts the location, significantly improving training efficiency and stability. Moreover, we introduce a line segment ranking loss to stabilize rankings during training, thereby enhancing the generalization capability of the model. Experimental results demonstrate that our method outperforms other Transformer-based and CNN-based approaches in prediction accuracy while requiring fewer training epochs than previous Transformer-based models.
title Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.17766