Guardado en:
Detalles Bibliográficos
Autores principales: Zhao, Zeyang, Xue, Qilong, He, Yuhang, Bai, Yifan, Wei, Xing, Gong, Yihong
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2407.08489
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866914866504138752
author Zhao, Zeyang
Xue, Qilong
He, Yuhang
Bai, Yifan
Wei, Xing
Gong, Yihong
author_facet Zhao, Zeyang
Xue, Qilong
He, Yuhang
Bai, Yifan
Wei, Xing
Gong, Yihong
contents This paper introduces the point-axis representation for oriented object detection, emphasizing its flexibility and geometrically intuitive nature with two key components: points and axes. 1) Points delineate the spatial extent and contours of objects, providing detailed shape descriptions. 2) Axes define the primary directionalities of objects, providing essential orientation cues crucial for precise detection. The point-axis representation decouples location and rotation, addressing the loss discontinuity issues commonly encountered in traditional bounding box-based approaches. For effective optimization without introducing additional annotations, we propose the max-projection loss to supervise point set learning and the cross-axis loss for robust axis representation learning. Further, leveraging this representation, we present the Oriented DETR model, seamlessly integrating the DETR framework for precise point-axis prediction and end-to-end detection. Experimental results demonstrate significant performance improvements in oriented object detection tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2407_08489
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
Zhao, Zeyang
Xue, Qilong
He, Yuhang
Bai, Yifan
Wei, Xing
Gong, Yihong
Computer Vision and Pattern Recognition
This paper introduces the point-axis representation for oriented object detection, emphasizing its flexibility and geometrically intuitive nature with two key components: points and axes. 1) Points delineate the spatial extent and contours of objects, providing detailed shape descriptions. 2) Axes define the primary directionalities of objects, providing essential orientation cues crucial for precise detection. The point-axis representation decouples location and rotation, addressing the loss discontinuity issues commonly encountered in traditional bounding box-based approaches. For effective optimization without introducing additional annotations, we propose the max-projection loss to supervise point set learning and the cross-axis loss for robust axis representation learning. Further, leveraging this representation, we present the Oriented DETR model, seamlessly integrating the DETR framework for precise point-axis prediction and end-to-end detection. Experimental results demonstrate significant performance improvements in oriented object detection tasks.
title Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2407.08489