Saved in:
Bibliographic Details
Main Authors: Polewczyk, Marek, Spinaci, Marco
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.07502
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916256803717120
author Polewczyk, Marek
Spinaci, Marco
author_facet Polewczyk, Marek
Spinaci, Marco
contents We present a novel deep-learning-based method to cluster words in documents which we apply to detect and recognize tables given the OCR output. We interpret table structure bottom-up as a graph of relations between pairs of words (belonging to the same row, column, header, as well as to the same table) and use a transformer encoder model to predict its adjacency matrix. We demonstrate the performance of our method on the PubTables-1M dataset as well as PubTabNet and FinTabNet datasets. Compared to the current state-of-the-art detection methods such as DETR and Faster R-CNN, our method achieves similar or better accuracy, while requiring a significantly smaller model.
format Preprint
id arxiv_https___arxiv_org_abs_2402_07502
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle ClusterTabNet: Supervised clustering method for table detection and table structure recognition
Polewczyk, Marek
Spinaci, Marco
Machine Learning
Computer Vision and Pattern Recognition
We present a novel deep-learning-based method to cluster words in documents which we apply to detect and recognize tables given the OCR output. We interpret table structure bottom-up as a graph of relations between pairs of words (belonging to the same row, column, header, as well as to the same table) and use a transformer encoder model to predict its adjacency matrix. We demonstrate the performance of our method on the PubTables-1M dataset as well as PubTabNet and FinTabNet datasets. Compared to the current state-of-the-art detection methods such as DETR and Faster R-CNN, our method achieves similar or better accuracy, while requiring a significantly smaller model.
title ClusterTabNet: Supervised clustering method for table detection and table structure recognition
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.07502