Saved in:
Bibliographic Details
Main Authors: Penarrubia, Carlos, Garrido-Munoz, Carlos, Valero-Mas, Jose J., Calvo-Zaragoza, Jorge
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.11585
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909652027965440
author Penarrubia, Carlos
Garrido-Munoz, Carlos
Valero-Mas, Jose J.
Calvo-Zaragoza, Jorge
author_facet Penarrubia, Carlos
Garrido-Munoz, Carlos
Valero-Mas, Jose J.
Calvo-Zaragoza, Jorge
contents Handwritten Text Recognition (HTR) is a relevant problem in computer vision, and implies unique challenges owing to its inherent variability and the rich contextualization required for its interpretation. Despite the success of Self-Supervised Learning (SSL) in computer vision, its application to HTR has been rather scattered, leaving key SSL methodologies unexplored. This work focuses on one of them, namely Spatial Context-based SSL. We investigate how this family of approaches can be adapted and optimized for HTR and propose new workflows that leverage the unique features of handwritten text. Our experiments demonstrate that the methods considered lead to advancements in the state-of-the-art of SSL for HTR in a number of benchmark cases.
format Preprint
id arxiv_https___arxiv_org_abs_2404_11585
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Penarrubia, Carlos
Garrido-Munoz, Carlos
Valero-Mas, Jose J.
Calvo-Zaragoza, Jorge
Artificial Intelligence
Handwritten Text Recognition (HTR) is a relevant problem in computer vision, and implies unique challenges owing to its inherent variability and the rich contextualization required for its interpretation. Despite the success of Self-Supervised Learning (SSL) in computer vision, its application to HTR has been rather scattered, leaving key SSL methodologies unexplored. This work focuses on one of them, namely Spatial Context-based SSL. We investigate how this family of approaches can be adapted and optimized for HTR and propose new workflows that leverage the unique features of handwritten text. Our experiments demonstrate that the methods considered lead to advancements in the state-of-the-art of SSL for HTR in a number of benchmark cases.
title Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
topic Artificial Intelligence
url https://arxiv.org/abs/2404.11585