End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Pattern Analysis and Machine Intelligence Année : 2022

End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

Résumé

Unconstrained handwritten text recognition remains challenging for computer vision systems. Paragraph text recognition is traditionally achieved by two models: the first one for line segmentation and the second one for text line recognition. We propose a unified end-to-end model using hybrid attention to tackle this task. We achieve state-of-the-art character error rate at line and paragraph levels on three popular datasets: 1.90% for RIMES, 4.32% for IAM and 3.63% for READ 2016. The proposed model can be trained from scratch, without using any segmentation label contrary to the standard approach. Our code and trained model weights are available at https://github.com/FactoDeepLearning/VerticalAttentionOCR.

Dates et versions

hal-03413862 , version 1 (04-11-2021)

Identifiants

Citer

Denis Coquenet, Clément Chatelain, Thierry Paquet. End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, ⟨10.1109/TPAMI.2022.3144899⟩. ⟨hal-03413862⟩
31 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More