Improving the robustness of LSTMs for word classification using stressed word endings in dual-state word-beam search

Mahya Ameryan*, Lambert Schomaker

*Bijbehorende auteur voor dit werk

OnderzoeksoutputAcademicpeer review

1 Citaat (Scopus)
66 Downloads (Pure)

Samenvatting

In recent years, long short-term memory neural networks (LSTMs) followed by a connectionist temporal classification (CTC) have shown strength in solving handwritten text recognition problems. Such networks can handle not only sequence variability but also geometric variation by using a convolutional front end, at the input side. Although different approaches have been introduced for decoding activations in the CTC output layer, only limited consideration is given to the use of proper label-coding schemes. In this paper, we use a limited-size ensemble of end-to-end convolutional LSTM Neural Networks to evaluate four label-coding schemes. Additionally, we evaluate two CTC search techniques: Best-path search vs dual-state word-beam search (DSWBS). The classifiers in the ensemble have comparable architectures but variable numbers of hidden units. We tested the coding and search approaches on three datasets: A standard benchmark IAM dataset (English) and two more difficult historical handwritten datasets (diaries and field notes, highly multilingual). Results show that stressing the word endings in the label-coding scheme yields a higher performance, especially for DSWBS. However, stressing the start-of-word shapes with a token appears to be disadvantageous.
Originele taal-2English
TitelProceedings of the 2020, 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)
UitgeverijInstitute of Electrical and Electronics Engineers (IEEE)
Pagina's13-18
Aantal pagina's6
ISBN van elektronische versie978-1-7281-9966-5
ISBN van geprinte versie978-1-7281-9967-2
DOI's
StatusPublished - sep-2020
Evenement 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR) - Dortmund, Germany
Duur: 25-nov-202025-nov-2020

Publicatie series

NaamProceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR
Volume2020-September
ISSN van geprinte versie2167-6445
ISSN van elektronische versie2167-6453

Conference

Conference 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)
Land/RegioGermany
StadDortmund
Periode25/11/202025/11/2020

Citeer dit