Beyond OCR: Multi-faceted understanding of handwritten document characteristics

OnderzoeksoutputAcademicpeer review

18 Citaten (Scopus)

Samenvatting

Handwritten document understanding is a fundamental research problem in pattern recognition and it relies on the effective features. In this paper, we propose a joint feature distribution (JFD) principle to design novel discriminative features which could be the joint distribution of features on adjacent positions or the joint distribution of different features on the same location. Following the proposed JFD principle, we introduce seventeen features, including twelve textural-based and five grapheme-based features. We evaluate these features for different applications from four different perspectives to understand handwritten documents beyond OCR, by writer identification, script recognition, historical manuscript dating and localization. Extensive experimental results demonstrate that our novel QuadHinge and CoHinge features following the JFD principle provide promising results on these four applications.
Originele taal-2English
Pagina's (van-tot)321-333
Aantal pagina's13
TijdschriftPattern recognition
Volume63
DOI's
StatusPublished - mrt-2017

Citeer dit