Abstract
An effective method for writer identification and verification is based on assuming that each writer acts as a stochastic generator of ink-trace fragments, or graphemes. The probability distribution of these simple shapes in a given handwriting sample is characteristic for the writer and is computed using a common codebook of graphemes obtained by clustering. In previous studies we used contours to encode the graphemes, in the current paper we explore a complementary shape representation using normalized bilmaps. The most important aim of the current work is to compare three different clustering methods for generating the grapheme codebook: k-means, Kohonen SOM 1D and 2D. Large scale computational experiments show that the proposed method is robust to the underlying shape representation used (whether contours or normalized bitmaps), to the size of codebook used (stable performance for sizes from 10(2) to 2.5 x 10(3)) and to the clustering method used to generate the codebook (essentially the same performance was obtained for all three clustering methods).
Original language | English |
---|---|
Title of host publication | Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR 2005) |
Place of Publication | Piscataway |
Publisher | IEEE (The Institute of Electrical and Electronics Engineers) |
Pages | 1275-1279 |
Number of pages | 5 |
Volume | II |
ISBN (Print) | 0-7695-2420-6 |
Publication status | Published - 2005 |
Event | 8th International Conference on Document Analysis and Recognition (ICDAR 2005) - Duration: 29-Aug-2005 → 1-Sept-2005 |
Other
Other | 8th International Conference on Document Analysis and Recognition (ICDAR 2005) |
---|---|
Period | 29/08/2005 → 01/09/2005 |