A comparison of clustering methods for writer identification and verification

M.L. Bulacu, L.R.B. Schomaker

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

36 Citations (Scopus)

Abstract

An effective method for writer identification and verification is based on assuming that each writer acts as a stochastic generator of ink-trace fragments, or graphemes. The probability distribution of these simple shapes in a given handwriting sample is characteristic for the writer and is computed using a common codebook of graphemes obtained by clustering. In previous studies we used contours to encode the graphemes, in the current paper we explore a complementary shape representation using normalized bilmaps. The most important aim of the current work is to compare three different clustering methods for generating the grapheme codebook: k-means, Kohonen SOM 1D and 2D. Large scale computational experiments show that the proposed method is robust to the underlying shape representation used (whether contours or normalized bitmaps), to the size of codebook used (stable performance for sizes from 10(2) to 2.5 x 10(3)) and to the clustering method used to generate the codebook (essentially the same performance was obtained for all three clustering methods).

Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR 2005)
Place of PublicationPiscataway
PublisherIEEE (The Institute of Electrical and Electronics Engineers)
Pages1275-1279
Number of pages5
VolumeII
ISBN (Print)0-7695-2420-6
Publication statusPublished - 2005
Event8th International Conference on Document Analysis and Recognition (ICDAR 2005) -
Duration: 29-Aug-20051-Sept-2005

Other

Other8th International Conference on Document Analysis and Recognition (ICDAR 2005)
Period29/08/200501/09/2005

Fingerprint

Dive into the research topics of 'A comparison of clustering methods for writer identification and verification'. Together they form a unique fingerprint.

Cite this