A new method for writer identification based on histogram symbolic representation
Alaei, A, Roy, PP 2014, 'A new method for writer identification based on histogram symbolic representation', in Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), Crete, Greece, 1- 4 September, pp. 216-221. ISBN: 9781479943340
Published version available from
In this paper, a new model-based writer identification scheme using histogram symbolic representation approach is proposed. In the proposed scheme, initially, some pre-processing techniques are employed to enhance image quality and extract text-lines from each handwritten document image. For each extracted text-line, a set of 92 features are computed based on analysis of connected component, enclosed region, lower and upper contours, fractal code, and Curve let. Considering the extracted feature vectors, a histogram is created for each feature of every writer as a histogram-valued symbolic data. This process results in a handwriting style model for each individual that consists of a set of histograms. To evaluate the proposed scheme, two different handwritten datasets written in two different scripts (Kannada as an Indian based script and English) were used. The first dataset contains 228 pages written in Kannada by 57 people. The other one is the dataset used in SigWiComp2013 composed of 330 document pages written in English by 55 individuals. The same criteria used in the SigWiComp2013 were followed in our evaluation strategy. Concerning the Kannada dataset, an F-measure of 92.79% was obtained when 114 documents were used in learning stage and the rest (114) were used for testing. For the SigWiComp2013 dataset an F-measure of 26.67% was obtained that is fairly comparable to the best result reported in the literature.