A baseline dependent approach for Persian handwritten character segmentation

Document Type

Conference publication

Publication details

Alaei, A, Nagabhushan, P & Pal, U 2010, 'A baseline dependent approach for Persian handwritten character segmentation', in Proceedings of the 20th International Conference on Pattern Recognition, Istanbul, Turkey, 23-26 August,, pp. 1977-198. ISBN: 9781424475414.

Published version available from:


Peer Reviewed



In this paper, an efficient approach to segment Persian off-line handwritten text-line into characters is presented. The proposed algorithm first traces the baseline of the input text-line image and straightens it. Subsequently, it over-segments each word/subwords using features extracted from histogram analysis and then removes extra segmentation points using some baseline dependent as well as language dependent rules. We tested the proposed character segmentation scheme with 2 different datasets. On a test set of 899 Persian words/subwords created by us, 90.26% of the characters were segmented correctly. From another dataset of 200 handwritten Arabic word images we obtained 93.49% correct segmentation accuracy.