An Approach based on Run Length Count for Denoising the Kannada Characters
Publication Type:Journal Article
Source:International Journal of Computer Applications, Foundation of Computer Science, Volume 50, Issue 18 (2012)
Keywords:Department of Information Science and Engineering
Optical Character Recognition (OCR) is one of the important fields in image processing and pattern recognition domain. OCR with high accuracy finds application in offices, banks, healthcare etc. The accuracy of the OCR is primarily dependent on the quality of the input image. So, to achieve high accuracy OCR we should provide a high quality image, which is free from different types of noises, degradation, skews etc. In this paper, we have made an attempt to remove the noise, which is present in the input image. A novel method based on run length count is proposed to denoise the images. In this approach first the noisy image is binarized. Based on the horizontal and vertical run length count, the noise in the image will be identified and eliminated. The algorithm is tested with noisy epigraphical document images, noisy printed document images. The effectiveness of the algorithm is verified with images having synthetic noise derived from Gaussian, Speckle and Poisson noise models. The experimental results show that the proposed method is efficient for noise elimination.