International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 9 - Issue 6, June 2020 Edition

International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616

The Effect Of Binarization Algorithms Considering Color-To-Gray Scale Conversion Methods On Historic Document Images

[Full Text]



Paramasivam M E, Sabeenian R S, Dinesh P M



Image processing, Image analysis, Image color analysis, Image enhancement, Document Image Processing, Natural Image Processing, Color-to-Gray Scale Conversion, Document Image Binarization, Contrast Measure, Contrast-Per-Pixel, Binarization Metrics, F-Measure.



Character recognition from historic document images has been a challenge for computer scientists. The background of these documents contain enormous degradations, which gets visualized as noise. A few key preprocessing steps before character recognition are color-to-gray scale (C2G) transformation, binarization and segmentation. Numerous algorithms have been proposed, however there has been no generalized method for binarization invariant of the type image. Many C2G methods have been projected, but each of which directly mapped the noise from color image on to its gray scale version. This paper has tested the effect of C2G transformation on local and global binarization methods using images of DIBCO 2013 dataset. We have analyzed how variation of degradations, along with color contributions affects the binary image formation. Results show that not all binarization methods provide outputs for varying gray scale images. The gray scale image obtained by gamma correction based C2G conversion, eradicated noised to a maximum extent and hence has supported any kind of binarization algorithms. The qualitative measure for all obtained binary images was computed using F-Measure with their respective ground truths. To understand the contribution of each color channel in RGB color space, contrast-per-pixel (CPP) was computed. The identical F-Measure values for images with equal CPP, invariant of C2G transformation used has also been examined in this paper. To conclude, we have tested the interweaving relation between two major areas of image processing viz., Natural Image Processing and Document Image Processing.



[1] D. V. Nikolaos Ntogas, Digital Restoration by Denoising and Binarization of Historical Manuscripts Images, InTech, 2012.
[2] P. S. &. S. B. Dhok, "Review of Text Extraction Algorithms for Scenetext and Document Images," IETE Technical Review, pp. 1-21, 2016.
[3] C. Y. S. a. K. Y. S. Mori, "Historical review of OCR research and development," in IEEE, 1992.
[4] S.-W. L. C. Y. S. Yuan Y. Tang, "Automatic document processing: A survey," Pattern Recognition, vol. 29, no. 12, pp. 1931-1952, 1996.
[5] T. D. a. A. S. D. Ghosh, "Script Recognition—A Review," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 32, no. 12, pp. 2142-2161, 2010.
[6] B. C. U. Pal, "Indian script character recognition: a survey," Pattern Recognition, vol. 37, no. 9, pp. 1887-1899, 2004.
[7] P. B. P. &. A. G. Ramakrishnan, "OCR in Indian Scripts: A Survey," IETE Technical Review, vol. 22, no. 3, pp. 217-227, 2005.
[8] R. Hedjam, R. F. Moghaddam and M. Cheriet, "Text extraction from degraded document images," in 2nd European Workshop on Visual Information Processing (EUVIP) 2010, 2010.
[9] B. Gatos, K. Ntirogiannis and I. Pratikakis, "ICDAR 2009 Document Image Binarization Contest (DIBCO 2009)," in Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on, 2009.
[10] I. Pratikakis, B. Gatos and K. Ntirogiannis, "ICDAR 2011 Document Image Binarization Contest (DIBCO 2011)," in International Conference on Document Analysis and Recognition (ICDAR) 2011, 2011.
[11] A. Antonacopoulos, C. Clausner, C. Papadopoulos and S. Pletschacher, "ICDAR 2013 Competition on Historical Book Recognition (HBR 2013)," in 12th International Conference on Document Analysis and Recognition (ICDAR) 2013 , 2013.
[12] I. Pratikakis, B. Gatos and K. Ntirogiannis, "H-DIBCO 2010 - Handwritten Document Image Binarization Competition," in Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition, Washington, 2010.
[13] I. Pratikakis, B. Gatos and K. Ntirogiannis, "ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012)," in International Conference on Frontiers in Handwriting Recognition (ICFHR) 2012, 2012.
[14] F. Ghani, E. Khan and M. A. Khan, "Restoration of Old Manuscripts using Image Processing Techniques," IETE Journal of Research, vol. 46, no. 5, pp. 325-329, 2000.
[15] S. Jayaraman, S. Esakkirajan and T. Veerakumar, Digital Image Processing, NewDelhi, Delh: Tata Mc-Graw Hill Education Private Limited, 2011.
[16] R. C. Gonzalez and R. E. Woods, Digital Image Processing (3rd Edition), Upper Saddle River, NJ, USA: Prentice-Hall, Inc., 2006.
[17] C. Kanan and G. W. Cottrell, "Color-to-Grayscale: Does the Method Matter in Image Recognition?," PLoS ONE, vol. 7, p. e29740, Jan 2012.
[18] S. Susstrunk, R. Buckley and S. Swen, "Standard RGB Color Spaces," in In The Seventh Color Imaging Conference: Color Science, Systems, and Applications, 1999.
[19] K. E. Spaulding, G. J. Woolfe and E. J. Giorgianni, "Image States and Standard Color Encodings (RIMM/ROMM RGB)," Color and Imaging Conference, pp. 288-294, 2000.
[20] A. Koschan and M. A. Abidi, Digital Color Image Processing, New York, NY, USA: Wiley-Interscience, 2008.
[21] W. K. Pratt, Digital Image Processing: PIKS Inside, 3rd ed., New York, NY, USA: John Wiley & Sons, Inc., 2001.
[22] M. Nixon and A. S. Aguado, Feature Extraction & Image Processing, Second Edition, 2nd ed., Academic Press, 2008.
[23] M. Qiu, G. D. Finlayson and G. Qiu, "Contrast Maximizing and Brightness Preserving Color to Grayscale Image Conversion," Conference on Colour in Graphics, Imaging, and Vision, pp. 347-351, 2008.
[24] J. Zhengmeng. and K. Michael Ng, "A contrast maximization method for color-to-grayscale conversion," Multidimensional Systems and Signal Processing, vol. 26, pp. 869-877, 2015.
[25] M. G. a. N. A. Dodgson, "Decolorize: Fast, Contrast Enhancing, Color to Grayscale Conversion," Pattern Recognition, vol. 40, no. 11, pp. 2891-2896, #oct# 2005.
[26] C. Lu, L. Xu and J. Jia, "Contrast preserving decolorization," in IEEE International Conference on Computational Photography (ICCP) 2012 , 2012.
[27] Y. Kim, C. Jang, J. Demouth and S. Lee, "Robust Color-to-gray via Nonlinear Global Mapping," ACM Trans. Graph., vol. 28, pp. 1611-1614, Dec 2009.
[28] B. Gatos, I. Pratikakis and S. J. Perantonis, "Efficient Binarization of Historical and Degraded Document Images," in Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on, 2008.
[29] R. F. Moghaddam and M. Cheriet, "RSLDI: Restoration of single-sided low-quality document images," Pattern Recognition , vol. 42, pp. 3355-3364, 2009.
[30] M. Cheriet, N. Kharma, C.-l. Liu and C. Suen, Character Recognition Systems: A Guide for Students and Practitioners, Wiley-Interscience, 2007.
[31] C. C. Fung and R. Chamchong, "A Review of Evaluation of Optimal Binarization Technique for Character Segmentation in Historical Manuscripts," in Knowledge Discovery and Data Mining, 2010. WKDD '10. Third International Conference on, 2010.
[32] O. D. Trier and T. Taxt, "Evaluation of Binarization Methods for Document Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, pp. 312-315, 1995.
[33] O. D. Trier and A. K. Jain, "Goal-directed evaluation of binarization methods," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 17, pp. 1191-1201, #dec# 1995.
[34] M. Sezgin and B. Sankur, "Survey over image thresholding techniques and quantitative performance evaluation," Journal of Electronic Imaging, vol. 13, pp. 146-168, 2004.
[35] N. Otsu, "A Threshold Selection Method from Gray-Level Histograms," Systems, Man and Cybernetics, IEEE Transactions on, vol. 9, pp. 62-66, Jan 1979.
[36] K. Fukunaga, Introduction to Statistical Pattern Recognition (2nd Ed.), San Diego, CA, USA: Academic Press Professional, Inc., 1990.
[37] N. Papamarkos and B. Gatos, "A New Approach for Multilevel Threshold Selection," CVGIP: Graph. Models Image Process., vol. 56, pp. 357-370, Sep 1994.
[38] D.-M. Tsai and Y.-H. Chen, "A fast histogram-clustering approach for multi-level thresholding," Pattern Recognition Letters , vol. 13, pp. 245-252, 1992.
[39] B. Gatos, I. Pratikakis and S. J. Perantonis, "Adaptive degraded document image binarization," Pattern Recognition , vol. 39, pp. 317-327, 2006.
[40] W. Niblack, An Introduction to Digital Image Processing, Birkeroed, Denmark: Strandberg Publishing Company, 1985.
[41] J. Sauvola and M. Pietikäinen, "Adaptive document image binarization," Pattern Recogniton, vol. 33, pp. 225-236, 2000.
[42] K. Khurshid, I. Siddiqi, C. Faure and N. Vincent, "Comparison of Niblack inspired binarization methods for ancient documents," in Proc. SPIE 7247, Document Recognition and Retrieval XVI, 72470U, 2009.
[43] C. Wolf and J.-M. Jolion, "Extraction and recognition of artificial text in multimedia documents," Formal Pattern Analysis and Applications, vol. 6, pp. 309-326, 2004.
[44] P. D. Wellner, "Adaptive thresholding for digital desk," Xerox Research Centre, Europe, 1993.
[45] D. Bradley and G. Roth, "Adaptive Thresholding using the Integral Image," Journal of Graphics, GPU, and Game Tools, vol. 12, no. 2, pp. 13-21, 2007.
[46] V. Wu, R. Manmatha and E. M. Riseman, "Textfinder: an automatic system to detect and recognize text in images," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 21, pp. 1224-1229, Nov 1999.
[47] K. Ntirogiannis, B. Gatos and I. Pratikakis, "An Objective Evaluation Methodology for Document Image Binarization Techniques," in Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on, 2008.
[48] Paramasivam and Sabeenian, "Contrast Based Color Plane Selection for Binarization of Historical Document Images," in 2nd International Conference on on Emerging Trends in Electrical, Communication and Information Technologies (ICECIT 2015), 2015.