DOI QR코드

DOI QR Code

A Novel Character Segmentation Method for Text Images Captured by Cameras

  • Lue, Hsin-Te (Institute of Computer Science and Information Engineering, National Central University) ;
  • Wen, Ming-Gang (Department of Information Management, National United University) ;
  • Cheng, Hsu-Yung (Institute of Computer Science and Information Engineering, National Central University) ;
  • Fan, Kuo-Chin (Institute of Computer Science and Information Engineering, National Central University) ;
  • Lin, Chih-Wei (Institute of Computer Science and Information Engineering, National Central University) ;
  • Yu, Chih-Chang (Department of Computer Science and Information Engineering, Vanung University)
  • Received : 2010.03.13
  • Accepted : 2010.08.02
  • Published : 2010.10.31

Abstract

Due to the rapid development of mobile devices equipped with cameras, instant translation of any text seen in any context is possible. Mobile devices can serve as a translation tool by recognizing the texts presented in the captured scenes. Images captured by cameras will embed more external or unwanted effects which need not to be considered in traditional optical character recognition (OCR). In this paper, we segment a text image captured by mobile devices into individual single characters to facilitate OCR kernel processing. Before proceeding with character segmentation, text detection and text line construction need to be performed in advance. A novel character segmentation method which integrates touched character filters is employed on text images captured by cameras. In addition, periphery features are extracted from the segmented images of touched characters and fed as inputs to support vector machines to calculate the confident values. In our experiment, the accuracy rate of the proposed character segmentation system is 94.90%, which demonstrates the effectiveness of the proposed method.

Keywords

References

  1. X. Chen et al., "Automatic Detection and Recognition of Signs from Natural Scenes," IEEE Trans. Image Process., vol. 13, no. 1, 2004, pp. 87-99. https://doi.org/10.1109/TIP.2003.819223
  2. R. Lienhart and A. Wernicke, "Localizing and Segmenting Text in Images and Videos," IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 4, 2002, pp. 256-268. https://doi.org/10.1109/76.999203
  3. M.R. Lyu, J. Song, and M. Cai, "A Comprehensive Method for Multilingual Video Text Detection, Localization, and Extraction," IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 2, 2005, pp. 243-255. https://doi.org/10.1109/TCSVT.2004.841653
  4. V. Wu, R. Manmatha, and E.M. Riseman, "Textfinder: An Automatic System to Detect and Recognize Text in Images," IEEE Trans. Pattern Anal. Mach. Intell., vol. 21, no. 11, 1999, pp. 1224-1229. https://doi.org/10.1109/34.809116
  5. W. Wu, X. Chen, and J. Yang, "Detection of Text on Road Signs from Video," IEEE Trans. Intell. Transport. Syst., vol. 6, no. 4, 2005, pp. 378-390. https://doi.org/10.1109/TITS.2005.858619
  6. S. Hu and M. Chen, "Adaptive Frechet Kernel Based Support Vector Machine for Text Detection," IEEE Int. Conf. Acoustics, Speech, Signal Process., vol. 5, 2005, pp. 365-368.
  7. K.C. Kim et al., "Scene Text Extraction in Natural Scene Images Using Hierarchical Feature Combining and Verification," Proc. 17th Int. Conf. Pattern Recognition, vol. 2, 2004, pp. 679-682.
  8. K.L. Kim, K. Jung, and J.H. Kim, "Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm," IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, no. 12, 2003, pp. 1631-1639. https://doi.org/10.1109/TPAMI.2003.1251157
  9. T. Yamguchi and M. Maruyama, "Character Extraction from Natural Scene Images by Hierarchical Classifiers," Proc. 17th Int. Conf. Pattern Recognition, vol. 2, 2004, pp. 687-690.
  10. K.C. Fan and L.S. Wang, "Classification of Machine-Printed and Handwritten Texts Using Character Block Layout Variance," Pattern Recognition, vol. 31, no. 9, 1998, pp.1275-1284. https://doi.org/10.1016/S0031-3203(97)00143-X
  11. J.L. Meunier, "Optimized XY-Cut for Determining a Page Reading Order," Proc. 8th Int. Conf. Document Anal. Recog., vol. 1, 2005, pp. 347-351.
  12. K.C. Fan, L.S. Wang, and Y.K. Wang, "Page Segmentation and Identification for Intelligent Signal Processing," Signal Process., vol. 45, no. 3, 1995, pp. 329-346. https://doi.org/10.1016/0165-1684(95)00061-H
  13. M.C. Jung, Y.C. Shin, and S.N. Srihari, "Machine Printed Character Segmentation Method Using Side Profiles," IEEE Int. Conf. Syst., Man, Cybernetics, vol. 6, 1999, pp. 863-867.
  14. S. Liang, M. Shridhar, and M. Ahmadi, "Efficient Algorithms for Segmentation and Recognition of Printed Characters in Document Processing," IEEE Pacific Rim Conf. Commun., Comput. Signal Process., vol. 1, 1993, pp. 240-243.
  15. C.L. Liu, H. Sako, and H. Fujisawa, "Effects of Classifier Structures and Training Regimes on Integrated Segmentation and Recognition of Handwritten Numeral Strings," IEEE Trans. Pattern Anal. Mach. Intell., vol. 26, no. 11, 2004, pp.1395-1407. https://doi.org/10.1109/TPAMI.2004.104
  16. S. Marinai, M. Gori, and G. Soda, "Artificial Neural Networks for Document Analysis and Recognition," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 1, 2005, pp. 23-35. https://doi.org/10.1109/TPAMI.2005.4
  17. N. Otsu, "A Threshold Selection Method from Gray Level Histograms," IEEE Trans. Syst., Man Cybern., vol. 9, no. 1, 1979, pp. 62-66. https://doi.org/10.1109/TSMC.1979.4310076
  18. K.I. Kim et al., "Support Vector Machines for Texture Classification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 11, 2002, pp. 1542-1550. https://doi.org/10.1109/TPAMI.2002.1046177
  19. V. Vapnik, The Nature of Statistical Learning Theory, New York: Springer-Verlag, 1995.
  20. L.S. Wang, Document Analysis for Lossless Reproduction, Dissertation, Institute of Computer Science Information Engineering, National Central University, 1997.
  21. A. Zramdini and R. Ingold, "Optical Font Recognition Using Typographical Features," IEEE Trans. Pattern Anal. Mach. Intelligence, vol. 20, no. 8, 1998, pp. 877-882. https://doi.org/10.1109/34.709616
  22. H. Beker and f. Piper, Cipher System: The Protection of Communication, John Wiley & Sons, 1983.
  23. C. M. Thillou et al., "A Multifunctional Reading Assistant for the Visually Impaired," EURASIP J. Image Video Process., vol. 2007, no. 4, 2007. Available athttp://downloads.hindawi.com/ journals/ivp/2007/064295.pdf.
  24. F. Chang, et al., "A Prototype Classification Method and its Application to Handwritten Character Recognition," IEEE Int. Conf. Syst., Man Cybern., vol. 5, 2004, pp. 4738-4743.

Cited by

  1. Best Combination of Binarization Methods for License Plate Character Segmentation vol.35, pp.3, 2010, https://doi.org/10.4218/etrij.13.0112.0545