The Character Recognition System of Mobile Camera Based Image

모바일 이미지 기반의 문자인식 시스템

  • Park, Young-Hyun (박영현) (Department of Information and Telecommunication Engineering, Korea Aerospace University) ;
  • Lee, Hyung-Jin (이형진) (Department of Information and Telecommunication Engineering, Korea Aerospace University) ;
  • Baek, Joong-Hwan (백중환) (Department of Information and Telecommunication Engineering, Korea Aerospace University)
  • Received : 2010.04.23
  • Accepted : 2010.05.13
  • Published : 2010.05.31

Abstract

The spread of mobile phones and smartphones has driven the development of a wide range of content. In particular, now that small cameras are built into mobile devices, image-based content has attracted growing interest and has become an important part of how these devices are used in practice. Within this area, character recognition can be applied widely, for example in guidance systems for blind people, automatic robot navigation, automatic video retrieval and indexing, and automatic text translation. This paper therefore proposes a system that extracts text areas from natural images captured by a smartphone camera, recognizes the individual characters, and outputs the result as voice. Text areas are extracted using the AdaBoost algorithm, and the characters in each extracted candidate region are recognized using an error back-propagation neural network.
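The abstract names AdaBoost as the text-area detector but does not describe the features or weak learners used. The following is a minimal sketch of the general boosting idea only, not the authors' implementation. It assumes single-feature threshold stumps as weak classifiers and two hypothetical per-window features (for example edge density and local contrast); the function names and toy data are illustrative.

```python
import math

def stump(x, feature, threshold, polarity):
    """Weak classifier: vote `polarity` if x[feature] >= threshold, else the opposite."""
    return polarity if x[feature] >= threshold else -polarity

def train_adaboost(samples, labels, rounds):
    """Train AdaBoost over threshold stumps.

    samples: list of feature tuples; labels: +1 (text) or -1 (non-text).
    Returns a list of (alpha, feature, threshold, polarity) tuples.
    """
    n = len(samples)
    weights = [1.0 / n] * n          # start with uniform sample weights
    ensemble = []
    for _ in range(rounds):
        best = None                  # (weighted error, feature, threshold, polarity)
        for f in range(len(samples[0])):
            for thr in sorted({x[f] for x in samples}):
                for pol in (1, -1):
                    err = sum(w for x, y, w in zip(samples, labels, weights)
                              if stump(x, f, thr, pol) != y)
                    if best is None or err < best[0]:
                        best = (err, f, thr, pol)
        err, f, thr, pol = best
        err = min(max(err, 1e-10), 1 - 1e-10)    # keep the logarithm finite
        alpha = 0.5 * math.log((1 - err) / err)  # vote strength of this stump
        ensemble.append((alpha, f, thr, pol))
        # Increase the weight of misclassified samples, then renormalize.
        weights = [w * math.exp(-alpha * y * stump(x, f, thr, pol))
                   for x, y, w in zip(samples, labels, weights)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble

def predict(ensemble, x):
    """Classify x by the sign of the alpha-weighted vote of all stumps."""
    score = sum(a * stump(x, f, t, p) for a, f, t, p in ensemble)
    return 1 if score >= 0 else -1

# Toy data (hypothetical): (edge density, local contrast) per image window.
samples = [(0.9, 0.8), (0.7, 0.9), (0.8, 0.7), (0.2, 0.1), (0.6, 0.2), (0.3, 0.3)]
labels = [1, 1, 1, -1, -1, -1]
model = train_adaboost(samples, labels, rounds=3)
```

In a detector of this kind, `predict` would be applied to each candidate window of the camera image, and windows classified as +1 would be passed on to the character recognizer.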
