Skip to main content

An Application of Deep Learning in Character Recognition: An Overview

  • Chapter
  • First Online:
Handbook of Deep Learning Applications

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 136))

Abstract

For automated document analysis, OCR (Optical character recognition) is a basic building block. The robust automated document analysis system can have impact over a wider sphere of life. Many of the researchers have been working hard to build OCR systems in various languages with significant degree of accuracy, character recognition rate and minimum error rate. Deep learning is the start of art technique with efficient and accurate result as compared to other techniques. Every language, moreover every script have its own challenges e.g. scripts where characters are well separated are less challenging as compared to cursive scripts where characters are attached with one another. In this chapter, we would take a detailed account of the state of art deep learning techniques for Arabic like script, Latin script and symbolic script.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. S. Naz, A.I. Umar, S.H. Shirazi, S.B. Ahmed, M.I. Razzak, I. Siddiqi, Segmentation techniques for recognition of Arabic-like scripts: a comprehensive survey. Educ. Inf. Technol. 21(5), 1225–1241 (2016)

    Article  Google Scholar 

  2. M.I. Razzak, S.A. Husain, A.A. Mirza, A. Belaid, Fuzzy based preprocessing using fusion of online and offline trait for online Urdu script based languages character recognition. Int. J. Innov. Comput. Inf. Control 8(5), 21 (2012)

    Google Scholar 

  3. S. Naz, S.B. Ahmed, R. Ahmad, M.I. Razzak, Arabic script based digit recognition systems, in International Conference on Recent Advances in Computer Systems (RACS) (2016), pp. 67–73

    Google Scholar 

  4. M.I. Razzak, M. Sher, S.A. Hussain, Locally baseline detection for online Arabic script based languages character recognition. Int. J. Phys. Sci. 5(7), 955–959 (2010)

    Google Scholar 

  5. O. Morillot et al., The UOB-Telecom ParisTech Arabic Handwriting Recognition and Translation Systems for the OpenHart 2013 Competition To cite this version (2014)

    Google Scholar 

  6. N. Stamatopoulos, G. Sfikas, Historical Document Processing, vol. 2014 (2016)

    Google Scholar 

  7. S. Naz, A.I. Umar, M.I. Razzak, Lexicon reduction for Urdu/Arabic script based character recognition: a multilingual OCR. Mehran Univ. Res. J. Eng. Technol. 35(2), 209 (2016)

    Google Scholar 

  8. S. Chen, Using multiple sequence alignment and statistical language model to integrate multiple chinese address recognition outputs (2015), pp. 151–155

    Google Scholar 

  9. M.I. Razzak, S.A. Hussain, M. Sher, Z.S. Khan, Combining offline and online preprocessing for online Urdu character recognition, in Proceedings of the International Multiconference of Engineers and Computer Scientists, vol. 1 (2009), pp. 18–20

    Google Scholar 

  10. M.I. Razzak, S.A. Hussain, A. Belaïd, M. Sher, Multi-font numerals recognition for urdu script based languages. Int. J. Recent Trends Eng. (IJRTE) (2009)

    Google Scholar 

  11. S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, A.I. Umar, A.A. Khan, UCOM offline dataset—an Urdu handwritten dataset generation. Int. Arab J. Inf. Technol. 14(2), 239–245 (2017)

    Google Scholar 

  12. A. Kh, A. Kacem, A. Bela, M. Elloumi, A. Kh, Arabic handwritten words off-line recognition based on HMMs and DBNs Arabic handwritten words off-line recognition based on HMMs and DBNs, in 13th International Conference on Document Analysis and Recognition (2015), pp. 51–55

    Google Scholar 

  13. S. Naz, M.I. Razzak, K. Hayat, M.W. Anwar, S.Z. Khan, Challenges in baseline detection of Arabic script based languages, in Intelligent Systems for Science and Information (Springer, Cham, 2014), pp. 181–196

    Chapter  Google Scholar 

  14. D.C. Cires, Deep, Big, Simple Neural Nets for Handwritten, vol. 3220 (2010), pp. 3207–3220

    Google Scholar 

  15. S.F. Rashid, M.-P. Schambach, J. Rottland, S.V.D. Nüll, Low resolution Arabic recognition with multidimensional recurrent neural networks, pp. 1–5

    Google Scholar 

  16. S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, H. Akbar, Arabic script based language character recognition: Nasta’liq vs Naskh analysis, in 2013 World Congress on Computer and Information Technology (WCCIT) (IEEE, 2013), pp. 1–7

    Google Scholar 

  17. S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, S.A. Madani, S.U. Khan, The optical character recognition of Urdu like cursive script Post Ph.D. View project. Pattern Recognit. 1–20 (2014)

    Google Scholar 

  18. M. Liwicki, A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks

    Google Scholar 

  19. A. Graves, Offline handwriting recognition with multidimensional recurrent neural networks, pp. 1–8

    Google Scholar 

  20. A. Graves, Offline Arabic handwriting recognition with multidimensional recurrent neural networks

    Google Scholar 

  21. A. Graves, S. Fern, Multi-dimensional recurrent neural networks (2013), pp. 1–10

    Google Scholar 

  22. A. Graves, S. Fern, M. Liwicki, H. Bunke, Unconstrained online handwriting recognition with recurrent neural networks, pp. 1–8

    Google Scholar 

  23. I. Ahmad, X. Wang, R. Li, M. Ahmed, R. Ullah, Line and ligature segmentation of Urdu Nastaleeq Text, vol. 5 (2017)

    Article  Google Scholar 

  24. Z. Ahmad, J. Khan, I. Shamsher, Urdu compound character recognition using feed forward neural networks (2009)

    Google Scholar 

  25. M. Seuret, M. Alberti, R. Ingold, M. Liwicki, PCA-initialized deep neural networks applied to document image analysis

    Google Scholar 

  26. S.B. Ahmed, S. Naz, M.I. Razzak, R. Yousaf, Deep learning based isolated Arabic scene character recognition, in 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) (2017), pp. 46–51

    Google Scholar 

  27. D.C. Cires, U. Meier, L.M. Gambardella, Convolutional neural network committees for handwritten character classification, vol. 10 (2011), pp. 1135–1139

    Google Scholar 

  28. I. Ahmad, X. Wang, R. Li, S. Rasheed, Offline Urdu Nastaleeq optical character recognition based on stacked denoising autoencoder (2016), pp. 146–157

    Article  Google Scholar 

  29. M.I. Razzak, F. Anwar, S.A. Husain, A. Belaid, M. Sher, HMM and fuzzy logic: a hybrid approach for online Urdu script-based languages’ character recognition. Knowl.-Based Syst. 23(8), 914–923 (2010)

    Article  Google Scholar 

  30. S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Computing and Applications, pp. 1–9 (2017)

    Google Scholar 

  31. R. Ahmad, M.Z. Afzal, S.F. Rashid, M. Liwicki, T. Breuel, A. Dengel, KPTI: Katib’s Pashto text imagebase and deep learning benchmark (2016)

    Google Scholar 

  32. S. Naz, S. Bin, R. Ahmad, M. Imran, Zoning features and 2DLSTM for Urdu text-line recognition. Procedia—Procedia Comput. Sci. 96(September), 16–22 (2016)

    Article  Google Scholar 

  33. S. Naz, A.I. Umar, R. Ahmad, S.B. Ahmed, S.H. Shirazi, M.I. Razzak, Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features. Neural Comput. Appl. 2015

    Google Scholar 

  34. S. Naz et al., Offline Cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks author’s accepted manuscript. Neurocomputing (2016)

    Google Scholar 

  35. S. Naz, A.I. Umar, R. Ahmed, M.I. Razzak, S.F. Rashid, Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. SpringerPlus (2016)

    Google Scholar 

  36. S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 1–9 (2017)

    Google Scholar 

  37. I. Ahmad, X. Wang, M. Guang, H. Ahmad, R. Ullah, Ligature based Urdu Nastaleeq sentence recognition using gated bidirectional long short term memory. Cluster Comput. (2017)

    Google Scholar 

  38. F. Slimane, R. Ingold, S. Kanoun, A.M. Alimi, J. Hennebert, A new Arabic printed text image database and evaluation protocols (2009), pp. 0–4

    Google Scholar 

  39. S. Naz, A.I. Umar, R. Ahmad, I. Siddiqi, Urdu Nastaliq recognition using convolutional-recursive deep learning. Neurocomputing (2017)

    Google Scholar 

  40. S.B. Ahmed, S. Naz, M.I. Razzak, R. Yusof, T.M. Breuel, Balinese character recognition using bidirectional LSTM classifier, in Advances in Machine Learning and Signal Processing (Springer, Cham, 2016), pp. 201–211

    Google Scholar 

  41. H. El, A. Volker, ICDAR 2009-Arabic handwriting recognition competition (2011), pp. 3–13

    Google Scholar 

  42. V. Pham, Dropout improves recurrent neural networks for handwriting recognition

    Google Scholar 

  43. Y. Chherawala, P.P. Roy, M. Cheriet, Feature design for offline Arabic handwriting recognition: handcrafted vs automated? (2013)

    Google Scholar 

  44. T.I. Society, O. Engineering, A comparison of 1D and 2D LSTM architectures for the recognition of handwritten Arabic (2015)

    Google Scholar 

  45. S.B. Ahmed, S. Naz, M.I. Razzak, S.F. Rashid, M.Z. Afzal, T.M. Breuel, Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput. Appl. 27(3), 603–613 (2016)

    Article  Google Scholar 

  46. R. Odate, H. Goto, Fast and accurate candidate reduction using the multiclass LDA for Japanese/Chinese character recognition, in 2015 IEEE International Conference on Image Processing (ICIP) (IEEE, 2015), pp. 951–955

    Google Scholar 

  47. Y. Lu, J. Li, H. Zhang, S. Lin, Chinese character recognition of e-commerce platform pictures (2017), pp. 28–31

    Google Scholar 

  48. Z. Zhong, L. Jin, Z. Feng, I. Engineering, Multi-font printed Chinese character recognition using multi-pooling convolutional neural network (2015), pp. 96–100

    Google Scholar 

  49. Y. Tang, L. Peng, Q. Xu, Y. Wang, A. Furuhata, CNN based transfer learning for historical Chinese character recognition

    Google Scholar 

  50. X. Yu, W. Fan, J. Sun, S. Naoi, Semi-supervised learning feature representation for historical Chinese character recognition (2017), pp. 73–77

    Article  Google Scholar 

  51. D. Cires, T.R.N. Idsia, D. Cires, Multi-column deep neural networks for offline handwritten Chinese character classification (2013)

    Google Scholar 

  52. N. Liu et al., Robust math formula recognition in degraded Chinese document images (2017)

    Google Scholar 

  53. D.K. Ning Liu, D. Zhang, X. Xu, L. Guo, L. Chen, W. Liu, MFR100 dataset

    Google Scholar 

  54. Z. Xie, Z. Sun, L. Jin, H. Ni, T. Lyons, Learning spatial-semantic context with fully convolutional recurrent network for online handwritten Chinese text recognition

    Google Scholar 

  55. C. Wu, W. Fan, Y. He, J. Sun, S. Naoi, Handwritten character recognition by alternately trained relaxation convolutional neural network (2014)

    Google Scholar 

  56. R. Messina, Segmentation-free handwritten Chinese text recognition with LSTM-RNN

    Google Scholar 

  57. R. Messina, J. Louradour, Segmentation-free handwritten Chinese text recognition with LSTM-RNN (2015), pp. 171–175

    Google Scholar 

  58. G. Jin, The PH corpus

    Google Scholar 

  59. A. Eisele, Y. Chen, Multi UN: a multilingual corpus from United Nation documents, in Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’IO), ed. by N.C.C. Chair, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner, D. Tapias (European Language Resource, Valletta, Malta)

    Google Scholar 

  60. L.A. McEnery, Z. Xiao, The Lancaster Corpus of Mandarin Chinese: a corpus for monolingual and contrastive language study. Eur. Lang. Resour. Assoc. (2004)

    Google Scholar 

  61. Z. Yan, C. Yan, C. Zhang, Rare Chinese character recognition by radical extraction network (2017), pp. 924–929

    Google Scholar 

  62. C.-L. Liu, F. Yin, D.-H. Wang, Q.-F. Wang, Casia online and offline Chinese handwriting databases, in 2011 International Conference on Document Analysis and Recognition (ICDAR) (IEEE, 2011), pp. 37–41

    Google Scholar 

  63. S. Nishide, H.G. Okuno, T. Ogata, J. Tani, Handwriting prediction based character recognition using recurrent neural network (2011), pp. 2549–2554

    Google Scholar 

  64. A. Chaudhuri, S.K. Ghosh, Optical character recognition system for Czech language using hierarchical deep learning networks, vol. 1

    Google Scholar 

  65. L. Chen, S. Wang, W. Fan, J. Sun, N. Satoshi, Deep learning based language and orientation recognition in document analysis (2015), pp. 436–440

    Google Scholar 

  66. D. Cires, U. Meier, Multi-column deep neural networks for image classification (2011)

    Google Scholar 

  67. L. Kang, J. Kumar, P. Ye, Y. Li, D. Doermann, Convolutional neural networks for document image classification (2014), pp. 3168–3172

    Google Scholar 

  68. L. Noce, I. Gallo, A. Zamberletti, A. Calefati, Embedded textual content for document image classification with convolutional neural networks (2016)

    Google Scholar 

  69. G. Zhong, H. Yao, Y. Liu, C. Hong, T. Pham, Classification of photographed document images based on deep-learning features, in ICGIP 2016, vol. 10225 (2017), pp. 1–6

    Google Scholar 

  70. T. Wang, D.J. Wu, A.Y. Ng, End-to-end text recognition with convolutional neural networks, in ICPR (2012), pp. 3304–3308

    Google Scholar 

  71. O. Morillot, L. Likforman-sulem, O. Morillot, L. Likforman-sulem, New baseline correction algorithm for text-line recognition with bidirectional recurrent neural networks neural networks

    Google Scholar 

  72. A. Graves, M. Liwicki, S. Ferna, R. Bertolami, H. Bunke, A novel connectionist system for unconstrained handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 855–868 (2009)

    Article  Google Scholar 

  73. A. Ray, Text recognition using deep BLSTM networks

    Google Scholar 

  74. P.P. Roy, G. Zhong, M. Cheriet, Tandem hidden Markov models using deep belief networks for offline handwriting recognition. Front. Inf. Technol. Electron. Eng. 18(61403353), 978–988 (2017)

    Article  Google Scholar 

  75. S. Sudholt, G.A. Fink, A.W. Spotting, PHOCNet: a deep convolutional neural network for word spotting in handwritten documents (2016), pp. 277–282

    Google Scholar 

  76. A. Rehman, S. Naz, M.I. Razzak, Writer identification using machine learning approaches: a comprehensive review, in Springer Multimedia Tools and Applications (2018)

    Google Scholar 

  77. W. Feng, N. Guan, Y. Li, X. Zhang, Z. Luo, Audio visual speech recognition with multimodal recurrent neural networks (2017), pp. 681–688

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saeeda Naz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Saeed, S., Naz, S., Razzak, M.I. (2019). An Application of Deep Learning in Character Recognition: An Overview. In: Balas, V., Roy, S., Sharma, D., Samui, P. (eds) Handbook of Deep Learning Applications. Smart Innovation, Systems and Technologies, vol 136. Springer, Cham. https://doi.org/10.1007/978-3-030-11479-4_3

Download citation

Publish with us

Policies and ethics