Abstract
For automated document analysis, OCR (Optical character recognition) is a basic building block. The robust automated document analysis system can have impact over a wider sphere of life. Many of the researchers have been working hard to build OCR systems in various languages with significant degree of accuracy, character recognition rate and minimum error rate. Deep learning is the start of art technique with efficient and accurate result as compared to other techniques. Every language, moreover every script have its own challenges e.g. scripts where characters are well separated are less challenging as compared to cursive scripts where characters are attached with one another. In this chapter, we would take a detailed account of the state of art deep learning techniques for Arabic like script, Latin script and symbolic script.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
S. Naz, A.I. Umar, S.H. Shirazi, S.B. Ahmed, M.I. Razzak, I. Siddiqi, Segmentation techniques for recognition of Arabic-like scripts: a comprehensive survey. Educ. Inf. Technol. 21(5), 1225–1241 (2016)
M.I. Razzak, S.A. Husain, A.A. Mirza, A. Belaid, Fuzzy based preprocessing using fusion of online and offline trait for online Urdu script based languages character recognition. Int. J. Innov. Comput. Inf. Control 8(5), 21 (2012)
S. Naz, S.B. Ahmed, R. Ahmad, M.I. Razzak, Arabic script based digit recognition systems, in International Conference on Recent Advances in Computer Systems (RACS) (2016), pp. 67–73
M.I. Razzak, M. Sher, S.A. Hussain, Locally baseline detection for online Arabic script based languages character recognition. Int. J. Phys. Sci. 5(7), 955–959 (2010)
O. Morillot et al., The UOB-Telecom ParisTech Arabic Handwriting Recognition and Translation Systems for the OpenHart 2013 Competition To cite this version (2014)
N. Stamatopoulos, G. Sfikas, Historical Document Processing, vol. 2014 (2016)
S. Naz, A.I. Umar, M.I. Razzak, Lexicon reduction for Urdu/Arabic script based character recognition: a multilingual OCR. Mehran Univ. Res. J. Eng. Technol. 35(2), 209 (2016)
S. Chen, Using multiple sequence alignment and statistical language model to integrate multiple chinese address recognition outputs (2015), pp. 151–155
M.I. Razzak, S.A. Hussain, M. Sher, Z.S. Khan, Combining offline and online preprocessing for online Urdu character recognition, in Proceedings of the International Multiconference of Engineers and Computer Scientists, vol. 1 (2009), pp. 18–20
M.I. Razzak, S.A. Hussain, A. Belaïd, M. Sher, Multi-font numerals recognition for urdu script based languages. Int. J. Recent Trends Eng. (IJRTE) (2009)
S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, A.I. Umar, A.A. Khan, UCOM offline dataset—an Urdu handwritten dataset generation. Int. Arab J. Inf. Technol. 14(2), 239–245 (2017)
A. Kh, A. Kacem, A. Bela, M. Elloumi, A. Kh, Arabic handwritten words off-line recognition based on HMMs and DBNs Arabic handwritten words off-line recognition based on HMMs and DBNs, in 13th International Conference on Document Analysis and Recognition (2015), pp. 51–55
S. Naz, M.I. Razzak, K. Hayat, M.W. Anwar, S.Z. Khan, Challenges in baseline detection of Arabic script based languages, in Intelligent Systems for Science and Information (Springer, Cham, 2014), pp. 181–196
D.C. Cires, Deep, Big, Simple Neural Nets for Handwritten, vol. 3220 (2010), pp. 3207–3220
S.F. Rashid, M.-P. Schambach, J. Rottland, S.V.D. Nüll, Low resolution Arabic recognition with multidimensional recurrent neural networks, pp. 1–5
S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, H. Akbar, Arabic script based language character recognition: Nasta’liq vs Naskh analysis, in 2013 World Congress on Computer and Information Technology (WCCIT) (IEEE, 2013), pp. 1–7
S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, S.A. Madani, S.U. Khan, The optical character recognition of Urdu like cursive script Post Ph.D. View project. Pattern Recognit. 1–20 (2014)
M. Liwicki, A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks
A. Graves, Offline handwriting recognition with multidimensional recurrent neural networks, pp. 1–8
A. Graves, Offline Arabic handwriting recognition with multidimensional recurrent neural networks
A. Graves, S. Fern, Multi-dimensional recurrent neural networks (2013), pp. 1–10
A. Graves, S. Fern, M. Liwicki, H. Bunke, Unconstrained online handwriting recognition with recurrent neural networks, pp. 1–8
I. Ahmad, X. Wang, R. Li, M. Ahmed, R. Ullah, Line and ligature segmentation of Urdu Nastaleeq Text, vol. 5 (2017)
Z. Ahmad, J. Khan, I. Shamsher, Urdu compound character recognition using feed forward neural networks (2009)
M. Seuret, M. Alberti, R. Ingold, M. Liwicki, PCA-initialized deep neural networks applied to document image analysis
S.B. Ahmed, S. Naz, M.I. Razzak, R. Yousaf, Deep learning based isolated Arabic scene character recognition, in 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) (2017), pp. 46–51
D.C. Cires, U. Meier, L.M. Gambardella, Convolutional neural network committees for handwritten character classification, vol. 10 (2011), pp. 1135–1139
I. Ahmad, X. Wang, R. Li, S. Rasheed, Offline Urdu Nastaleeq optical character recognition based on stacked denoising autoencoder (2016), pp. 146–157
M.I. Razzak, F. Anwar, S.A. Husain, A. Belaid, M. Sher, HMM and fuzzy logic: a hybrid approach for online Urdu script-based languages’ character recognition. Knowl.-Based Syst. 23(8), 914–923 (2010)
S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Computing and Applications, pp. 1–9 (2017)
R. Ahmad, M.Z. Afzal, S.F. Rashid, M. Liwicki, T. Breuel, A. Dengel, KPTI: Katib’s Pashto text imagebase and deep learning benchmark (2016)
S. Naz, S. Bin, R. Ahmad, M. Imran, Zoning features and 2DLSTM for Urdu text-line recognition. Procedia—Procedia Comput. Sci. 96(September), 16–22 (2016)
S. Naz, A.I. Umar, R. Ahmad, S.B. Ahmed, S.H. Shirazi, M.I. Razzak, Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features. Neural Comput. Appl. 2015
S. Naz et al., Offline Cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks author’s accepted manuscript. Neurocomputing (2016)
S. Naz, A.I. Umar, R. Ahmed, M.I. Razzak, S.F. Rashid, Urdu Nasta’liq text recognition using implicit segmentation based on multi-dimensional long short term memory neural networks. SpringerPlus (2016)
S.B. Ahmed, S. Naz, S. Swati, M.I. Razzak, Handwritten Urdu character recognition using one-dimensional BLSTM classifier. Neural Comput. Appl. 1–9 (2017)
I. Ahmad, X. Wang, M. Guang, H. Ahmad, R. Ullah, Ligature based Urdu Nastaleeq sentence recognition using gated bidirectional long short term memory. Cluster Comput. (2017)
F. Slimane, R. Ingold, S. Kanoun, A.M. Alimi, J. Hennebert, A new Arabic printed text image database and evaluation protocols (2009), pp. 0–4
S. Naz, A.I. Umar, R. Ahmad, I. Siddiqi, Urdu Nastaliq recognition using convolutional-recursive deep learning. Neurocomputing (2017)
S.B. Ahmed, S. Naz, M.I. Razzak, R. Yusof, T.M. Breuel, Balinese character recognition using bidirectional LSTM classifier, in Advances in Machine Learning and Signal Processing (Springer, Cham, 2016), pp. 201–211
H. El, A. Volker, ICDAR 2009-Arabic handwriting recognition competition (2011), pp. 3–13
V. Pham, Dropout improves recurrent neural networks for handwriting recognition
Y. Chherawala, P.P. Roy, M. Cheriet, Feature design for offline Arabic handwriting recognition: handcrafted vs automated? (2013)
T.I. Society, O. Engineering, A comparison of 1D and 2D LSTM architectures for the recognition of handwritten Arabic (2015)
S.B. Ahmed, S. Naz, M.I. Razzak, S.F. Rashid, M.Z. Afzal, T.M. Breuel, Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput. Appl. 27(3), 603–613 (2016)
R. Odate, H. Goto, Fast and accurate candidate reduction using the multiclass LDA for Japanese/Chinese character recognition, in 2015 IEEE International Conference on Image Processing (ICIP) (IEEE, 2015), pp. 951–955
Y. Lu, J. Li, H. Zhang, S. Lin, Chinese character recognition of e-commerce platform pictures (2017), pp. 28–31
Z. Zhong, L. Jin, Z. Feng, I. Engineering, Multi-font printed Chinese character recognition using multi-pooling convolutional neural network (2015), pp. 96–100
Y. Tang, L. Peng, Q. Xu, Y. Wang, A. Furuhata, CNN based transfer learning for historical Chinese character recognition
X. Yu, W. Fan, J. Sun, S. Naoi, Semi-supervised learning feature representation for historical Chinese character recognition (2017), pp. 73–77
D. Cires, T.R.N. Idsia, D. Cires, Multi-column deep neural networks for offline handwritten Chinese character classification (2013)
N. Liu et al., Robust math formula recognition in degraded Chinese document images (2017)
D.K. Ning Liu, D. Zhang, X. Xu, L. Guo, L. Chen, W. Liu, MFR100 dataset
Z. Xie, Z. Sun, L. Jin, H. Ni, T. Lyons, Learning spatial-semantic context with fully convolutional recurrent network for online handwritten Chinese text recognition
C. Wu, W. Fan, Y. He, J. Sun, S. Naoi, Handwritten character recognition by alternately trained relaxation convolutional neural network (2014)
R. Messina, Segmentation-free handwritten Chinese text recognition with LSTM-RNN
R. Messina, J. Louradour, Segmentation-free handwritten Chinese text recognition with LSTM-RNN (2015), pp. 171–175
G. Jin, The PH corpus
A. Eisele, Y. Chen, Multi UN: a multilingual corpus from United Nation documents, in Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’IO), ed. by N.C.C. Chair, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner, D. Tapias (European Language Resource, Valletta, Malta)
L.A. McEnery, Z. Xiao, The Lancaster Corpus of Mandarin Chinese: a corpus for monolingual and contrastive language study. Eur. Lang. Resour. Assoc. (2004)
Z. Yan, C. Yan, C. Zhang, Rare Chinese character recognition by radical extraction network (2017), pp. 924–929
C.-L. Liu, F. Yin, D.-H. Wang, Q.-F. Wang, Casia online and offline Chinese handwriting databases, in 2011 International Conference on Document Analysis and Recognition (ICDAR) (IEEE, 2011), pp. 37–41
S. Nishide, H.G. Okuno, T. Ogata, J. Tani, Handwriting prediction based character recognition using recurrent neural network (2011), pp. 2549–2554
A. Chaudhuri, S.K. Ghosh, Optical character recognition system for Czech language using hierarchical deep learning networks, vol. 1
L. Chen, S. Wang, W. Fan, J. Sun, N. Satoshi, Deep learning based language and orientation recognition in document analysis (2015), pp. 436–440
D. Cires, U. Meier, Multi-column deep neural networks for image classification (2011)
L. Kang, J. Kumar, P. Ye, Y. Li, D. Doermann, Convolutional neural networks for document image classification (2014), pp. 3168–3172
L. Noce, I. Gallo, A. Zamberletti, A. Calefati, Embedded textual content for document image classification with convolutional neural networks (2016)
G. Zhong, H. Yao, Y. Liu, C. Hong, T. Pham, Classification of photographed document images based on deep-learning features, in ICGIP 2016, vol. 10225 (2017), pp. 1–6
T. Wang, D.J. Wu, A.Y. Ng, End-to-end text recognition with convolutional neural networks, in ICPR (2012), pp. 3304–3308
O. Morillot, L. Likforman-sulem, O. Morillot, L. Likforman-sulem, New baseline correction algorithm for text-line recognition with bidirectional recurrent neural networks neural networks
A. Graves, M. Liwicki, S. Ferna, R. Bertolami, H. Bunke, A novel connectionist system for unconstrained handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 855–868 (2009)
A. Ray, Text recognition using deep BLSTM networks
P.P. Roy, G. Zhong, M. Cheriet, Tandem hidden Markov models using deep belief networks for offline handwriting recognition. Front. Inf. Technol. Electron. Eng. 18(61403353), 978–988 (2017)
S. Sudholt, G.A. Fink, A.W. Spotting, PHOCNet: a deep convolutional neural network for word spotting in handwritten documents (2016), pp. 277–282
A. Rehman, S. Naz, M.I. Razzak, Writer identification using machine learning approaches: a comprehensive review, in Springer Multimedia Tools and Applications (2018)
W. Feng, N. Guan, Y. Li, X. Zhang, Z. Luo, Audio visual speech recognition with multimodal recurrent neural networks (2017), pp. 681–688
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Saeed, S., Naz, S., Razzak, M.I. (2019). An Application of Deep Learning in Character Recognition: An Overview. In: Balas, V., Roy, S., Sharma, D., Samui, P. (eds) Handbook of Deep Learning Applications. Smart Innovation, Systems and Technologies, vol 136. Springer, Cham. https://doi.org/10.1007/978-3-030-11479-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-11479-4_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11478-7
Online ISBN: 978-3-030-11479-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)