A Recognition System for Devanagari Handwritten Digits using CNN, a novel approach to recognizing transcribed digits in the Devanagari script using Convolutional Neural Networks (CNN). This framework represents a significant contribution to the field of pattern recognition and language processing objective of the research project is to perform a literature review, identify an algorithm for a digits recognition system implement the Devanagari digits recognition system for educational activities. In the first phase, a dataset of 150 transcribed digit images is curated, allocating 75% for training (113 images) and 25% for validation (37 images). A Convolutional Neural Network (CNN) is designed with five convolutional layers, each utilizing 3 × 3 filters with 16, 32, 64, 128, and 128 feature maps, respectively. The experiments conducted involve varying the number of epochs, with results captured at 5, 10, 20, and 100 epochs. This comprehensive evaluation aims to understand the model's convergence and performance over different training durations. The outcomes of this phase contribute to the fine-tuning and optimization of the model for subsequent phases. In the second phase, the dataset is expanded to 100*10 (1000) images, each resized to 28 × 28 pixels through cropping. The CNN architecture remains consistent, with the previously determined layer configuration. Similar experiments are conducted, assessing the model's performance over 5, 10, 20, and 100 epochs. This model with a data size of 1000 demonstrates superior accuracy (100% on mini-batches) compared to the 150 model, with consistently high validation accuracy, while both models exhibit decreasing trends in mini-batch and validation losses, favoring the larger dataset, and maintaining a constant learning rate at 0.0100, albeit with a slightly longer time elapsed for each epoch due to the increased data size. 98.37398 accuracy in the phase 2 experiment in 100 epochs. Similar research and contributions and Devanagari’s character and word recognition system.
Published in | American Journal of Electrical and Computer Engineering (Volume 8, Issue 2) |
DOI | 10.11648/j.ajece.20240802.11 |
Page(s) | 21-30 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2024. Published by Science Publishing Group |
Deep Learning, CNN, Image Processing, Digit Recognition, Ethnic Language
Handwritten Digit | Label | Crop file size | Normalize file size | filename |
---|---|---|---|---|
0 | 78*40 | 28*28 | 0_1.jpg | |
1 | 35*39 | 28*28 | 1_1.jpg | |
2 | 105*100 | 28*28 | 2_1.jpg | |
3 | 80*140 | 28*28 | 3_1.jpg | |
4 | 58*55 | 28*28 | 4_1.jpg | |
5 | 44*40 | 28*28 | 5_1.jpg | |
6 | 29*37 | 28*28 | 6_1.jpg | |
7 | 30*72 | 28*28 | 7_1.jpg | |
8 | 70*70 | 28*28 | 8_1.jpg | |
9 | 28*28 | 28*28 | 9_1.jpg |
Epoch | Iteration | Time Elapsed (hh:mm: ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:07 | 10.09% | 20.51% | 2.9026 | 2.4778 | 0.0100 |
5 | 5 | 00:00:09 | 95.41% | 66.67% | 0.5519 | 1.1754 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm: ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:05 | 7.34% | 7.69% | 2.7801 | 2.4966 | 0.0100 |
10 | 10 | 00:00:08 | 100.00% | 84.62% | 0.1028 | 0.6449 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm:ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:06 | 6.42% | 12.82% | 2.7626 | 2.5114 | 0.0100 |
20 | 20 | 00:00:14 | 100.00% | 79.49% | 0.0265 | 0.5313 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm:ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:06 | 6.42% | 12.82% | 2.8182 | 2.5270 | 0.0100 |
50 | 50 | 00:00:21 | 100.00% | 0.0063 | 0.0100 | ||
100 | 100 | 00:00:39 | 100.00% | 82.05% | 0.0039 | 0.4195 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm: ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:09 | 8.59% | 13.82% | 2.7490 | 2.5070 | 0.0100 |
5 | 25 | 00:00:23 | 100.00% | 96.75% | 0.0572 | 0.1764 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm:ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:06 | 12.50% | 15.04% | 2.6363 | 2.4576 | 0.0100 |
10 | 50 | 00:00:34 | 100.00% | 97.56% | 0.0221 | 0.0908 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm: ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:07 | 4.69% | 16.26% | 2.7612 | 2.3145 | 0.0100 |
10 | 50 | 00:00:26 | 100.00% | 0.0166 | 0.0100 | ||
20 | 100 | 00:00:51 | 100.00% | 97.97% | 0.0085 | 0.0755 | 0.0100 |
Epoch | Iteration | Time Elapsed (hh:mm:ss) | Mini-batch Accuracy | Validation Accuracy | Mini-batch Loss | Validation Loss | Learning Rate |
---|---|---|---|---|---|---|---|
1 | 1 | 00:00:07 | 8.59% | 16.67% | 2.6120 | 2.4114 | 0.0100 |
10 | 50 | 00:00:28 | 100.00% | 0.0156 | 0.0100 | ||
20 | 100 | 00:00:50 | 100.00% | 97.56% | 0.0092 | 0.0885 | 0.0100 |
30 | 150 | 00:01:10 | 100.00% | 0.0071 | 0.0100 | ||
40 | 200 | 00:01:30 | 100.00% | 97.97% | 0.0045 | 0.0729 | 0.0100 |
50 | 250 | 00:01:51 | 100.00% | 0.0041 | 0.0100 | ||
60 | 300 | 00:02:11 | 100.00% | 97.97% | 0.0031 | 0.0675 | 0.0100 |
70 | 350 | 00:02:30 | 100.00% | 0.0028 | 0.0100 | ||
80 | 400 | 00:02:52 | 100.00% | 98.37% | 0.0024 | 0.0596 | 0.0100 |
90 | 450 | 00:03:12 | 100.00% | 0.0023 | 0.0100 | ||
100 | 500 | 00:03:30 | 100.00% | 98.37% | 0.0019 | 0.0552 | 0.0100 |
AI | Artificial Intelligences |
ANN | Artificial Neural Networks |
CNN | Convolutional Neural Networks |
KNN | K-Nearest Neighbors |
LSTM | Long Short-Term Memory |
RFC | Random Forest Classifier |
RNN | Recurrent Neural Networks |
SVM | Support Vector Machines |
[1] | A. Siddiqa and C. D S, "A Recognition System for Handwritten Digits Using CNN," International Journal of Science and Research (IJSR), 2020. |
[2] | DeepNetDevanagari: a deep learning model for Devanagari ancient character recognition, Multimedia Tools and Applications, 08 03 2021. |
[3] | S. Singh, N. K. Garg, and M. Kumar, “Feature extraction and classification techniques for handwritten Devanagari text recognition: a survey.,” Multimed Tools Appl, vol. 82, no. 1, pp. 747-747–775, 2023, |
[4] | S. R. Narang, M. K. Jindal, and M. Kumar, “Ancient text recognition: a review.,” Artif Intell Rev, vol. 53, no. 8, pp. 5517-5517–5558, 2020, |
[5] | S. P. Deore and P. Albert, "Devanagari Handwritten Character Recognition using fine-tuned Deep Convolutional Neural Network on trivial dataset," Sadhana, vol. 45, no. 243, 17 08 2020. |
[6] | R. Sethi and I. Kaushik, "Hand Written Digit Recognition using Machine Learning," 2020 IEEE 9th International Conference on Communi-cation Systems and Network Technologies (CSNT), pp. 49-54, 2020. |
[7] | G. Deng, M. Tang, Y. Zhang, Y. Huang, and X. Duan, “Privacy-Preserving Outsourced Artificial Neural Network Training for Secure Image Classification.,” Applied Sciences (2076-3417), vol. 12, no. 24, pp. 12873-12873–12887, 2022, |
[8] | F. Siddique, S. Sakib and M. A. B. Siddique, "Recognition of Handwritten Digit using Convolutional Neural Network in Python with Tensor-flow and Comparison of Performance for Various Hidden Layers," 2019 5th International Conference on Advances in Electrical Engineering (ICAEE), pp. 541-546, 2020. |
[9] | R. Geetha, T. Thilagam and T. Padmavathy, "Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN–RNN networks.," Neural Computing and Applications, 19 11 2020. |
[10] | Lindsay, G. W. (2022). Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future. Gatsby Computational Unit/Sainsbury Wellcome Centre. |
[11] | S. Sabbu and V. Ganesan, “LSTM-Based Neural Network to Recognize Human Activities Using Deep Learning Techniques.,” Applied Computational Intelligence & Soft Computing, pp. 1-1–8, 2022, |
[12] | F. Kratbi, A. Y. and H. S., "Support Vector Machines for Evaluating the," Original scientific paper Impact of the Forward Osmosis Membrane Characteristics on the Rejection of the Organic Molecules, vol. 72, no. 7, p. 417−431, 2023. |
[13] | J. Zhou, H. Xu, Z. Zhang, J. Lu, W. Guo, and Z. Li, “Using Recurrent Neural Network Structure and Multi-Head Attention with Convolution for Fraudulent Phone Text Recognition.,” Computer Systems Science & Engineering, vol. 46, no. 2, pp. 2277-2277–2297, 2023, |
[14] | C. Wu, W. Fan, Y. He, J. Sun and S. Naoi, "Handwritten Character Recognition by Alternately Trained Relaxation Convolutional Neural Network," 2014 14th International Conference on Frontiers in Handwriting Recognition, 2014. |
[15] | S. Ahlawat, A. Choudhary, N. Anand, S. Singh and B. Yoon, "Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)," Sensors, vol. 20, no. 12, 2020. |
[16] | C.-L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten digit recognition: investigation of normalization andfeature extraction techniques," Pattern Recognition, vol. 37, no. 2. |
[17] | S. Dua et al., “Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network.,” Applied Sciences (2076-3417), vol. 12, no. 12, pp. 6223-6223–6235, 2022, |
[18] | P. Sharma and J. Rai, "A Review of Feature Extraction Methods for Handwritten Character Recognition," Journal of Advances and Scholarly Researches in Allied Education, vol. 17, no. 2, 2020. |
[19] | D. Mungra, A. Agrawal, P. Sharma, S. Tanwar, and M. S. Obaidat, “PRATIT: a CNN-based emotion recognition system using histogram equalization and data augmentation.,” Multimed Tools Appl, vol. 79, no. 3/4, pp. 2285-2285–2307, 2020, |
[20] | A. Sen, T. K. Mishra, and R. Dash, “A novel hand gesture detection and recognition system based on ensemble-based convolutional neural network.,” Multimed Tools Appl, vol. 81, no. 28, pp. 40043-40043–40066, 2022, |
[21] | H. Singh, R. K. Sharma, and V. P. Singh, “Online handwriting recognition systems for Indic and non-Indic scripts: a review.,” Artif Intell Rev, vol. 54, no. 2, pp. 1525-1525–1579, 2021, |
[22] | M. Á. Serrano, A. Flammini, and F. Menczer, “Modeling Statistical Properties of Written Text.,” PLoS One, vol. 4, no. 4, pp. 1-1–8, 2009, |
[23] | M. Sun, Z. Song, X. Jiang, J. Pan, and Y. Pang, “Learning Pooling for Convolutional Neural Network,” Neurocomputing, vol. 224. Elsevier BV, pp. 96–104, Feb. 2017. |
APA Style
Ghimire, N. (2024). A Recognition System for Devanagari Handwritten Digits Using CNN. American Journal of Electrical and Computer Engineering, 8(2), 21-30. https://doi.org/10.11648/j.ajece.20240802.11
ACS Style
Ghimire, N. A Recognition System for Devanagari Handwritten Digits Using CNN. Am. J. Electr. Comput. Eng. 2024, 8(2), 21-30. doi: 10.11648/j.ajece.20240802.11
AMA Style
Ghimire N. A Recognition System for Devanagari Handwritten Digits Using CNN. Am J Electr Comput Eng. 2024;8(2):21-30. doi: 10.11648/j.ajece.20240802.11
@article{10.11648/j.ajece.20240802.11, author = {Nawaraj Ghimire}, title = {A Recognition System for Devanagari Handwritten Digits Using CNN }, journal = {American Journal of Electrical and Computer Engineering}, volume = {8}, number = {2}, pages = {21-30}, doi = {10.11648/j.ajece.20240802.11}, url = {https://doi.org/10.11648/j.ajece.20240802.11}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajece.20240802.11}, abstract = {A Recognition System for Devanagari Handwritten Digits using CNN, a novel approach to recognizing transcribed digits in the Devanagari script using Convolutional Neural Networks (CNN). This framework represents a significant contribution to the field of pattern recognition and language processing objective of the research project is to perform a literature review, identify an algorithm for a digits recognition system implement the Devanagari digits recognition system for educational activities. In the first phase, a dataset of 150 transcribed digit images is curated, allocating 75% for training (113 images) and 25% for validation (37 images). A Convolutional Neural Network (CNN) is designed with five convolutional layers, each utilizing 3 × 3 filters with 16, 32, 64, 128, and 128 feature maps, respectively. The experiments conducted involve varying the number of epochs, with results captured at 5, 10, 20, and 100 epochs. This comprehensive evaluation aims to understand the model's convergence and performance over different training durations. The outcomes of this phase contribute to the fine-tuning and optimization of the model for subsequent phases. In the second phase, the dataset is expanded to 100*10 (1000) images, each resized to 28 × 28 pixels through cropping. The CNN architecture remains consistent, with the previously determined layer configuration. Similar experiments are conducted, assessing the model's performance over 5, 10, 20, and 100 epochs. This model with a data size of 1000 demonstrates superior accuracy (100% on mini-batches) compared to the 150 model, with consistently high validation accuracy, while both models exhibit decreasing trends in mini-batch and validation losses, favoring the larger dataset, and maintaining a constant learning rate at 0.0100, albeit with a slightly longer time elapsed for each epoch due to the increased data size. 98.37398 accuracy in the phase 2 experiment in 100 epochs. Similar research and contributions and Devanagari’s character and word recognition system. }, year = {2024} }
TY - JOUR T1 - A Recognition System for Devanagari Handwritten Digits Using CNN AU - Nawaraj Ghimire Y1 - 2024/07/29 PY - 2024 N1 - https://doi.org/10.11648/j.ajece.20240802.11 DO - 10.11648/j.ajece.20240802.11 T2 - American Journal of Electrical and Computer Engineering JF - American Journal of Electrical and Computer Engineering JO - American Journal of Electrical and Computer Engineering SP - 21 EP - 30 PB - Science Publishing Group SN - 2640-0502 UR - https://doi.org/10.11648/j.ajece.20240802.11 AB - A Recognition System for Devanagari Handwritten Digits using CNN, a novel approach to recognizing transcribed digits in the Devanagari script using Convolutional Neural Networks (CNN). This framework represents a significant contribution to the field of pattern recognition and language processing objective of the research project is to perform a literature review, identify an algorithm for a digits recognition system implement the Devanagari digits recognition system for educational activities. In the first phase, a dataset of 150 transcribed digit images is curated, allocating 75% for training (113 images) and 25% for validation (37 images). A Convolutional Neural Network (CNN) is designed with five convolutional layers, each utilizing 3 × 3 filters with 16, 32, 64, 128, and 128 feature maps, respectively. The experiments conducted involve varying the number of epochs, with results captured at 5, 10, 20, and 100 epochs. This comprehensive evaluation aims to understand the model's convergence and performance over different training durations. The outcomes of this phase contribute to the fine-tuning and optimization of the model for subsequent phases. In the second phase, the dataset is expanded to 100*10 (1000) images, each resized to 28 × 28 pixels through cropping. The CNN architecture remains consistent, with the previously determined layer configuration. Similar experiments are conducted, assessing the model's performance over 5, 10, 20, and 100 epochs. This model with a data size of 1000 demonstrates superior accuracy (100% on mini-batches) compared to the 150 model, with consistently high validation accuracy, while both models exhibit decreasing trends in mini-batch and validation losses, favoring the larger dataset, and maintaining a constant learning rate at 0.0100, albeit with a slightly longer time elapsed for each epoch due to the increased data size. 98.37398 accuracy in the phase 2 experiment in 100 epochs. Similar research and contributions and Devanagari’s character and word recognition system. VL - 8 IS - 2 ER -