Reading Modi Lipi: A Deep Learning Journey in Character Recognition
Advancements in deep learning methodologies have played a significant role in the success of various character recognition processes. Character recognition refers to the technique of identifying either handwritten or printed characters from documents and their conversion into a form that can be read...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universidad Internacional de La Rioja (UNIR)
2025-01-01
|
Series: | International Journal of Interactive Multimedia and Artificial Intelligence |
Subjects: | |
Online Access: | https://www.ijimai.org/journal/bibcite/reference/3487 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841560812072730624 |
---|---|
author | Kanchan Varpe Sachin Sakhare |
author_facet | Kanchan Varpe Sachin Sakhare |
author_sort | Kanchan Varpe |
collection | DOAJ |
description | Advancements in deep learning methodologies have played a significant role in the success of various character recognition processes. Character recognition refers to the technique of identifying either handwritten or printed characters from documents and their conversion into a form that can be read by machines. MODI script, an ancient Indian script, is categorized under the Devanagari script and holds historical significance. Despite its historical importance, there are only a few MODI translators available. Conversely, there exist a vast number of historical documents written in MODI that are yet to be deciphered. Recognizing characters in Indian language scripts poses many challenges due to the complex nature of the scripts and variations in individuals' writing styles. This paper provides an overview of the newest advancements in the Handwritten Optical Character Recognition (HWCR) methodology specifically designed for the MODI script. Utilization of residual networks and inception in image classification has gained popularity in recent times. In this paper the authors have implemented three techniques: ResNet9, ResNet50, and InceptionNet V3, trained specifically for handwritten MODI characters and vowels. The dataset used for training the models consists of handwritten MODI script images. The benchmark database from IEEE data port for handwritten MODI script is used to evaluate the performance. The dataset contains 46 classes, including 10 vowel classes and 36 consonant classes. Each class comprises 90 images, resulting in a total of 4140 images. The image size in the dataset is 227×227. The accuracy achieved by the trained models is as follows: 98.92% for ResNet9, 91.91% for ResNet50, and 86% for Inception Net V3. The obtained results have been compared with existing models and it is observed that the proposed model attained improved performance parameters and less training and validation losses in comparison to existing methods. There are several advantages of the proposed model in comparison to state of the art, namely minimal training and validation loss. In addition to this, the proposed approach improved generalization and robustness, and improved model scalability. |
format | Article |
id | doaj-art-db6f6decaa2445e2b689560b54aaa573 |
institution | Kabale University |
issn | 1989-1660 |
language | English |
publishDate | 2025-01-01 |
publisher | Universidad Internacional de La Rioja (UNIR) |
record_format | Article |
series | International Journal of Interactive Multimedia and Artificial Intelligence |
spelling | doaj-art-db6f6decaa2445e2b689560b54aaa5732025-01-03T15:20:35ZengUniversidad Internacional de La Rioja (UNIR)International Journal of Interactive Multimedia and Artificial Intelligence1989-16602025-01-0191758310.9781/ijimai.2024.09.002ijimai.2024.09.002Reading Modi Lipi: A Deep Learning Journey in Character RecognitionKanchan VarpeSachin SakhareAdvancements in deep learning methodologies have played a significant role in the success of various character recognition processes. Character recognition refers to the technique of identifying either handwritten or printed characters from documents and their conversion into a form that can be read by machines. MODI script, an ancient Indian script, is categorized under the Devanagari script and holds historical significance. Despite its historical importance, there are only a few MODI translators available. Conversely, there exist a vast number of historical documents written in MODI that are yet to be deciphered. Recognizing characters in Indian language scripts poses many challenges due to the complex nature of the scripts and variations in individuals' writing styles. This paper provides an overview of the newest advancements in the Handwritten Optical Character Recognition (HWCR) methodology specifically designed for the MODI script. Utilization of residual networks and inception in image classification has gained popularity in recent times. In this paper the authors have implemented three techniques: ResNet9, ResNet50, and InceptionNet V3, trained specifically for handwritten MODI characters and vowels. The dataset used for training the models consists of handwritten MODI script images. The benchmark database from IEEE data port for handwritten MODI script is used to evaluate the performance. The dataset contains 46 classes, including 10 vowel classes and 36 consonant classes. Each class comprises 90 images, resulting in a total of 4140 images. The image size in the dataset is 227×227. The accuracy achieved by the trained models is as follows: 98.92% for ResNet9, 91.91% for ResNet50, and 86% for Inception Net V3. The obtained results have been compared with existing models and it is observed that the proposed model attained improved performance parameters and less training and validation losses in comparison to existing methods. There are several advantages of the proposed model in comparison to state of the art, namely minimal training and validation loss. In addition to this, the proposed approach improved generalization and robustness, and improved model scalability.https://www.ijimai.org/journal/bibcite/reference/3487deep learningfeature extractionimage processinginceptionnet v3resnet50resnet9 |
spellingShingle | Kanchan Varpe Sachin Sakhare Reading Modi Lipi: A Deep Learning Journey in Character Recognition International Journal of Interactive Multimedia and Artificial Intelligence deep learning feature extraction image processing inceptionnet v3 resnet50 resnet9 |
title | Reading Modi Lipi: A Deep Learning Journey in Character Recognition |
title_full | Reading Modi Lipi: A Deep Learning Journey in Character Recognition |
title_fullStr | Reading Modi Lipi: A Deep Learning Journey in Character Recognition |
title_full_unstemmed | Reading Modi Lipi: A Deep Learning Journey in Character Recognition |
title_short | Reading Modi Lipi: A Deep Learning Journey in Character Recognition |
title_sort | reading modi lipi a deep learning journey in character recognition |
topic | deep learning feature extraction image processing inceptionnet v3 resnet50 resnet9 |
url | https://www.ijimai.org/journal/bibcite/reference/3487 |
work_keys_str_mv | AT kanchanvarpe readingmodilipiadeeplearningjourneyincharacterrecognition AT sachinsakhare readingmodilipiadeeplearningjourneyincharacterrecognition |