Reading Modi Lipi: A Deep Learning Journey in Character Recognition

Advancements in deep learning methodologies have played a significant role in the success of various character recognition processes. Character recognition refers to the technique of identifying either handwritten or printed characters from documents and their conversion into a form that can be read...

Bibliographic Details
Main Authors:	Kanchan Varpe, Sachin Sakhare
Format:	Article
Language:	English
Published:	Universidad Internacional de La Rioja (UNIR) 2025-01-01
Series:	International Journal of Interactive Multimedia and Artificial Intelligence
Subjects:	deep learning; feature extraction; image processing; inceptionnet v3; resnet50; resnet9
Online Access:	https://www.ijimai.org/journal/bibcite/reference/3487

_version_	1841560812072730624
author	Kanchan Varpe, Sachin Sakhare
collection	DOAJ
description	Advances in deep learning have contributed significantly to the success of character recognition. Character recognition is the identification of handwritten or printed characters in documents and their conversion into machine-readable form. MODI script, an ancient Indian script, is categorized under the Devanagari script and holds historical significance. Despite this importance, only a few MODI translators are available, while a vast number of historical documents written in MODI remain undeciphered. Recognizing characters in Indian-language scripts is challenging because the scripts are complex and individual writing styles vary. This paper gives an overview of recent advances in Handwritten Optical Character Recognition (HWCR) methods designed for the MODI script. Residual networks and Inception architectures have recently become popular for image classification. The authors implement three models, ResNet9, ResNet50, and InceptionNet V3, trained specifically on handwritten MODI characters and vowels. The models are trained on handwritten MODI script images, and the benchmark handwritten MODI database from IEEE DataPort is used to evaluate performance. The dataset contains 46 classes (10 vowel classes and 36 consonant classes) with 90 images each, for a total of 4140 images of size 227×227. The trained models achieve accuracies of 98.92% for ResNet9, 91.91% for ResNet50, and 86% for InceptionNet V3. Compared with existing models, the proposed model attains improved performance and lower training and validation losses. Its reported advantages over the state of the art are minimal training and validation loss, improved generalization and robustness, and improved model scalability.
format	Article
id	doaj-art-db6f6decaa2445e2b689560b54aaa573
institution	Kabale University
issn	1989-1660
language	English
publishDate	2025-01-01
publisher	Universidad Internacional de La Rioja (UNIR)
record_format	Article
series	International Journal of Interactive Multimedia and Artificial Intelligence
spelling	doaj-art-db6f6decaa2445e2b689560b54aaa573 2025-01-03T15:20:35Z eng Universidad Internacional de La Rioja (UNIR) International Journal of Interactive Multimedia and Artificial Intelligence 1989-1660 2025-01-01 917583 10.9781/ijimai.2024.09.002 ijimai.2024.09.002 Reading Modi Lipi: A Deep Learning Journey in Character Recognition Kanchan Varpe Sachin Sakhare https://www.ijimai.org/journal/bibcite/reference/3487 deep learning feature extraction image processing inceptionnet v3 resnet50 resnet9
title	Reading Modi Lipi: A Deep Learning Journey in Character Recognition
topic	deep learning; feature extraction; image processing; inceptionnet v3; resnet50; resnet9
url	https://www.ijimai.org/journal/bibcite/reference/3487
work_keys_str_mv	AT kanchanvarpe readingmodilipiadeeplearningjourneyincharacterrecognition AT sachinsakhare readingmodilipiadeeplearningjourneyincharacterrecognition
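
The dataset figures given in the description (46 classes, 90 images per class, 227×227 images) can be checked with a few lines of Python. The sketch below is illustrative only: the record does not say how the authors split the data, so the 80/20 per-class split, the function names, and the random seed are all assumptions introduced here.

```python
import random

# Figures taken from the record's description field.
NUM_VOWEL_CLASSES = 10
NUM_CONSONANT_CLASSES = 36
IMAGES_PER_CLASS = 90
IMAGE_SIDE = 227  # images are 227x227


def dataset_summary():
    """Return the class count, image count, and pixels per image."""
    num_classes = NUM_VOWEL_CLASSES + NUM_CONSONANT_CLASSES
    return {
        "num_classes": num_classes,
        "total_images": num_classes * IMAGES_PER_CLASS,
        "pixels_per_image": IMAGE_SIDE * IMAGE_SIDE,
    }


def stratified_split(train_fraction=0.8, seed=0):
    """Split (class_id, image_index) pairs per class.

    ASSUMPTION: the 80/20 ratio and the seed are hypothetical; the
    record does not state the split used in the paper.
    """
    rng = random.Random(seed)
    num_classes = NUM_VOWEL_CLASSES + NUM_CONSONANT_CLASSES
    n_train = round(IMAGES_PER_CLASS * train_fraction)
    train, val = [], []
    for class_id in range(num_classes):
        indices = list(range(IMAGES_PER_CLASS))
        rng.shuffle(indices)  # shuffle within the class only
        train += [(class_id, i) for i in indices[:n_train]]
        val += [(class_id, i) for i in indices[n_train:]]
    return train, val


if __name__ == "__main__":
    print(dataset_summary())
    train, val = stratified_split()
    print(len(train), len(val))
```

Splitting within each class keeps every one of the 46 classes represented in both subsets, which matters when each class has only 90 images.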
