Building morphological analyzer for Nepali

Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & M...

Full description

Saved in:
Bibliographic Details
Main Authors: Shahid Mushtaq Bhat, Rupesh Rai
Format: Article
Language:English
Published: Universiti Malaya 2012-12-01
Series:Journal of Modern Languages
Subjects:
Online Access:https://ajap.um.edu.my/index.php/JML/article/view/3297
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846159212434948096
author Shahid Mushtaq Bhat
Rupesh Rai
author_facet Shahid Mushtaq Bhat
Rupesh Rai
author_sort Shahid Mushtaq Bhat
collection DOAJ
description Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & Machine Translation (MT) Systems. This paper describes an ongoing effort to develop Nepali morphological analyzer, using an open source platform-Apertium (LT-Toolbox). Since, it is the initial stage of this project; we have confined our work to inflectional morphology. So far, we have covered all the possible categories, as per LDC-IL1 POS tag-set of Nepali. Currently, the coverage of Nepali Morph-Analyzer is 20,000 words, classified into 219 paradigms.
format Article
id doaj-art-e6b432cfcf2848ada7b7250f455be3db
institution Kabale University
issn 1675-526X
2462-1986
language English
publishDate 2012-12-01
publisher Universiti Malaya
record_format Article
series Journal of Modern Languages
spelling doaj-art-e6b432cfcf2848ada7b7250f455be3db2024-11-23T19:00:20ZengUniversiti MalayaJournal of Modern Languages1675-526X2462-19862012-12-01221Building morphological analyzer for NepaliShahid Mushtaq Bhat0Rupesh Rai1Linguistic Data Consortium for Indian LanguagesCentral Institute of Indian Languages Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & Machine Translation (MT) Systems. This paper describes an ongoing effort to develop Nepali morphological analyzer, using an open source platform-Apertium (LT-Toolbox). Since, it is the initial stage of this project; we have confined our work to inflectional morphology. So far, we have covered all the possible categories, as per LDC-IL1 POS tag-set of Nepali. Currently, the coverage of Nepali Morph-Analyzer is 20,000 words, classified into 219 paradigms. https://ajap.um.edu.my/index.php/JML/article/view/3297Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
spellingShingle Shahid Mushtaq Bhat
Rupesh Rai
Building morphological analyzer for Nepali
Journal of Modern Languages
Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
title Building morphological analyzer for Nepali
title_full Building morphological analyzer for Nepali
title_fullStr Building morphological analyzer for Nepali
title_full_unstemmed Building morphological analyzer for Nepali
title_short Building morphological analyzer for Nepali
title_sort building morphological analyzer for nepali
topic Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
url https://ajap.um.edu.my/index.php/JML/article/view/3297
work_keys_str_mv AT shahidmushtaqbhat buildingmorphologicalanalyzerfornepali
AT rupeshrai buildingmorphologicalanalyzerfornepali