Building morphological analyzer for Nepali

A morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used to improve the accuracy of POS tagging, chunking, syntactic parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & M...

Bibliographic Details
Main Authors: Shahid Mushtaq Bhat, Rupesh Rai
Format: Article
Language: English
Published: Universiti Malaya 2012-12-01
Series: Journal of Modern Languages
Subjects:
Online Access: https://ejournal.um.edu.my/index.php/JML/article/view/3297
author Shahid Mushtaq Bhat
Rupesh Rai
collection DOAJ
description A morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used to improve the accuracy of POS tagging, chunking, syntactic parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR), and Machine Translation (MT) systems. This paper describes an ongoing effort to develop a Nepali morphological analyzer using the open-source platform Apertium (LT-Toolbox). Since this is the initial stage of the project, we have confined our work to inflectional morphology. So far, we have covered all the possible categories in the LDC-IL POS tag-set for Nepali. Currently, the Nepali morph-analyzer covers 20,000 words, classified into 219 paradigms.
format Article
id doaj-art-b933b97e149445f9811bacf6a2d6fdb3
institution Kabale University
issn 1675-526X
2462-1986
language English
publishDate 2012-12-01
publisher Universiti Malaya
record_format Article
series Journal of Modern Languages
spelling doaj-art-b933b97e149445f9811bacf6a2d6fdb3; 2024-11-20T04:48:15Z; eng; Universiti Malaya; Journal of Modern Languages; 1675-526X; 2462-1986; 2012-12-01; 22; 1; Building morphological analyzer for Nepali; Shahid Mushtaq Bhat (0), Rupesh Rai (1); Linguistic Data Consortium for Indian Languages; Central Institute of Indian Languages; https://ejournal.um.edu.my/index.php/JML/article/view/3297; Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
title Building morphological analyzer for Nepali
topic Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
url https://ejournal.um.edu.my/index.php/JML/article/view/3297
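
The record's description and keywords name a word-and-paradigm model for inflectional morphology. As a rough illustration of that idea only, the Python sketch below maps each stem to a paradigm and each paradigm to a list of suffix/tag pairs. It is not the authors' implementation and not Apertium's dictionary format (lttoolbox dictionaries are written in XML); the paradigm name, the tag strings, and the two-entry lexicon are hypothetical, chosen only to show the lookup. The one linguistic fact assumed is that the Nepali plural marker -हरू attaches to the noun stem.

```python
from typing import Dict, List, Tuple

# Hypothetical paradigm table: paradigm name -> list of (suffix, tags).
# The tag strings are illustrative, not the LDC-IL tag-set.
PARADIGMS: Dict[str, List[Tuple[str, str]]] = {
    "noun_regular": [
        ("", "<n><sg>"),      # bare stem: singular
        ("हरू", "<n><pl>"),   # stem + plural marker
    ],
}

# Hypothetical lexicon: stem -> paradigm name.
LEXICON: Dict[str, str] = {
    "घर": "noun_regular",     # "house"
    "किताब": "noun_regular",  # "book"
}


def analyze(word_form: str) -> List[str]:
    """Return every stem+tags analysis licensed by the lexicon and paradigms."""
    analyses: List[str] = []
    for stem, paradigm in LEXICON.items():
        if not word_form.startswith(stem):
            continue
        remainder = word_form[len(stem):]
        # The word-form is accepted only if what follows the stem
        # is exactly one of the suffixes in the stem's paradigm.
        for suffix, tags in PARADIGMS[paradigm]:
            if remainder == suffix:
                analyses.append(stem + tags)
    return analyses


if __name__ == "__main__":
    for form in ["घर", "घरहरू", "किताबहरू", "पानी"]:
        print(form, analyze(form))
```

Because many stems share one paradigm, adding a word to such a system means adding one lexicon entry rather than listing every inflected form, which is how a lexicon of 20,000 words can be described by 219 paradigms.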