Building morphological analyzer for Nepali

Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & M...

Full description

Saved in:
Bibliographic Details
Main Authors: Shahid Mushtaq Bhat, Rupesh Rai
Format: Article
Language:English
Published: Universiti Malaya 2012-12-01
Series:Journal of Modern Languages
Subjects:
Online Access:http://jml.um.edu.my/index.php/JML/article/view/3297
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & Machine Translation (MT) Systems. This paper describes an ongoing effort to develop Nepali morphological analyzer, using an open source platform-Apertium (LT-Toolbox). Since, it is the initial stage of this project; we have confined our work to inflectional morphology. So far, we have covered all the possible categories, as per LDC-IL1 POS tag-set of Nepali. Currently, the coverage of Nepali Morph-Analyzer is 20,000 words, classified into 219 paradigms.
ISSN:1675-526X
2462-1986