Enhanced Category-Feature Association Measure

Bibliographic Details
Main Authors: Soran S. Badawi, Ari M. Saeed, Sara A. Ahmed, Diyari A. Hassan
Format: Article
Language: English
Published: Koya University 2025-08-01
Series: ARO-The Scientific Journal of Koya University
Online Access: https://aro.koyauniversity.org/index.php/aro/article/view/2034
Description
Summary: Categorizing large, high-dimensional text data accurately and efficiently is a major challenge in text classification. Many features confuse the classification process, so feature selection (FS) strategies are needed to deal with high dimensionality. This paper proposes a novel FS technique based on an enhanced category-feature association measure (ECFAM). ECFAM uses the existence and elimination of terms, together with the complicated relationships among terms across different sections. This approach emphasizes the key role of ancillary terms in classifying and differentiating categories. The comparison is carried out on two widely used datasets, Reuters-21578 and 20-Newsgroups, using two widely employed supervised machine learning classifiers and one deep learning algorithm. The experiments cover nine feature-set sizes, ranging from 50 to 4000. The experimental results show that ECFAM consistently performs better than the other methods in terms of accuracy and computational cost.
ISSN: 2410-9355, 2307-549X