The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review

Document classification, a critical area of research, employs machine and deep learning methods to solve real-world problems. This study attempts to highlight the qualitative and quantitative outcomes of the literature review from a broad range of scopes, including machine and deep learning methods,...

Full description

Saved in:
Bibliographic Details
Main Authors: Marcin Marcin Michal Mironczuk, Adam Muller, Witold Pedrycz
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10786331/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846115034124517376
author Marcin Marcin Michal Mironczuk
Adam Muller
Witold Pedrycz
author_facet Marcin Marcin Michal Mironczuk
Adam Muller
Witold Pedrycz
author_sort Marcin Marcin Michal Mironczuk
collection DOAJ
description Document classification, a critical area of research, employs machine and deep learning methods to solve real-world problems. This study attempts to highlight the qualitative and quantitative outcomes of the literature review from a broad range of scopes, including machine and deep learning methods, as well as solutions based on nature, biological, or quantum physics-inspired methods. A rigorous synthesis was conducted using a systematic literature review of 102 papers published between 2003 and 2023. The 20 Newsgroups (bydate version) were used as a reference point of benchmarks to ensure fair comparisons of methods. Qualitative analysis revealed that recent studies utilize Graph Neural Networks (GNNs) combined with models based on the transformer architecture and propose end-to-end solutions. Quantitative analysis demonstrated state-of-the-art results, with accuracy, micro and macro F1-scores of 90.38%, 88.28%, and 89.38%, respectively. However, the reproducibility of many studies may need to be revised for the scientific community. The resulting overview covers a wide range of document classification methods and can contribute to a better understanding of this field. Additionally, the systematic review approach reduces systematic error, making it useful for researchers in the document classification community.
format Article
id doaj-art-7b4599cfc4394068a1e69ae369b7ede9
institution Kabale University
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-7b4599cfc4394068a1e69ae369b7ede92024-12-20T00:01:19ZengIEEEIEEE Access2169-35362024-01-011218925318928710.1109/ACCESS.2024.351355010786331The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic ReviewMarcin Marcin Michal Mironczuk0https://orcid.org/0000-0002-4951-2264Adam Muller1Witold Pedrycz2https://orcid.org/0000-0002-9335-9930National Information Processing Institute, Warsaw, Masovian Voivodeship, PolandNational Information Processing Institute, Warsaw, Masovian Voivodeship, PolandDepartment of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, CanadaDocument classification, a critical area of research, employs machine and deep learning methods to solve real-world problems. This study attempts to highlight the qualitative and quantitative outcomes of the literature review from a broad range of scopes, including machine and deep learning methods, as well as solutions based on nature, biological, or quantum physics-inspired methods. A rigorous synthesis was conducted using a systematic literature review of 102 papers published between 2003 and 2023. The 20 Newsgroups (bydate version) were used as a reference point of benchmarks to ensure fair comparisons of methods. Qualitative analysis revealed that recent studies utilize Graph Neural Networks (GNNs) combined with models based on the transformer architecture and propose end-to-end solutions. Quantitative analysis demonstrated state-of-the-art results, with accuracy, micro and macro F1-scores of 90.38%, 88.28%, and 89.38%, respectively. However, the reproducibility of many studies may need to be revised for the scientific community. The resulting overview covers a wide range of document classification methods and can contribute to a better understanding of this field. Additionally, the systematic review approach reduces systematic error, making it useful for researchers in the document classification community.https://ieeexplore.ieee.org/document/10786331/Document classificationtext classificationsystematic reviewsurveys and overviewssupervised learning by classificationnatural language processing
spellingShingle Marcin Marcin Michal Mironczuk
Adam Muller
Witold Pedrycz
The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
IEEE Access
Document classification
text classification
systematic review
surveys and overviews
supervised learning by classification
natural language processing
title The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
title_full The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
title_fullStr The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
title_full_unstemmed The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
title_short The Outcomes and Publication Standards of Research Descriptions in Document Classification: A Systematic Review
title_sort outcomes and publication standards of research descriptions in document classification a systematic review
topic Document classification
text classification
systematic review
surveys and overviews
supervised learning by classification
natural language processing
url https://ieeexplore.ieee.org/document/10786331/
work_keys_str_mv AT marcinmarcinmichalmironczuk theoutcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview
AT adammuller theoutcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview
AT witoldpedrycz theoutcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview
AT marcinmarcinmichalmironczuk outcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview
AT adammuller outcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview
AT witoldpedrycz outcomesandpublicationstandardsofresearchdescriptionsindocumentclassificationasystematicreview