Unsupervised feature selection in binarization of real attributes for conceptual clustering

This paper proposes an approach for processing noisy data to form homogeneous subgroups of objects based on Formal Concept Analysis (FCA). The approach involves binary encoding of heterogeneous features and unsupervised feature selection using the Laplacian Score. The selected feature set is then us...

Full description

Saved in:
Bibliographic Details
Main Authors: Shkaberina Guzel, Masich Igor, Markushin Egor, Kraeva Ekaterina
Format: Article
Language:English
Published: EDP Sciences 2025-01-01
Series:ITM Web of Conferences
Online Access:https://www.itm-conferences.org/articles/itmconf/pdf/2025/03/itmconf_hmmocs-III2024_04004.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850195521217495040
author Shkaberina Guzel
Masich Igor
Markushin Egor
Kraeva Ekaterina
author_facet Shkaberina Guzel
Masich Igor
Markushin Egor
Kraeva Ekaterina
author_sort Shkaberina Guzel
collection DOAJ
description This paper proposes an approach for processing noisy data to form homogeneous subgroups of objects based on Formal Concept Analysis (FCA). The approach involves binary encoding of heterogeneous features and unsupervised feature selection using the Laplacian Score. The selected feature set is then used to generate formal concepts. The main idea of our research is to use the concepts derived through FCA as new features for clustering. This process transforms the original feature space into a concept-driven space, where each feature corresponds to the extents of the derived concepts. The proposed approach enhances clustering performance in the presence of noise, outperforming the traditional K-means clustering algorithm in terms of cluster coherence and accuracy. By utilizing concept-based features, the method is able to better capture the underlying structure of the data, leading to more robust and meaningful groupings compared to conventional attribute-based clustering techniques.
format Article
id doaj-art-89dd29b1859d45d4b921e86d89e985a0
institution OA Journals
issn 2271-2097
language English
publishDate 2025-01-01
publisher EDP Sciences
record_format Article
series ITM Web of Conferences
spelling doaj-art-89dd29b1859d45d4b921e86d89e985a02025-08-20T02:13:44ZengEDP SciencesITM Web of Conferences2271-20972025-01-01720400410.1051/itmconf/20257204004itmconf_hmmocs-III2024_04004Unsupervised feature selection in binarization of real attributes for conceptual clusteringShkaberina Guzel0Masich Igor1Markushin Egor2Kraeva Ekaterina3Laboratory “Hybrid Methods of Modeling and Optimization in Complex Systems”, Siberian Federal UniversityLaboratory “Hybrid Methods of Modeling and Optimization in Complex Systems”, Siberian Federal UniversityReshetnev Siberian State University of Science and TechnologyReshetnev Siberian State University of Science and TechnologyThis paper proposes an approach for processing noisy data to form homogeneous subgroups of objects based on Formal Concept Analysis (FCA). The approach involves binary encoding of heterogeneous features and unsupervised feature selection using the Laplacian Score. The selected feature set is then used to generate formal concepts. The main idea of our research is to use the concepts derived through FCA as new features for clustering. This process transforms the original feature space into a concept-driven space, where each feature corresponds to the extents of the derived concepts. The proposed approach enhances clustering performance in the presence of noise, outperforming the traditional K-means clustering algorithm in terms of cluster coherence and accuracy. By utilizing concept-based features, the method is able to better capture the underlying structure of the data, leading to more robust and meaningful groupings compared to conventional attribute-based clustering techniques.https://www.itm-conferences.org/articles/itmconf/pdf/2025/03/itmconf_hmmocs-III2024_04004.pdf
spellingShingle Shkaberina Guzel
Masich Igor
Markushin Egor
Kraeva Ekaterina
Unsupervised feature selection in binarization of real attributes for conceptual clustering
ITM Web of Conferences
title Unsupervised feature selection in binarization of real attributes for conceptual clustering
title_full Unsupervised feature selection in binarization of real attributes for conceptual clustering
title_fullStr Unsupervised feature selection in binarization of real attributes for conceptual clustering
title_full_unstemmed Unsupervised feature selection in binarization of real attributes for conceptual clustering
title_short Unsupervised feature selection in binarization of real attributes for conceptual clustering
title_sort unsupervised feature selection in binarization of real attributes for conceptual clustering
url https://www.itm-conferences.org/articles/itmconf/pdf/2025/03/itmconf_hmmocs-III2024_04004.pdf
work_keys_str_mv AT shkaberinaguzel unsupervisedfeatureselectioninbinarizationofrealattributesforconceptualclustering
AT masichigor unsupervisedfeatureselectioninbinarizationofrealattributesforconceptualclustering
AT markushinegor unsupervisedfeatureselectioninbinarizationofrealattributesforconceptualclustering
AT kraevaekaterina unsupervisedfeatureselectioninbinarizationofrealattributesforconceptualclustering