Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine

Visitor reviews play a crucial role in determining the success of a business, particularly those offering hospitality and services, such as hotels. The growth of internet technology has made it easier for guests to share their experiences, which can influence potential customers. Google Maps is one...

Full description

Saved in:
Bibliographic Details
Main Authors: Bayu Yanuargi, Ema Utami, Kusrini, Arli Aditya Parikesit
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2024-12-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:https://jurnal.iaii.or.id/index.php/RESTI/article/view/6139
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544035505799168
author Bayu Yanuargi
Ema Utami
Kusrini
Arli Aditya Parikesit
author_facet Bayu Yanuargi
Ema Utami
Kusrini
Arli Aditya Parikesit
author_sort Bayu Yanuargi
collection DOAJ
description Visitor reviews play a crucial role in determining the success of a business, particularly those offering hospitality and services, such as hotels. The growth of internet technology has made it easier for guests to share their experiences, which can influence potential customers. Google Maps is one of the platforms used for giving and searching reviews This research uses data crawled from Google Maps Review using the playwright library. However, the large volume of reviews can make analysis and topic-based categorization—such as service quality, hotel location, and operational hours—challenging. To address this, DBSCAN is used to cluster reviews based on these topics. Clustering helps improve sentiment classification, making it more targeted and allowing a comparison of two machine learning algorithms: Naïve Bayes and Support Vector Machine (SVM). Naïve Bayes achieved higher accuracy (0.87) in the operational hours cluster, while SVM scored 0.78. However, SVM showed improved accuracy in the location (0.89) and service (0.88) clusters, with Naïve Bayes maintaining a stable 0.86 accuracy in both. Both models demonstrated an average training time of less than one second, excluding preprocessing.
format Article
id doaj-art-2031c6ffb8a447e0bbc9b7c491399439
institution Kabale University
issn 2580-0760
language English
publishDate 2024-12-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj-art-2031c6ffb8a447e0bbc9b7c4913994392025-01-13T03:30:32ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602024-12-018681982710.29207/resti.v8i6.61396139Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector MachineBayu Yanuargi0Ema Utami1Kusrini2Arli Aditya Parikesit3Universitas Amikom YogyakartaUniversitas AMIKOM YogyakartaUniversitas AMIKOM YogyakartaIndonesia International Institute for Life SciencesVisitor reviews play a crucial role in determining the success of a business, particularly those offering hospitality and services, such as hotels. The growth of internet technology has made it easier for guests to share their experiences, which can influence potential customers. Google Maps is one of the platforms used for giving and searching reviews This research uses data crawled from Google Maps Review using the playwright library. However, the large volume of reviews can make analysis and topic-based categorization—such as service quality, hotel location, and operational hours—challenging. To address this, DBSCAN is used to cluster reviews based on these topics. Clustering helps improve sentiment classification, making it more targeted and allowing a comparison of two machine learning algorithms: Naïve Bayes and Support Vector Machine (SVM). Naïve Bayes achieved higher accuracy (0.87) in the operational hours cluster, while SVM scored 0.78. However, SVM showed improved accuracy in the location (0.89) and service (0.88) clusters, with Naïve Bayes maintaining a stable 0.86 accuracy in both. Both models demonstrated an average training time of less than one second, excluding preprocessing.https://jurnal.iaii.or.id/index.php/RESTI/article/view/6139sentiment analysishotelclusteringnaïve bayessupport vector machine
spellingShingle Bayu Yanuargi
Ema Utami
Kusrini
Arli Aditya Parikesit
Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
sentiment analysis
hotel
clustering
naïve bayes
support vector machine
title Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
title_full Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
title_fullStr Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
title_full_unstemmed Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
title_short Data Clustering for Sentiment Classification with Naïve Bayes and Support Vector Machine
title_sort data clustering for sentiment classification with naive bayes and support vector machine
topic sentiment analysis
hotel
clustering
naïve bayes
support vector machine
url https://jurnal.iaii.or.id/index.php/RESTI/article/view/6139
work_keys_str_mv AT bayuyanuargi dataclusteringforsentimentclassificationwithnaivebayesandsupportvectormachine
AT emautami dataclusteringforsentimentclassificationwithnaivebayesandsupportvectormachine
AT kusrini dataclusteringforsentimentclassificationwithnaivebayesandsupportvectormachine
AT arliadityaparikesit dataclusteringforsentimentclassificationwithnaivebayesandsupportvectormachine