Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews

This paper presents an exploration of sentiment analysis applied to Vietnamese phone reviews, leveraging the PhoBERT model. While significant advancements have been made in sentiment analysis for English and other widely spoken languages, Vietnamese remains relatively under investigated. Our study...

Full description

Saved in:
Bibliographic Details
Main Authors: Tan Minh Ngo, Ba Hung Ngo, Stuchilin Vladimir Valerievich
Format: Article
Language:English
Published: Can Tho University Publisher 2024-10-01
Series:CTU Journal of Innovation and Sustainable Development
Subjects:
Online Access:http://web2010.thanhtoan/index.php/ctujs/article/view/1146
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents an exploration of sentiment analysis applied to Vietnamese phone reviews, leveraging the PhoBERT model. While significant advancements have been made in sentiment analysis for English and other widely spoken languages, Vietnamese remains relatively under investigated. Our study addresses this gap by constructing a comprehensive dataset that integrates data from the UIT-ViSFD dataset and data collected through web scraping. We experimented with various models including naive Bayes, Support Vector Machine, and PhoBERT, utilizing multiple data preprocessing techniques. PhoBERT, a state-of-the-art pre-trained language model specifically designed for Vietnamese, demonstrated superior performance. The final PhoBERT model with optimized preprocessing achieved an accuracy of 92.74%, highlighting its efficacy in accurately identifying sentiments.
ISSN:2588-1418
2815-6412