TEXTS OF DIFFERENT EMOTIONAL CLASSES AND THEIR TOPIC MODELING

The article is devoted to studying verbalization specifics of various emotional states in the texts in the Russian language with the purpose to confirm or refute the hypothesis that texts of different emotional classes reflect the denotative situation not identically, which is reflected in thematic...

Full description

Saved in:
Bibliographic Details
Main Authors: Anastasia V. Kolmogorova, Qiuhua Sun
Format: Article
Language:English
Published: Volgograd State University 2024-11-01
Series:Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
Subjects:
Online Access:https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2844-kolmogorova-a-v-sun-qiuhua-texts-of-different-emotional-classes-and-their-topic-modeling
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The article is devoted to studying verbalization specifics of various emotional states in the texts in the Russian language with the purpose to confirm or refute the hypothesis that texts of different emotional classes reflect the denotative situation not identically, which is reflected in thematic specifics and lexical content. The research material consisted of eight corpus texts in the Russian language, which were extracted from the public pages of the social network VKontakte. The texts were selected according to emotional hashtags that corresponded to eight basic emotions, according to H. Lovheim’s model: anger, surprise, shame, enjoyment, disgust, distress, excitement, fear. The correspondence of emotion and hashtag was established in a preliminary psycholinguistic experiment. While analyzing the text collection, we used the method of computer thematic modeling to identify statistically nonrandom groups of words (topics). We applied the BERTopic neural network model to the collected data. As a result of the analysis, it was found that texts of 8 emotional classes contain an uneven number of topics, despite the fact that their number does not correlate directly with the amount of data: with a relatively small amount of data, there may be many topics, but in a voluminous corpus – few. The sets of words (tokens) that make up each non-random group (topic) differ in each subcorpora, reflecting the specifics of the denotative situation, which is formed under the influence of the emotional state of the speaker. The idea of diverse thematic “granularity” of texts of different emotional classes is theoretically justified.
ISSN:1998-9911
2409-1979