A method of micro-blog topic discovery based on feature words selection and text similarity

Some words in micro-blog short texts degrade the accuracy of text similarity calculation, which in turn lowers the quality of topic discovery. These words are identical in form or meaning but remote from the topic. A novel method of feature word selection based on micro-...

Bibliographic Details
Main Authors: Hongyang CHEN, Linlin WANG, Yingsheng CHEN, Jiangkun LU, Xue ZUO
Format: Article
Language: zho
Published: Beijing Xintong Media Co., Ltd 2017-10-01
Series: Dianxin kexue
Subjects: micro-blog; feature word; selection; similarity; topic discovery
Online Access: http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/
_version_ 1841530202178453504
author Hongyang CHEN
Linlin WANG
Yingsheng CHEN
Jiangkun LU
Xue ZUO
collection DOAJ
description Some words in micro-blog short texts degrade the accuracy of text similarity calculation, which in turn lowers the quality of topic discovery. These words are identical in form or meaning but remote from the topic. A novel feature word selection method based on micro-blog short text content and structured information is proposed, which effectively selects important feature words from the text. In addition, the computation of similarity between a text and a topic is improved. Finally, the two methods are combined and applied to micro-blog topic discovery. Experimental results show that the new topic discovery method effectively reduces the average missing rate and false detection rate and improves the quality of topic discovery.
format Article
id doaj-art-3e852cf1d72c4965a7b50f3ce036acf2
institution Kabale University
issn 1000-0801
language zho
publishDate 2017-10-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
spelling doaj-art-3e852cf1d72c4965a7b50f3ce036acf2 | 2025-01-15T03:06:02Z | zho | Beijing Xintong Media Co., Ltd | Dianxin kexue | 1000-0801 | 2017-10-01 | 3313414059599739 | A method of micro-blog topic discovery based on feature words selection and text similarity | Hongyang CHEN; Linlin WANG; Yingsheng CHEN; Jiangkun LU; Xue ZUO | Some words existing in micro-blog short text have a bad effect on the accuracy of text similarity calculation, further affecting the quality of topic discovery. And these words are the same in shape or semantic meaning, but remote from the topic. A novel method of feature words selection based on micro-blog short text content and structured information was proposed, which could effectively choose some important feature words from the text. Moreover, in computing the similarity between texts, an improvement on computing the similarity between the text and the topic was made. Finally, the methods were combined together and applied to discover micro-blog topics. Experimental results show that the new method of topic discovery can effectively reduce the average missing rate and false detection rate, and improve the quality of topic discovery. | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/ | micro-blog; feature word; selection; similarity; topic discovery
title A method of micro-blog topic discovery based on feature words selection and text similarity
topic micro-blog
feature word
selection
similarity
topic discovery
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/