A method of micro-blog topic discovery based on feature words selection and text similarity
Some words in micro-blog short texts reduce the accuracy of text similarity calculation, which in turn degrades the quality of topic discovery. These words are identical in form or meaning but remote from the topic. A novel method of feature words selection based on micro-...
Main Authors: | Hongyang CHEN, Linlin WANG, Yingsheng CHEN, Jiangkun LU, Xue ZUO |
---|---|
Format: | Article |
Language: | zho |
Published: | Beijing Xintong Media Co., Ltd, 2017-10-01 |
Series: | Dianxin kexue |
Subjects: | |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/ |
_version_ | 1841530202178453504 |
---|---|
author | Hongyang CHEN Linlin WANG Yingsheng CHEN Jiangkun LU Xue ZUO |
author_sort | Hongyang CHEN |
collection | DOAJ |
description | Some words in micro-blog short texts reduce the accuracy of text similarity calculation, which in turn degrades the quality of topic discovery. These words are identical in form or meaning but remote from the topic. A novel feature word selection method based on micro-blog short text content and structured information is proposed, which effectively selects important feature words from the text. In addition, the computation of similarity between a text and a topic is improved. Finally, the two methods are combined and applied to micro-blog topic discovery. Experimental results show that the new topic discovery method effectively reduces the average missing rate and false detection rate and improves the quality of topic discovery. |
format | Article |
id | doaj-art-3e852cf1d72c4965a7b50f3ce036acf2 |
institution | Kabale University |
issn | 1000-0801 |
language | zho |
publishDate | 2017-10-01 |
publisher | Beijing Xintong Media Co., Ltd |
record_format | Article |
series | Dianxin kexue |
title | A method of micro-blog topic discovery based on feature words selection and text similarity |
topic | micro-blog feature word selection similarity topic discovery |
url | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/ |
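
The description in this record outlines a pipeline of feature word selection, text-to-topic similarity, and topic discovery, but gives no formulas. The following is a minimal sketch of that general shape only, not the authors' method. It assumes a hypothetical noise-word filter in place of the paper's feature word selection, plain cosine similarity in place of its improved text-topic similarity, and single-pass clustering for topic discovery. The names `NOISE_WORDS`, `feature_words`, `cosine`, `discover_topics`, and the `threshold` value are all illustrative.

```python
import math
from collections import Counter

# Hypothetical noise-word list; the record does not say how feature words are selected.
NOISE_WORDS = {"the", "a", "is", "to", "of", "and", "in", "rt"}


def feature_words(text):
    """Lowercase, split on whitespace, and drop noise words (stand-in for feature selection)."""
    return Counter(w for w in text.lower().split() if w not in NOISE_WORDS)


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def discover_topics(posts, threshold=0.3):
    """Single-pass clustering: attach each post to its most similar topic, or start a new one."""
    topics = []  # each topic holds a summed term vector and its member posts
    for post in posts:
        vec = feature_words(post)
        best, best_sim = None, 0.0
        for topic in topics:
            sim = cosine(vec, topic["centroid"])
            if sim > best_sim:
                best, best_sim = topic, sim
        if best is not None and best_sim >= threshold:
            best["centroid"].update(vec)  # summed counts act as the topic vector
            best["posts"].append(post)
        else:
            topics.append({"centroid": Counter(vec), "posts": [post]})
    return topics


if __name__ == "__main__":
    sample = [
        "earthquake hits city center",
        "city earthquake rescue teams",
        "new phone launch event",
        "phone launch price",
    ]
    for i, topic in enumerate(discover_topics(sample)):
        print(i, topic["posts"])
```

With the four sample posts, the two earthquake posts fall into one topic and the two phone posts into another. A lower `threshold` merges topics more readily (raising false detections), while a higher one splits them (raising misses), which is the trade-off the record's missing rate and false detection rate measure.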