A method of micro-blog topic discovery based on feature words selection and text similarity
Format: | Article |
---|---|
Language: | zho |
Published: | Beijing Xintong Media Co., Ltd, 2017-10-01 |
Series: | Dianxin kexue |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017258/ |
Summary: | Some words in micro-blog short texts reduce the accuracy of text similarity calculation, which in turn degrades the quality of topic discovery. These words share the same form or meaning but are remote from the topic. A novel feature word selection method based on micro-blog short text content and structured information is proposed, which can effectively select important feature words from the text. In addition, the computation of the similarity between a text and a topic is improved. Finally, the two methods are combined and applied to micro-blog topic discovery. Experimental results show that the new topic discovery method effectively reduces the average miss rate and false detection rate and improves the quality of topic discovery. |
---|---|
ISSN: | 1000-0801 |