Microblog hot topic detection method based on meaningful string clustering

To address the sparse features and fragmented content of microblog data, a hot topic detection method based on meaningful string clustering is proposed. Multiple strategies, including repeated string detection, context analysis, and language rule filtering, are combined to extract me...

Bibliographic Details
Main Authors: Min HE, Li-hong WANG, Pan DU, Jin ZHANG, Xue-qi CHENG
Format: Article
Language: zho
Published: Editorial Department of Journal on Communications 2013-08-01
Series: Tongxin xuebao
Subjects: hot topic; microblog; meaningful string; feature clustering
Online Access: http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.z1.034/
_version_ 1841539799149707264
author Min HE
Li-hong WANG
Pan DU
Jin ZHANG
Xue-qi CHENG
collection DOAJ
description To address the sparse features and fragmented content of microblog data, a hot topic detection method based on meaningful string clustering is proposed. Multiple strategies, including repeated string detection, context analysis, and language rule filtering, are combined to extract meaningful strings. Candidate topics are generated by clustering the meaningful strings according to their distribution across documents. Hot topics are then detected by ranking the candidate topics by hotness. Experimental results on microblog data show that the method performs well in mitigating data sparseness and that it is effective and feasible for microblog hot topic detection.
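The pipeline outlined in the description (extract repeated strings, cluster them by their distribution across documents, rank the clusters by hotness) can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no algorithmic details, so whitespace tokenization, token n-grams, Jaccard similarity over document sets, greedy clustering, and "number of documents covered" as the hotness score are all assumptions, and every function name and parameter below is hypothetical. Context analysis and language rule filtering are omitted.

```python
from collections import defaultdict


def repeated_strings(docs, n_min=2, n_max=3, min_docs=2):
    """Map each token n-gram to the set of document ids containing it,
    keeping only n-grams that occur in at least `min_docs` documents.
    (Assumption: whitespace tokens; Chinese text would need character n-grams.)"""
    occurrences = defaultdict(set)
    for doc_id, doc in enumerate(docs):
        tokens = doc.split()
        for n in range(n_min, n_max + 1):
            for i in range(len(tokens) - n + 1):
                occurrences[" ".join(tokens[i:i + n])].add(doc_id)
    return {s: ids for s, ids in occurrences.items() if len(ids) >= min_docs}


def jaccard(a, b):
    """Jaccard similarity of two non-empty sets of document ids."""
    return len(a & b) / len(a | b)


def cluster_strings(occurrences, threshold=0.5):
    """Greedy clustering (an assumed stand-in): a string joins the first cluster
    whose document set is similar enough, otherwise it starts a new cluster."""
    clusters = []
    ordered = sorted(occurrences.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    for string, ids in ordered:
        for cluster in clusters:
            if jaccard(ids, cluster["docs"]) >= threshold:
                cluster["strings"].append(string)
                cluster["docs"] |= ids
                break
        else:
            clusters.append({"strings": [string], "docs": set(ids)})
    return clusters


def detect_hot_topics(docs):
    """Return candidate topics sorted by hotness, here assumed to be
    the number of documents a cluster covers."""
    clusters = cluster_strings(repeated_strings(docs))
    return sorted(clusters, key=lambda c: len(c["docs"]), reverse=True)


if __name__ == "__main__":
    posts = [
        "earthquake hits city center",
        "strong earthquake hits city",
        "new phone launch today",
        "phone launch today delayed",
        "earthquake hits city again",
    ]
    for topic in detect_hot_topics(posts):
        print(len(topic["docs"]), topic["strings"])
```

Clustering on document sets rather than on the raw text of each post is what lets short, sparse posts be grouped: two strings end up in the same candidate topic when they tend to appear in the same posts, even if they share no words.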
format Article
id doaj-art-3c3cd1f5a8ff48e9b8f41f332edfcd85
institution Kabale University
issn 1000-436X
language zho
publishDate 2013-08-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
title Microblog hot topic detection method based on meaningful string clustering
topic hot topic
microblog
meaningful string
feature clustering
url http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.z1.034/