Microblog hot topic detection method based on meaningful string clustering
To address the sparse features and fragmented content of microblog data, a hot topic detection method based on meaningful string clustering is proposed. Multiple strategies, including repeated string detection, context analysis, and language rule filtering, are combined to extract me...
Main Authors: | Min HE, Li-hong WANG, Pan DU, Jin ZHANG, Xue-qi CHENG |
---|---|
Format: | Article |
Language: | zho |
Published: | Editorial Department of Journal on Communications, 2013-08-01 |
Series: | Tongxin xuebao |
Subjects: | hot topic; microblog; meaningful string; feature clustering |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.z1.034/ |
_version_ | 1841539799149707264 |
---|---|
author | Min HE; Li-hong WANG; Pan DU; Jin ZHANG; Xue-qi CHENG |
author_facet | Min HE; Li-hong WANG; Pan DU; Jin ZHANG; Xue-qi CHENG |
author_sort | Min HE |
collection | DOAJ |
description | To address the sparse features and fragmented content of microblog data, a hot topic detection method based on meaningful string clustering is proposed. Multiple strategies, including repeated string detection, context analysis, and language rule filtering, are combined to extract meaningful strings. Candidate topics are generated by clustering the meaningful strings according to their distribution across documents. Hot topics are then detected by ranking the candidate topics by hotness. Experimental results on microblog data show that the method mitigates the data sparseness problem and is an effective, feasible approach to hot topic detection for microblogs. |
format | Article |
id | doaj-art-3c3cd1f5a8ff48e9b8f41f332edfcd85 |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2013-08-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
title | Microblog hot topic detection method based on meaningful string clustering |
topic | hot topic; microblog; meaningful string; feature clustering |
url | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.z1.034/ |
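
The description in this record outlines a pipeline: extract meaningful strings, cluster them by their distribution across documents, and rank the resulting candidate topics by hotness. The Python sketch below illustrates that outline only and is not the authors' implementation. Everything specific in it is an assumption made for illustration: the function names, whitespace tokenization, the "same posts" subsumption filter used as a crude stand-in for context analysis and language rule filtering, the Jaccard threshold of 0.5, and hotness defined as the number of posts a topic covers.

```python
from collections import defaultdict


def repeated_strings(docs, max_len=2, min_docs=2):
    """Map each token n-gram (joined by spaces) to the set of posts containing
    it, keeping only n-grams found in at least `min_docs` posts."""
    seen = defaultdict(set)
    for idx, doc in enumerate(docs):
        tokens = doc.split()
        for n in range(1, max_len + 1):
            for i in range(len(tokens) - n + 1):
                seen[" ".join(tokens[i:i + n])].add(idx)
    return {s: ids for s, ids in seen.items() if len(ids) >= min_docs}


def drop_subsumed(strings):
    """Drop a string when a longer repeated string contains it and occurs in
    exactly the same posts (hypothetical stand-in for context analysis)."""
    kept = {}
    for s, ids in strings.items():
        subsumed = any(
            s != t and f" {s} " in f" {t} " and ids == other
            for t, other in strings.items()
        )
        if not subsumed:
            kept[s] = ids
    return kept


def cluster_by_distribution(strings, threshold=0.5):
    """Greedy single-pass clustering: a string joins the first cluster whose
    post set has Jaccard similarity >= threshold with the string's post set."""
    clusters = []
    # Process frequent strings first; break ties alphabetically for determinism.
    for s, ids in sorted(strings.items(), key=lambda kv: (-len(kv[1]), kv[0])):
        for c in clusters:
            jaccard = len(ids & c["docs"]) / len(ids | c["docs"])
            if jaccard >= threshold:
                c["strings"].append(s)
                c["docs"] |= ids
                break
        else:
            clusters.append({"strings": [s], "docs": set(ids)})
    return clusters


def detect_hot_topics(docs):
    """Return (strings, hotness) pairs, hottest first. Hotness is assumed to
    be the number of posts covered by the topic's strings."""
    strings = drop_subsumed(repeated_strings(docs))
    clusters = cluster_by_distribution(strings)
    ranked = sorted(clusters, key=lambda c: len(c["docs"]), reverse=True)
    return [(c["strings"], len(c["docs"])) for c in ranked]


posts = [
    "earthquake hits city",
    "earthquake relief",
    "earthquake hits coast",
    "new phone launch",
    "phone launch event",
]
for strings, hotness in detect_hot_topics(posts):
    print(hotness, strings)
```

Clustering on the sets of posts in which strings occur, rather than on the words inside each short post, is one way to read the record's point about data sparseness: a single microblog post carries few features, but a repeated string aggregates evidence from every post that contains it.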