Blog screening and mining based on temporal features and hybrid search in big data

Concerning that the correlation degree of the existing methods of blog screen and mining is loose and the information retrieval of the methods is deficient,a method based on temporal feature and hybrid search method was proposed.Considering the user reviews are important sources of evidence combinat...

Full description

Saved in:
Bibliographic Details
Main Authors: Lina ZHANG, Tai KUANG, Diqing JIANG
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2017-01-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017001/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841530079511838720
author Lina ZHANG
Tai KUANG
Diqing JIANG
author_facet Lina ZHANG
Tai KUANG
Diqing JIANG
author_sort Lina ZHANG
collection DOAJ
description Concerning that the correlation degree of the existing methods of blog screen and mining is loose and the information retrieval of the methods is deficient,a method based on temporal feature and hybrid search method was proposed.Considering the user reviews are important sources of evidence combination,the average number of reviews for blogs,the sources of BM25 relevance scores,the longest blog BM25 scores and time range between the latest related blog paper and the oldest related blog paper are being as the temporal feature sets.In addition,considering local search advantage of linear search(LS) and global search advantage of differential evolution(DE),the two kinds of information search methods were combined.BlogS06 data set was used in the experiment which was consists of blog home pages,XML source files and its blog portal pages,it was used for TREC 2007 and TREC 2008 blog mining experiments.Experimental results show that the proposed method can obtain satisfactory results in terms of running time and effectiveness.
format Article
id doaj-art-b2983355016b4193b7333355457ed78a
institution Kabale University
issn 1000-0801
language zho
publishDate 2017-01-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
spelling doaj-art-b2983355016b4193b7333355457ed78a2025-01-15T03:13:30ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012017-01-0133778459603931Blog screening and mining based on temporal features and hybrid search in big dataLina ZHANGTai KUANGDiqing JIANGConcerning that the correlation degree of the existing methods of blog screen and mining is loose and the information retrieval of the methods is deficient,a method based on temporal feature and hybrid search method was proposed.Considering the user reviews are important sources of evidence combination,the average number of reviews for blogs,the sources of BM25 relevance scores,the longest blog BM25 scores and time range between the latest related blog paper and the oldest related blog paper are being as the temporal feature sets.In addition,considering local search advantage of linear search(LS) and global search advantage of differential evolution(DE),the two kinds of information search methods were combined.BlogS06 data set was used in the experiment which was consists of blog home pages,XML source files and its blog portal pages,it was used for TREC 2007 and TREC 2008 blog mining experiments.Experimental results show that the proposed method can obtain satisfactory results in terms of running time and effectiveness.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017001/blog screening and miningtemporal featurelinear searchdifferential evolutionbig dataBM25
spellingShingle Lina ZHANG
Tai KUANG
Diqing JIANG
Blog screening and mining based on temporal features and hybrid search in big data
Dianxin kexue
blog screening and mining
temporal feature
linear search
differential evolution
big data
BM25
title Blog screening and mining based on temporal features and hybrid search in big data
title_full Blog screening and mining based on temporal features and hybrid search in big data
title_fullStr Blog screening and mining based on temporal features and hybrid search in big data
title_full_unstemmed Blog screening and mining based on temporal features and hybrid search in big data
title_short Blog screening and mining based on temporal features and hybrid search in big data
title_sort blog screening and mining based on temporal features and hybrid search in big data
topic blog screening and mining
temporal feature
linear search
differential evolution
big data
BM25
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017001/
work_keys_str_mv AT linazhang blogscreeningandminingbasedontemporalfeaturesandhybridsearchinbigdata
AT taikuang blogscreeningandminingbasedontemporalfeaturesandhybridsearchinbigdata
AT diqingjiang blogscreeningandminingbasedontemporalfeaturesandhybridsearchinbigdata