Hadoop bottleneck detection algorithm based on information gain

Hadoop has become a major platform for big data storage and large data mining nowadays.Although Hadoop platform achieves high performance parallel computing through a distributed cluster of machines,the bottlenecks will inevitably appear on a machine when cluster load increases,because the cluster i...

Full description

Saved in:
Bibliographic Details
Main Authors: Zaole TAN, Zhifeng HAO, Ruichu CAI, Xiaojun XIAO, Yu LU
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2016-07-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2016203/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Hadoop has become a major platform for big data storage and large data mining nowadays.Although Hadoop platform achieves high performance parallel computing through a distributed cluster of machines,the bottlenecks will inevitably appear on a machine when cluster load increases,because the cluster is composed of inexpensive host.Aiming at this problem,a bottleneck detection algorithms based on information gain was proposed.The algorithm detected cluster's bottlenecks resource by computing the information gain of each resource.The experiments show that the bottleneck detection algorithm is feasible.
ISSN:1000-0801