Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree

The job execution mechanism of Spark was analyzed,task efficiency model and Shuffle model were established,then allocation fitness degree (AFD) was defined and the optimization goal was put forward.On the basis of the model definition,the progressive filling partitioning and mapping algorithm (PFPM)...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen BIAN, Jiong1 YU, Wei-rong XIU, Bin LIAO, Chang-tian YING, Yu-rong QIAN
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2017-09-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2017188/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539490662842368
author Chen BIAN
Jiong1 YU
Wei-rong XIU
Bin LIAO
Chang-tian YING
Yu-rong QIAN
author_facet Chen BIAN
Jiong1 YU
Wei-rong XIU
Bin LIAO
Chang-tian YING
Yu-rong QIAN
author_sort Chen BIAN
collection DOAJ
description The job execution mechanism of Spark was analyzed,task efficiency model and Shuffle model were established,then allocation fitness degree (AFD) was defined and the optimization goal was put forward.On the basis of the model definition,the progressive filling partitioning and mapping algorithm (PFPM) was proposed.PFPM established the data distribution scheme adapting Reducers’ computing ability to decrease synchronous latency during Shuffle process and increase cluster the computing efficiency.The experiments demonstrate that PFPM could improve the rationality of workload distribution in Shuffle and optimize the execution efficiency of Spark.
format Article
id doaj-art-31df2c2fec8c40b187fb4f63eb22139e
institution Kabale University
issn 1000-436X
language zho
publishDate 2017-09-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-31df2c2fec8c40b187fb4f63eb22139e2025-01-14T07:13:01ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2017-09-013813314759712267Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degreeChen BIANJiong1 YUWei-rong XIUBin LIAOChang-tian YINGYu-rong QIANThe job execution mechanism of Spark was analyzed,task efficiency model and Shuffle model were established,then allocation fitness degree (AFD) was defined and the optimization goal was put forward.On the basis of the model definition,the progressive filling partitioning and mapping algorithm (PFPM) was proposed.PFPM established the data distribution scheme adapting Reducers’ computing ability to decrease synchronous latency during Shuffle process and increase cluster the computing efficiency.The experiments demonstrate that PFPM could improve the rationality of workload distribution in Shuffle and optimize the execution efficiency of Spark.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2017188/parallel computingSparkprogressive fillingpartitioning and mappingallocation fitness degree
spellingShingle Chen BIAN
Jiong1 YU
Wei-rong XIU
Bin LIAO
Chang-tian YING
Yu-rong QIAN
Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
Tongxin xuebao
parallel computing
Spark
progressive filling
partitioning and mapping
allocation fitness degree
title Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
title_full Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
title_fullStr Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
title_full_unstemmed Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
title_short Progressive filling partitioning and mapping algorithm for Spark based on allocation fitness degree
title_sort progressive filling partitioning and mapping algorithm for spark based on allocation fitness degree
topic parallel computing
Spark
progressive filling
partitioning and mapping
allocation fitness degree
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2017188/
work_keys_str_mv AT chenbian progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree
AT jiong1yu progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree
AT weirongxiu progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree
AT binliao progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree
AT changtianying progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree
AT yurongqian progressivefillingpartitioningandmappingalgorithmforsparkbasedonallocationfitnessdegree