Implementation of a big data anonymization system based on Spark

Group-based anonymization is a classical data anonymization framework that protects privacy by constructing groups of anonymized data records such that records in the same group cannot be distinguished from one another. The electric power industry big data analysis involv...

Bibliographic Details
Main Authors: Chaoyi BIAN, Shaomin ZHU, Tao ZHOU
Format: Article
Language: zho
Published: Beijing Xintong Media Co., Ltd 2018-04-01
Series: Dianxin kexue
Subjects:
Online Access: http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018133/
collection DOAJ
description Group-based anonymization is a classical data anonymization framework that protects privacy by constructing groups of anonymized data records such that the records within a group cannot be distinguished from one another. Big data analysis in the electric power industry involves both the core data of power enterprises and users' private data, so the data is highly sensitive, and traditional data anonymization systems cannot meet the industry's needs for big data business applications and security protection. A new big data anonymization system based on Spark was designed and implemented; it supports multiple data formats stored on Hadoop and substantially improves processing efficiency for big data.
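The group-based idea in the description can be illustrated with a small sketch. This is plain Python rather than Spark, and it is not the authors' system: the field names (`age`, `postcode`, `usage_kwh`), the generalization rules, and the helper functions `generalize` and `anonymize` are all hypothetical, chosen only to show how records that share generalized quasi-identifiers form one indistinguishable group and how groups smaller than a threshold k can be suppressed.

```python
from collections import defaultdict


def generalize(record):
    """Coarsen the quasi-identifiers (hypothetical rules):
    age becomes a 10-year band, postcode keeps only its first 3 characters."""
    low = (record["age"] // 10) * 10
    return (f"{low}-{low + 9}", record["postcode"][:3] + "**")


def anonymize(records, k):
    """Group records by their generalized quasi-identifiers and release
    only groups with at least k members; smaller groups are suppressed."""
    groups = defaultdict(list)
    for r in records:
        groups[generalize(r)].append(r)

    released = []
    for (age_band, region), members in groups.items():
        if len(members) < k:
            continue  # group too small to hide an individual: drop it
        for m in members:
            # Within a group, every released record carries the same
            # generalized quasi-identifier values.
            released.append(
                {"age": age_band, "postcode": region, "usage_kwh": m["usage_kwh"]}
            )
    return released


# Made-up sample data for illustration only.
records = [
    {"age": 34, "postcode": "10012", "usage_kwh": 310},
    {"age": 37, "postcode": "10045", "usage_kwh": 275},
    {"age": 31, "postcode": "10077", "usage_kwh": 420},
    {"age": 52, "postcode": "20031", "usage_kwh": 198},
]

released = anonymize(records, k=2)
for row in released:
    print(row)
```

With k = 2, the three records in the 30-39 age band and the `100**` region are released as one group, while the single record in the 50-59 band is suppressed. On a cluster, the grouping step would presumably be expressed as a distributed group-by over the generalized columns, but the record does not describe how the system does this.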
id doaj-art-104324e7fbe845f3882392c6875119ed
institution Kabale University
issn 1000-0801
topic data anonymization
privacy
electric power industry
safety protection
Spark