Implementation of a big data anonymization system based on Spark
Group based anonymization is a classical data anonymization framework,which achieves the effect of privacy protection by constructing groups of anonymized data records ensuring that records in the same group cannot be distinguished with each other.The electric power industry big data analysis involv...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Beijing Xintong Media Co., Ltd
2018-04-01
|
Series: | Dianxin kexue |
Subjects: | |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018133/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841530346094460928 |
---|---|
author | Chaoyi BIAN Shaomin ZHU Tao ZHOU |
author_facet | Chaoyi BIAN Shaomin ZHU Tao ZHOU |
author_sort | Chaoyi BIAN |
collection | DOAJ |
description | Group based anonymization is a classical data anonymization framework,which achieves the effect of privacy protection by constructing groups of anonymized data records ensuring that records in the same group cannot be distinguished with each other.The electric power industry big data analysis involves the core data of the power enterprises and the user privacy data,the data sensitivity is stronger,traditional data anonymization systems are unable to meet the needs of big data business applications and safety protection of electric power industry.A new big data anonymization system based on Spark was designed and implemented,which could provide the support for multiple data formats stored on Hadoop and substantially improve the processing efficiency for big data. |
format | Article |
id | doaj-art-104324e7fbe845f3882392c6875119ed |
institution | Kabale University |
issn | 1000-0801 |
language | zho |
publishDate | 2018-04-01 |
publisher | Beijing Xintong Media Co., Ltd |
record_format | Article |
series | Dianxin kexue |
spelling | doaj-art-104324e7fbe845f3882392c6875119ed2025-01-15T03:04:54ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012018-04-013415616159596193Implementation of a big data anonymization system based on SparkChaoyi BIANShaomin ZHUTao ZHOUGroup based anonymization is a classical data anonymization framework,which achieves the effect of privacy protection by constructing groups of anonymized data records ensuring that records in the same group cannot be distinguished with each other.The electric power industry big data analysis involves the core data of the power enterprises and the user privacy data,the data sensitivity is stronger,traditional data anonymization systems are unable to meet the needs of big data business applications and safety protection of electric power industry.A new big data anonymization system based on Spark was designed and implemented,which could provide the support for multiple data formats stored on Hadoop and substantially improve the processing efficiency for big data.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018133/data anonymizationprivacyelectric power industrysafety protectionSpark |
spellingShingle | Chaoyi BIAN Shaomin ZHU Tao ZHOU Implementation of a big data anonymization system based on Spark Dianxin kexue data anonymization privacy electric power industry safety protection Spark |
title | Implementation of a big data anonymization system based on Spark |
title_full | Implementation of a big data anonymization system based on Spark |
title_fullStr | Implementation of a big data anonymization system based on Spark |
title_full_unstemmed | Implementation of a big data anonymization system based on Spark |
title_short | Implementation of a big data anonymization system based on Spark |
title_sort | implementation of a big data anonymization system based on spark |
topic | data anonymization privacy electric power industry safety protection Spark |
url | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018133/ |
work_keys_str_mv | AT chaoyibian implementationofabigdataanonymizationsystembasedonspark AT shaominzhu implementationofabigdataanonymizationsystembasedonspark AT taozhou implementationofabigdataanonymizationsystembasedonspark |