Data augmentation scheme for federated learning with non-IID data
To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning sc...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2023-01-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841540077064290304 |
---|---|
author | Lingtao TANG Di WANG Shengyun LIU |
author_facet | Lingtao TANG Di WANG Shengyun LIU |
author_sort | Lingtao TANG |
collection | DOAJ |
description | To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples. |
format | Article |
id | doaj-art-db0d49e96c0143dea5f9077a8c792fdd |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2023-01-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
spelling | doaj-art-db0d49e96c0143dea5f9077a8c792fdd2025-01-14T06:28:07ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2023-01-014416417659389161Data augmentation scheme for federated learning with non-IID dataLingtao TANGDi WANGShengyun LIUTo solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/federated learningNon-IIDgenerative adversarial networkdifferential privacydata augmentation |
spellingShingle | Lingtao TANG Di WANG Shengyun LIU Data augmentation scheme for federated learning with non-IID data Tongxin xuebao federated learning Non-IID generative adversarial network differential privacy data augmentation |
title | Data augmentation scheme for federated learning with non-IID data |
title_full | Data augmentation scheme for federated learning with non-IID data |
title_fullStr | Data augmentation scheme for federated learning with non-IID data |
title_full_unstemmed | Data augmentation scheme for federated learning with non-IID data |
title_short | Data augmentation scheme for federated learning with non-IID data |
title_sort | data augmentation scheme for federated learning with non iid data |
topic | federated learning Non-IID generative adversarial network differential privacy data augmentation |
url | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/ |
work_keys_str_mv | AT lingtaotang dataaugmentationschemeforfederatedlearningwithnoniiddata AT diwang dataaugmentationschemeforfederatedlearningwithnoniiddata AT shengyunliu dataaugmentationschemeforfederatedlearningwithnoniiddata |