Data augmentation scheme for federated learning with non-IID data

To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning sc...

Full description

Saved in:
Bibliographic Details
Main Authors: Lingtao TANG, Di WANG, Shengyun LIU
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2023-01-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841540077064290304
author Lingtao TANG
Di WANG
Shengyun LIU
author_facet Lingtao TANG
Di WANG
Shengyun LIU
author_sort Lingtao TANG
collection DOAJ
description To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.
format Article
id doaj-art-db0d49e96c0143dea5f9077a8c792fdd
institution Kabale University
issn 1000-436X
language zho
publishDate 2023-01-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-db0d49e96c0143dea5f9077a8c792fdd2025-01-14T06:28:07ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2023-01-014416417659389161Data augmentation scheme for federated learning with non-IID dataLingtao TANGDi WANGShengyun LIUTo solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/federated learningNon-IIDgenerative adversarial networkdifferential privacydata augmentation
spellingShingle Lingtao TANG
Di WANG
Shengyun LIU
Data augmentation scheme for federated learning with non-IID data
Tongxin xuebao
federated learning
Non-IID
generative adversarial network
differential privacy
data augmentation
title Data augmentation scheme for federated learning with non-IID data
title_full Data augmentation scheme for federated learning with non-IID data
title_fullStr Data augmentation scheme for federated learning with non-IID data
title_full_unstemmed Data augmentation scheme for federated learning with non-IID data
title_short Data augmentation scheme for federated learning with non-IID data
title_sort data augmentation scheme for federated learning with non iid data
topic federated learning
Non-IID
generative adversarial network
differential privacy
data augmentation
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/
work_keys_str_mv AT lingtaotang dataaugmentationschemeforfederatedlearningwithnoniiddata
AT diwang dataaugmentationschemeforfederatedlearningwithnoniiddata
AT shengyunliu dataaugmentationschemeforfederatedlearningwithnoniiddata