Data augmentation scheme for federated learning with non-IID data

To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning sc...

Full description

Saved in:

Bibliographic Details
Main Authors:	Lingtao TANG, Di WANG, Shengyun LIU
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2023-01-01
Series:	Tongxin xuebao
Subjects:	federated learning Non-IID generative adversarial network differential privacy data augmentation
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841540077064290304
author	Lingtao TANG Di WANG Shengyun LIU
author_facet	Lingtao TANG Di WANG Shengyun LIU
author_sort	Lingtao TANG
collection	DOAJ
description	To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.
format	Article
id	doaj-art-db0d49e96c0143dea5f9077a8c792fdd
institution	Kabale University
issn	1000-436X
language	zho
publishDate	2023-01-01
publisher	Editorial Department of Journal on Communications
record_format	Article
series	Tongxin xuebao
spelling	doaj-art-db0d49e96c0143dea5f9077a8c792fdd2025-01-14T06:28:07ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2023-01-014416417659389161Data augmentation scheme for federated learning with non-IID dataLingtao TANGDi WANGShengyun LIUTo solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/federated learningNon-IIDgenerative adversarial networkdifferential privacydata augmentation
spellingShingle	Lingtao TANG Di WANG Shengyun LIU Data augmentation scheme for federated learning with non-IID data Tongxin xuebao federated learning Non-IID generative adversarial network differential privacy data augmentation
title	Data augmentation scheme for federated learning with non-IID data
title_full	Data augmentation scheme for federated learning with non-IID data
title_fullStr	Data augmentation scheme for federated learning with non-IID data
title_full_unstemmed	Data augmentation scheme for federated learning with non-IID data
title_short	Data augmentation scheme for federated learning with non-IID data
title_sort	data augmentation scheme for federated learning with non iid data
topic	federated learning Non-IID generative adversarial network differential privacy data augmentation
url	http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2023007/
work_keys_str_mv	AT lingtaotang dataaugmentationschemeforfederatedlearningwithnoniiddata AT diwang dataaugmentationschemeforfederatedlearningwithnoniiddata AT shengyunliu dataaugmentationschemeforfederatedlearningwithnoniiddata

Data augmentation scheme for federated learning with non-IID data

Similar Items