Tensor-Based Efficient Federated Reinforcement Learning for Cyber-Physical-Social Intelligence

Reinforcement Learning (RL) serves as a fundamental learning paradigm in the field of artificial intelligence, enabling decision-making policies through interactions with environments. However, traditional RL methods encounter challenges when dealing with large-scale or continuous state spaces due t...

Full description

Saved in:
Bibliographic Details
Main Authors: Xin Nie, Laurence T. Yang, Fulan Fan, Zecan Yang
Format: Article
Language:English
Published: Tsinghua University Press 2025-06-01
Series:Big Data Mining and Analytics
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/BDMA.2025.9020036
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Reinforcement Learning (RL) serves as a fundamental learning paradigm in the field of artificial intelligence, enabling decision-making policies through interactions with environments. However, traditional RL methods encounter challenges when dealing with large-scale or continuous state spaces due to the curse of dimensionality. Although Deep Reinforcement Learning (DRL) can handle complex environments, its lack of transparency and interpretability hinders its applicability due to the black box nature. Moreover, centralized data collection and processing methods pose privacy security risks. Federated learning offers a distributed approach that ensures privacy preservation while co-training models. However, existing federated reinforcement learning approaches have not adequately addressed communication and computation overhead issues. To address these challenges, this study proposes a tensor train decomposition-based federated reinforcement learning method that enhances efficiency and provides interpretability. By leveraging tensor to model state-action values and employing tensor decomposition techniques for dimensionality reduction, this method effectively reduces model parameters and communication overhead while maintaining strong interpretability, accelerates algorithm convergence speed. Experimental results validate the advantages of our proposed algorithm in terms of efficiency and reliability.
ISSN:2096-0654
2097-406X