What does this research mean for the field?

The paper reports that a Federated Decision Transformer (FDT) framework, which integrates transformer-based sequence modeling with federated learning, provides better reward efficiency, scalability, and adaptability in dynamic IoT networks than centralized critic-based methods such as MAAC. Novelty: methodological. Consensus alignment: neutral.

October 27, 2025 | Open Access

Federated Decision Transformers for Scalable Reinforcement Learning in Smart City IoT Systems

Abstract

The rapid proliferation of Internet of Things (IoT) devices in smart city environments enables autonomous decision-making, but introduces challenges of scalability, coordination, and privacy. Existing reinforcement learning (RL) methods, such as Multi-Agent Actor–Critic (MAAC), depend on centralized critics and recurrent structures, which limit scalability and create single points of failure. This paper proposes a Federated Decision Transformer (FDT) framework that integrates transformer-based sequence modeling with federated learning. By replacing centralized critics with self-attention-driven trajectory modeling, the FDT preserves data locality, enhances privacy, and supports decentralized policy learning across distributed IoT nodes. We benchmarked the FDT against MAAC in a mobile edge computing (MEC) environment with identical hyperparameter configurations. The results demonstrate that the FDT achieves superior reward efficiency, scalability, and adaptability in dynamic IoT networks, albeit with slightly higher variance during early training. These findings highlight transformer-based federated RL as a robust and privacy-preserving alternative to critic-based methods for large-scale IoT systems.
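To make the two ingredients named in the abstract more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes undiscounted returns-to-go as the conditioning signal (as in the original Decision Transformer formulation) and a FedAvg-style, sample-size-weighted parameter average as the aggregation rule; the abstract does not state which aggregation rule the FDT uses. The names `returns_to_go`, `federated_average`, and the `Weights` type are hypothetical and chosen only for illustration.

```python
from typing import Dict, List

# Hypothetical representation: parameter name -> flat list of floats.
Weights = Dict[str, List[float]]


def returns_to_go(rewards: List[float]) -> List[float]:
    """Return the suffix sums of a reward sequence.

    Entry t is the total (undiscounted) reward from step t to the end of
    the trajectory. Assumption: the FDT conditions on this signal, as the
    original Decision Transformer does.
    """
    rtg: List[float] = []
    running = 0.0
    for reward in reversed(rewards):
        running += reward
        rtg.append(running)
    rtg.reverse()
    return rtg


def federated_average(client_weights: List[Weights],
                      client_sizes: List[int]) -> Weights:
    """Average client parameters, weighted by local sample counts.

    Assumption: FedAvg-style aggregation. Only parameters are shared;
    the raw trajectories stay on each client (data locality).
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged: Weights = {}
    for name, reference in client_weights[0].items():
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(len(reference))
        ]
    return averaged


if __name__ == "__main__":
    # Each node turns its local rewards into a conditioning signal.
    print(returns_to_go([1.0, 2.0, 3.0]))  # [6.0, 5.0, 3.0]

    # A coordinator merges two nodes' parameters; node B has 3x the data.
    node_a = {"w": [1.0, 2.0]}
    node_b = {"w": [3.0, 4.0]}
    print(federated_average([node_a, node_b], [1, 3]))  # {'w': [2.5, 3.5]}
```

In this sketch, each IoT node would train its sequence model on local trajectories and send only the resulting parameters for averaging, which is the sense in which a federated setup keeps data local.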

Cite This Study

Alterkawi et al. (2025) studied this question.

synapsesocial.com/papers/6a1fabae900b646e2b260a73
https://doi.org/10.3390/fi17110492
