What type of study is this?

This is a Quantitative Study study.

September 17, 2025Open Access

Integrating Reinforcement Learning and LLM with Self-Optimization Network System

Key Points

Integration of reinforcement learning and large language model optimizes network resources effectively.
Experimental results demonstrate significant reductions in network latency and packet loss, enhancing resource utilization.
The framework leverages real-time sensor data to dynamically adjust bandwidth allocation under varying traffic patterns.
Simulation results underscore the effectiveness of sensor-driven optimization in heterogeneous network environments.

Abstract

The rapid expansion of communication networks and increasingly complex service demands have presented significant challenges to the intelligent management of network resources. To address these challenges, we have proposed a network self-optimization framework integrating the predictive capabilities of the Large Language Model (LLM) with the decision-making capabilities of multi-agent Reinforcement Learning (RL). Specifically, historical network traffic data are converted into structured inputs to forecast future traffic patterns using a GPT-2-based prediction module. Concurrently, a Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm leverages real-time sensor data—including link delay and packet loss rates collected by embedded network sensors—to dynamically optimize bandwidth allocation. This sensor-driven mechanism enables the system to perform real-time optimization of bandwidth allocation, ensuring accurate monitoring and proactive resource scheduling. We evaluate our framework in a heterogeneous network simulated using Mininet under diverse traffic scenarios. Experimental results show that the proposed method significantly reduces network latency and packet loss, as well as improves robustness and resource utilization, highlighting the effectiveness of integrating sensor-driven RL optimization with predictive insights from LLMs.

Read Full Paperexternally

Perguntar à IA

Bookmark

View Full Paper