What question did this study set out to answer?

To develop a predictive auto-scaling system on AWS that utilizes machine learning for improved resource management.

April 27, 2026Open Access

ML-Based Predictive Autoscaling on Aws

Key Points

To develop a predictive auto-scaling system on AWS that utilizes machine learning for improved resource management.
Utilized Long Short-Term Memory (LSTM) models to analyze historical and real-time workload data.
Employed federated learning to train models across distributed cloud nodes without sharing raw data.
Proactively scaled AWS EC2 instances using Auto Scaling Groups based on workload predictions.
Improved application responsiveness and reduced latency observed.
Optimized resource utilization while lowering operational costs observed through implementation.
Achieved a more efficient and privacy-preserving solution for cloud resource management.

Abstract

Traditional cloud auto-scaling systems rely on static threshold-based rules that react only after workload changes occur, leading to delayed responses, inefficient resource utilization, and increased latency. To address these limitations, this paper proposes a predictive auto-scaling system on AWS using machine learning and federated learning techniques. The system utilizes Long ShortTerm Memory (LSTM) models to analyze historical and real- time workload data, including CPU utilization, memory usage, and request rates collected through Amazon CloudWatch. Unlike centralized approaches, federated learning is employed to train models across distributed cloud nodes without sharing raw data, ensuring data privacy and reducing communication overhead. The locally trained models share updates that are aggregated to form a global predictive model capable of accurately forecasting future workload demands.Based on these predictions, the system proactively scales AWS EC2 instances using Auto Scaling Groups, enabling timely resource allocation before performance degradation occurs. This approach improves application responsiveness, reduces latency, optimizes resource utilization, and lowers operational costs. Overall, the proposed system provides a scalable, efficient, and privacy- preserving solution for intelligent cloud resource management.

ML-Based Predictive Autoscaling on Aws

Key Points

Abstract

Cite This Study

Also Consider

Also Consider