What does this research mean for the field?

The REAR-RL framework, which integrates Q-learning with a multi-criteria reward function and a lightweight trust model, significantly improves network lifetime, packet delivery ratio, and end-to-end delay in MANETs compared to traditional routing protocols. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to develop a novel routing framework that addresses energy depletion and non-cooperative node issues in MANETs.

June 9, 2026Open Access

Intelligent Energy-Aware Routing: A Reinforcement Learning Approach for Non-Cooperative Node Detection and Path Optimization in MANETs

Key Points

The aim is to develop a novel routing framework that addresses energy depletion and non-cooperative node issues in MANETs.
Implemented REAR-RL, a reinforcement learning-based routing framework using Q-learning.
Utilized a multi-criteria reward function considering factors like residual energy and node cooperation.
Conducted extensive simulations in NS-3 with varying node mobility and traffic loads.
Achieved a 34.7% improvement in network lifetime compared to existing protocols.
Increased packet delivery ratio by 28.3%.
Reduced end-to-end delay by 19.6% compared to AODV, DSR, and OLSR.

Abstract

ABSTRACT Mobile Ad Hoc Networks (MANETs) are inherently susceptible to energy depletion and non-cooperative node behavior, both of which critically degrade routing performance and network lifetime. Existing routing protocols fail to simultaneously address dynamic topology changes, selfish node detection, and energy-balanced path selection. This paper presents REAR-RL (Reinforcement Energy-Aware Routing via Reinforcement Learning), a novel adaptive routing framework that integrates a Q-learning-based decision engine with a multi-criteria reward function encapsulating residual energy, link quality, node cooperation history, and hop count. REAR-RL employs a lightweight trust model derived from packet forwarding behavior to identify and isolate non-cooperative nodes without requiring centralized infrastructure. The reward shaping strategy prioritizes routes that balance energy consumption across participating nodes while maximizing packet delivery. Extensive simulations conducted in NS-3 with 50 to 200 mobile nodes reveal that REAR-RL achieves up to 34.7% improvement in network lifetime, a 28.3% increase in packet delivery ratio, and reduces end-to-end delay by 19.6% compared to AODV, DSR, and OLSR under varying node mobility and traffic loads. These results demonstrate the viability of model-free reinforcement learning as a scalable, infrastructure-free solution for intelligent routing in adversarial mobile environments.

Intelligent Energy-Aware Routing: A Reinforcement Learning Approach for Non-Cooperative Node Detection and Path Optimization in MANETs

Key Points

Abstract

Cite This Study

Also Consider

Also Consider