What does this research mean for the field?

A combined Policy Gradient Reinforcement Learning approach that simultaneously learns a Moving Horizon Estimation scheme and a Model Predictive Control scheme can effectively control polytopic Linear Parameter-Varying systems with inexact scheduling parameters. Novelty: methodological. Consensus alignment: neutral.

January 1, 2022 | Open Access

Policy Gradient Reinforcement Learning for Uncertain Polytopic LPV Systems based on MHE-MPC

Structured PICO

Population

Polytopic Linear Parameter-Varying (LPV) systems with inexact scheduling parameters

Intervention

Policy Gradient (PG) Reinforcement Learning (RL) to learn both the estimator (Moving Horizon Estimation, MHE) and controller (Model Predictive Control, MPC)

Outcome

Closed-loop performance

The study proposes a reinforcement learning-based approach to improve model predictive control and moving horizon estimation in uncertain polytopic LPV systems.

Abstract

In this paper, we propose a learning-based Model Predictive Control (MPC) approach for polytopic Linear Parameter-Varying (LPV) systems with inexact scheduling parameters (exogenous signals with inexact bounds), where the Linear Time Invariant (LTI) models (vertices) captured by combinations of the scheduling parameters become incorrect. We first adopt a Moving Horizon Estimation (MHE) scheme to simultaneously estimate the convex combination vector and the unmeasured states based on the observations and the model matching error. To address the incorrect LTI models used in both the MPC and MHE schemes, we then adopt Policy Gradient (PG) Reinforcement Learning (RL) to learn both the estimator (MHE) and the controller (MPC) so that the best closed-loop performance is achieved. The effectiveness of the proposed RL-based MHE/MPC design is demonstrated using an illustrative example.
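To make the idea of a convex combination vector concrete, the following is a minimal sketch, not the paper's MHE formulation. It assumes a hypothetical two-vertex polytopic model with made-up matrices `A1`, `A2`, `B1`, `B2`, full state measurement, and a single weight `theta` in [0, 1]. The helper `estimate_theta` is an illustrative name: it recovers the weight from one transition by minimizing the one-step model matching error and clipping the result to the valid interval, which is far simpler than a moving horizon estimator with unmeasured states.

```python
# Hypothetical two-vertex polytopic LPV model (all numbers are illustrative):
#   x_next = theta * (A1 x + B1 u) + (1 - theta) * (A2 x + B2 u),  0 <= theta <= 1
A1 = [[1.0, 0.1], [0.0, 0.9]]
A2 = [[1.0, 0.1], [0.0, 0.5]]
B1 = [0.0, 0.1]
B2 = [0.0, 0.2]


def vertex_step(A, B, x, u):
    """One step of a single LTI vertex model: A x + B u (scalar input u)."""
    return [sum(a * xi for a, xi in zip(row, x)) + b * u for row, b in zip(A, B)]


def lpv_step(theta, x, u):
    """One step of the polytopic model as a convex combination of the vertices."""
    f1 = vertex_step(A1, B1, x, u)
    f2 = vertex_step(A2, B2, x, u)
    return [theta * p + (1.0 - theta) * q for p, q in zip(f1, f2)]


def estimate_theta(x, u, x_next):
    """Least-squares weight from one transition, clipped to [0, 1].

    Minimizes ||x_next - (theta * f1 + (1 - theta) * f2)||^2 over theta.
    This is a one-step simplification, not a moving horizon estimator.
    """
    f1 = vertex_step(A1, B1, x, u)
    f2 = vertex_step(A2, B2, x, u)
    d = [p - q for p, q in zip(f1, f2)]        # direction between vertex predictions
    r = [xn - q for xn, q in zip(x_next, f2)]  # residual relative to vertex 2
    dd = sum(v * v for v in d)
    if dd == 0.0:
        return 0.5  # vertices predict the same state; the weight is not identifiable
    theta = sum(a * b for a, b in zip(d, r)) / dd
    return min(1.0, max(0.0, theta))


x, u = [1.0, 2.0], 0.5
x_next = lpv_step(0.3, x, u)           # simulate with a "true" weight of 0.3
theta_hat = estimate_theta(x, u, x_next)
```

With exact vertex models and noise-free data, the estimate matches the weight used in the simulation. When the vertex models themselves are wrong, as in the setting the abstract describes, such an estimate is biased, which is the motivation given for tuning the estimator and controller with reinforcement learning.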
