Abstract: Safety is a critical factor in reinforcement learning (RL) for chemical processes. In our previous work, we proposed a stability-guaranteed RL method for unconstrained nonlinear control-affine systems. In its approximate policy iteration algorithm, a Lyapunov neural network (LNN) was updated while being restricted to the class of control Lyapunov functions, and the policy was updated using a variation of Sontag's formula. In this study, we additionally consider state and input constraints by introducing a barrier function, and we extend the applicable class of systems to general nonlinear systems. We augment the objective function with the constraints and use the LNN, augmented with a Lyapunov barrier function, to approximate the resulting augmented value function. The input computed by Sontag's formula from this approximate function drives the states into its lower level set, thereby guaranteeing constraint satisfaction and stability. We prove practical asymptotic stability and forward invariance. The effectiveness of the method is validated in simulations of a four-tank system.
Kim et al. studied this question.
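For readers unfamiliar with Sontag's formula, which the abstract relies on, the following is a minimal sketch of the classical (textbook) version for a control-affine system x' = f(x) + g(x)u with a control Lyapunov function V. It is not the authors' variation, and it does not include the barrier function or the LNN. The function names, the tolerance `eps`, and the toy system x' = x + u with V(x) = 0.5 x^2 are assumptions chosen only for illustration.

```python
import math


def sontag_input(a, b, eps=1e-12):
    """Classical Sontag formula (illustrative sketch, not the paper's variant).

    a   : scalar Lie derivative L_f V(x) = grad V(x) . f(x)
    b   : list of floats, Lie derivative L_g V(x) = grad V(x) . g(x)
    eps : hypothetical tolerance below which |b|^2 is treated as zero
    """
    b_sq = sum(bi * bi for bi in b)
    if b_sq < eps:
        # When L_g V vanishes the input cannot change V, so apply no input.
        return [0.0 for _ in b]
    gain = (a + math.sqrt(a * a + b_sq * b_sq)) / b_sq
    return [-gain * bi for bi in b]


def v_dot_toy(x):
    """Time derivative of V = 0.5 x^2 along the assumed toy system x' = x + u."""
    a = x * x      # L_f V = (dV/dx) * f(x) = x * x
    b = [x]        # L_g V = (dV/dx) * g(x) = x * 1
    u = sontag_input(a, b)
    return a + b[0] * u[0]


if __name__ == "__main__":
    for x in (-3.0, 0.5, 2.0):
        print(x, v_dot_toy(x))
```

Under this formula the derivative of V along the closed loop equals -sqrt(a^2 + |b|^4) whenever b is nonzero, so V decreases and the state moves into lower level sets of V. This is the mechanism the abstract refers to when it says the input brings the states into the lower level set of the approximate function.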