Key points are not available for this paper at this time.
Model-free control approaches require advanced exploration-exploitation policies to achieve practical tasks such as learning to bipedal robot walk in unstructured environments. In this article, we first construct a comprehensive exploration-exploitation policy that carries quality knowledge about the long-term predictor and the control policy, and the control signal of the model-free algorithms. Therefore, the developed model-free algorithm continues exploration by adjusting its unknown parameters until the desired learning and control are accomplished. Second, we provide an utterly model-free adaptive law enriched with the exploration-exploitation policy and derived step-by-step using the exact analogy of the model-based solution. The obtained adaptive control law considers the control signal saturation and the control signal (input) delay. Performed Lyapunov stability analysis ensures the convergence of the adaptive law that can also be used for intelligent control approaches. Third, we implement the adaptive algorithm in real time on a challenging benchmark system: a fourth-order, coupled dynamics, input saturated, and time-delayed underactuated manipulator. The results show that the proposed adaptive algorithm explores larger state-action spaces and treats the vanishing gradient problem in both learning and control. Also, we notice from the results that the learning and control properties of the adaptive algorithm are optimized as required.
Building similarity graph...
Analyzing shared references across papers
Loading...
Tutsoy et al. (Mon,) studied this question.
synapsesocial.com/papers/6a05b274b6b31dc090346c39 — DOI: https://doi.org/10.1109/tcyb.2021.3091680
Önder Tutsoy
Adana Science and Technology University
Duygun Erol Barkana
Yeditepe University
Kemal Balıkçı
Osmaniye Korkut Ata University
IEEE Transactions on Cybernetics
Yeditepe University
Osmaniye Korkut Ata University
Adana Science and Technology University
Building similarity graph...
Analyzing shared references across papers
Loading...