TY - JOUR
T1 - Adaptive Reinforcement Learning Strategy with Sliding Mode Control for Unknown and Disturbed Wheeled Inverted Pendulum
AU - Dao, Phuong Nam
AU - Liu, Yen Chen
N1 - Publisher Copyright:
© 2020, ICROS, KIEE and Springer.
PY - 2021/2
Y1 - 2021/2
N2 - This paper develops a novel adaptive integral sliding-mode control (SMC) technique to improve the tracking performance of a wheeled inverted pendulum (WIP) system, which belongs to a class of continuous time systems with input disturbance and/or unknown parameters. The proposed algorithm is established based on an integrating between the advantage of online adaptive reinforcement learning control and the high robustness of integral sliding-mode control (SMC) law. The main objective is to find a general structure of integral sliding mode control law that can guarantee the system state reaching a sliding surface in finite time. An adaptive/approximate optimal control based on the approximate/adaptive dynamic programming (ADP) is responsible for the asymptotic stability of the closed loop system. Furthermore, the convergence possibility of proposed output feedback optimal control was determined without the convergence of additional state observer. Finally, the theoretical analysis and simulation results validate the performance of the proposed control structure.
AB - This paper develops a novel adaptive integral sliding-mode control (SMC) technique to improve the tracking performance of a wheeled inverted pendulum (WIP) system, which belongs to a class of continuous time systems with input disturbance and/or unknown parameters. The proposed algorithm is established based on an integrating between the advantage of online adaptive reinforcement learning control and the high robustness of integral sliding-mode control (SMC) law. The main objective is to find a general structure of integral sliding mode control law that can guarantee the system state reaching a sliding surface in finite time. An adaptive/approximate optimal control based on the approximate/adaptive dynamic programming (ADP) is responsible for the asymptotic stability of the closed loop system. Furthermore, the convergence possibility of proposed output feedback optimal control was determined without the convergence of additional state observer. Finally, the theoretical analysis and simulation results validate the performance of the proposed control structure.
UR - http://www.scopus.com/inward/record.url?scp=85097191650&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85097191650&partnerID=8YFLogxK
U2 - 10.1007/s12555-019-0912-9
DO - 10.1007/s12555-019-0912-9
M3 - Article
AN - SCOPUS:85097191650
SN - 1598-6446
VL - 19
SP - 1139
EP - 1150
JO - International Journal of Control, Automation and Systems
JF - International Journal of Control, Automation and Systems
IS - 2
ER -