Adaptive PID controller based on Q-learning algorithm

Qian Shi; Hak Keung Lam; Bo Xiao; Shun Hung Tsai

doi:10.1049/trit.2018.1007

Corresponding author: Hak Keung Lam

Research output: Contribution to journal › Article › peer-review

Abstract

An adaptive proportional-integral-derivative (PID) controller based on the Q-learning algorithm is proposed to balance the cart-pole system in a simulation environment. The controller is trained using Q-learning, and the learned Q-tables are used to change the gains of linear PID controllers according to the state of the system during the control process. Trained from a fixed set of initial positions, the controller was able to balance the system from a series of initial positions different from those used in training, achieving equivalent or better performance than both a conventional PID controller and a controller that uses only Q-learning. This indicates the advantage of the adaptive PID controller based on Q-learning both in generality, balancing the cart-pole system from a relatively wide range of initial positions, and in stabilisability, achieving a smaller steady-state error.
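The gain-scheduling idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the candidate gain sets in `GAIN_SETS`, the single-variable state (pole angle only), the bin count, the angle limit, and the learning parameters are all assumptions made for illustration, since the abstract gives none of them. The sketch only shows how a Q-table row indexed by the discretised state could pick one of several PID gain sets, and how a standard tabular Q-learning update would adjust that table.

```python
import random

# Hypothetical candidate gain sets (Kp, Ki, Kd); the paper's values are not given here.
GAIN_SETS = [(10.0, 0.0, 1.0), (30.0, 0.5, 3.0), (60.0, 1.0, 6.0)]
N_BINS = 7          # assumed number of bins for the pole angle
ANGLE_LIMIT = 0.21  # assumed angle range in radians


def discretise(angle):
    """Map a pole angle to a bin index in [0, N_BINS - 1]."""
    clipped = max(-ANGLE_LIMIT, min(ANGLE_LIMIT, angle))
    return min(N_BINS - 1, int((clipped + ANGLE_LIMIT) / (2 * ANGLE_LIMIT) * N_BINS))


def select_gains(q_table, state, epsilon=0.0):
    """Epsilon-greedy choice of a gain-set index for the given state."""
    if random.random() < epsilon:
        return random.randrange(len(GAIN_SETS))
    row = q_table[state]
    return row.index(max(row))


def q_update(q_table, s, a, reward, s_next, alpha=0.1, gamma=0.99):
    """Standard tabular Q-learning update for one transition."""
    target = reward + gamma * max(q_table[s_next])
    q_table[s][a] += alpha * (target - q_table[s][a])


class PID:
    """Discrete-time PID whose gains are supplied at every step."""

    def __init__(self, dt):
        self.dt = dt
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error, gains):
        kp, ki, kd = gains
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return kp * error + ki * self.integral + kd * derivative


# One row per discretised state, one column per candidate gain set.
q_table = [[0.0] * len(GAIN_SETS) for _ in range(N_BINS)]
```

In a control loop under these assumptions, each time step would call `discretise` on the measured angle, `select_gains` to pick a gain-set index, and `PID.step` with `GAIN_SETS[index]` to compute the force applied to the cart; during training, `q_update` would follow each step with whatever reward signal is chosen.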

Original language	English
Pages (from-to)	235-244
Number of pages	10
Journal	CAAI Transactions on Intelligence Technology
Volume	3
Issue number	4
DOIs	https://doi.org/10.1049/trit.2018.1007
Publication status	Published - 1 Dec 2018

Cite this

@article{38f8b970127c41549a36750616aaffd7,

title = "Adaptive PID controller based on Q-learning algorithm",

abstract = "An adaptive proportional-integral-derivative (PID) controller based on Q-learning algorithm is proposed to balance the cart-pole system in simulation environment. This controller was trained using Q-learning algorithm and implemented the learned Q-tables to change the gains of linear PID controllers according to the state of the system during the control process. The adaptive PID controller based on Q-learning algorithm was trained from a set of fixed initial positions and was able to balance the system starting from a series of initial positions that are different from the ones used in the training session, which achieved equivalent or even better performances in comparison with the conventional PID controller and the controller only uses Q-learning algorithm. This indicates the advantage of the adaptive PID controller based on Q-learning algorithm both in the generality of balancing the cart-pole system from a relatively wide range of initial positions and in the stabilisability of achieving smaller steady-state error.",

author = "Qian Shi and Lam, {Hak Keung} and Bo Xiao and Tsai, {Shun Hung}",

year = "2018",

month = dec,

day = "1",

doi = "10.1049/trit.2018.1007",

language = "English",

volume = "3",

pages = "235--244",

journal = "CAAI Transactions on Intelligence Technology",

issn = "2468-6557",

publisher = "Institution of Engineering and Technology",

number = "4",

}

TY - JOUR

T1 - Adaptive PID controller based on Q-learning algorithm

AU - Shi, Qian

AU - Lam, Hak Keung

AU - Xiao, Bo

AU - Tsai, Shun Hung

PY - 2018/12/1

Y1 - 2018/12/1

N2 - An adaptive proportional-integral-derivative (PID) controller based on Q-learning algorithm is proposed to balance the cart-pole system in simulation environment. This controller was trained using Q-learning algorithm and implemented the learned Q-tables to change the gains of linear PID controllers according to the state of the system during the control process. The adaptive PID controller based on Q-learning algorithm was trained from a set of fixed initial positions and was able to balance the system starting from a series of initial positions that are different from the ones used in the training session, which achieved equivalent or even better performances in comparison with the conventional PID controller and the controller only uses Q-learning algorithm. This indicates the advantage of the adaptive PID controller based on Q-learning algorithm both in the generality of balancing the cart-pole system from a relatively wide range of initial positions and in the stabilisability of achieving smaller steady-state error.

UR - http://www.scopus.com/inward/record.url?scp=85068497703&partnerID=8YFLogxK

U2 - 10.1049/trit.2018.1007

DO - 10.1049/trit.2018.1007

M3 - Article

AN - SCOPUS:85068497703

SN - 2468-6557

VL - 3

SP - 235

EP - 244

JO - CAAI Transactions on Intelligence Technology

JF - CAAI Transactions on Intelligence Technology

IS - 4

ER -
