Reinforcement Learning-Based Control of Nonlinear Systems Using Lyapunov Stability Concept and Fuzzy Reward Scheme

Ming Chen, Hak Keung Lam*, Qian Shi, Bo Xiao

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review


Abstract

In this brief, a reinforcement learning-based control approach for nonlinear systems is presented. The approach introduces an adjustable policy learning rate (APLR) scheme that reduces the influence of negative or large advantages, thereby improving the learning stability of the proximal policy optimization (PPO) algorithm. The brief also puts forward a Lyapunov-fuzzy reward system to further improve learning efficiency: the Lyapunov stability concept is incorporated into the design of the Lyapunov reward system, and a fuzzy reward system is constructed from knowledge of the cart-pole inverted pendulum using a fuzzy inference system (FIS). The merits of the proposed approach are validated by simulation examples.
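The idea of embedding the Lyapunov stability concept into the reward can be illustrated with a small sketch: choose a Lyapunov-like candidate function V over the cart-pole state and reward the agent whenever a transition decreases V (i.e., moves the system toward the upright equilibrium). The candidate function, its weights, and the ±1 reward shape below are illustrative assumptions, not the authors' exact formulation from the brief.

```python
def lyapunov_candidate(state, w_angle=1.0, w_omega=0.1):
    """Quadratic Lyapunov-like candidate V(x) over pole angle and angular velocity.

    The weights w_angle and w_omega are hypothetical tuning choices.
    """
    _, _, theta, omega = state  # (cart position, cart velocity, pole angle, pole angular velocity)
    return w_angle * theta**2 + w_omega * omega**2


def lyapunov_reward(state, next_state):
    """Return +1 when V decreases along the transition (progress toward
    the upright equilibrium), -1 otherwise."""
    dv = lyapunov_candidate(next_state) - lyapunov_candidate(state)
    return 1.0 if dv < 0.0 else -1.0


# Example: the pole moves closer to upright, so V decreases and the reward is positive.
s_before = (0.0, 0.0, 0.2, 0.5)
s_after = (0.0, 0.0, 0.1, 0.2)
print(lyapunov_reward(s_before, s_after))  # -> 1.0
```

In the brief this Lyapunov reward is paired with a fuzzy reward produced by an FIS encoding cart-pole knowledge; a full implementation would blend the two signals rather than use the Lyapunov term alone.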

Original language: English
Article number: 8871158
Pages (from-to): 2059-2063
Number of pages: 5
Journal: IEEE Transactions on Circuits and Systems II: Express Briefs
Volume: 67
Issue number: 10
DOIs
Publication status: Published - Oct 2020

Keywords

  • adjustable policy learning rate (APLR)
  • cart-pole inverted pendulum
  • fuzzy reward system
  • Lyapunov reward system
  • proximal policy optimization (PPO)
