Skip to main navigation Skip to search Skip to main content

Q-Learning-based control for discrete-time switched affine systems and its application to DC–DC converter

  • Xiaozeng Xu
  • , Yanzheng Zhu
  • , Rongni Yang
  • , Wei Xing Zheng
  • , Jose de Jesus Rubio
  • Shandong University of Science and Technology
  • College of Electrical Engineering and Automation
  • Shandong University
  • Instituto Politécnico Nacional

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, a new data-based Q-learning algorithm is proposed to address the optimal control issue for a class of discrete-time switched affine systems (SASs). The algorithm shifts the emphasis onto learning the optimal switching law directly from system input-output data, employing a neural-network-approximated Q-function as the key learning element. Firstly, the optimal control issue is transformed into solving the corresponding Bellman’s optimality equation based on the Q-function. Then, a new Q-learning algorithm is developed to find the optimal solution of system switching based entirely on the system input-output data, and a fully connected neural network is borrowed as the Q-function approximator. Moreover, considering the affine properties of SASs, the sequence of Q-functions generated remains bounded in proximity to the precise optimal solution. Finally, both the advantage and effectiveness of the proposed Q-learning based optimal control approach are verified by three examples, including a case study of DC-DC buck-boost converter.

Original languageEnglish
Pages (from-to)2206-2215
Number of pages10
JournalIEEE Transactions on Circuits and Systems I: Regular Papers
Volume73
Issue number3
DOIs
Publication statusE-pub ahead of print (In Press) - 2025

Keywords

  • Data-based control
  • neural network
  • optimal control
  • Q-learning
  • switched affine systems

Fingerprint

Dive into the research topics of 'Q-Learning-based control for discrete-time switched affine systems and its application to DC–DC converter'. Together they form a unique fingerprint.

Cite this