Abstract
In this paper, a new data-based Q-learning algorithm is proposed to address the optimal control issue for a class of discrete-time switched affine systems (SASs). The algorithm shifts the emphasis onto learning the optimal switching law directly from system input-output data, employing a neural-network-approximated Q-function as the key learning element. Firstly, the optimal control issue is transformed into solving the corresponding Bellman’s optimality equation based on the Q-function. Then, a new Q-learning algorithm is developed to find the optimal solution of system switching based entirely on the system input-output data, and a fully connected neural network is borrowed as the Q-function approximator. Moreover, considering the affine properties of SASs, the sequence of Q-functions generated remains bounded in proximity to the precise optimal solution. Finally, both the advantage and effectiveness of the proposed Q-learning based optimal control approach are verified by three examples, including a case study of DC-DC buck-boost converter.
| Original language | English |
|---|---|
| Pages (from-to) | 2206-2215 |
| Number of pages | 10 |
| Journal | IEEE Transactions on Circuits and Systems I: Regular Papers |
| Volume | 73 |
| Issue number | 3 |
| DOIs | |
| Publication status | E-pub ahead of print (In Press) - 2025 |
Keywords
- Data-based control
- neural network
- optimal control
- Q-learning
- switched affine systems
Fingerprint
Dive into the research topics of 'Q-Learning-based control for discrete-time switched affine systems and its application to DC–DC converter'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver