Abstract: This paper focuses on solving the linear quadratic regulator problem for discrete-time linear systems without knowing system matrices. The classical Q-learning methods for linear systems can ...