Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret

Asaf Cassel*, Tomer Koren*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Fingerprint

Dive into the research topics of 'Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret'. Together they form a unique fingerprint.

Keyphrases

Mathematics