Back to MJ's Publications
Publications about 'Reinforcement learning'


D. Ding.
Provable reinforcement learning for constrained and multiagent control systems.
PhD thesis,
University of Southern California,
2022.
Keyword(s): Constrained Markov decision processes,
Constrained nonconvex optimization,
Function approximation,
Gameagnostic convergence,
Multiagent reinforcement learning,
Multiagent systems,
Natural policy gradient,
Policy gradient methods,
Proximal policy optimization,
Primaldual algorithms,
Reinforcement learning,
Safe exploration,
Safe reinforcement learning,
Sample complexity,
Stochastic optimization.
[bibtexentry]

I. K. Ozaslan,
H. Mohammadi,
and M. R. Jovanovic.
Computing stabilizing feedback gains via a modelfree policy gradient method.
IEEE Control Syst. Lett.,
7:407412,
July 2023.
Keyword(s): Datadriven control,
Gradient descent,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

H. Mohammadi,
A. Zare,
M. Soltanolkotabi,
and M. R. Jovanovic.
Convergence and sample complexity of gradient methods for the modelfree linearquadratic regulator problem.
IEEE Trans. Automat. Control,
67(5):24352450,
May 2022.
Keyword(s): Datadriven control,
Gradient descent,
Gradientflow dynamics,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
PolyakLojasiewicz inequality,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

H. Mohammadi,
M. Soltanolkotabi,
and M. R. Jovanovic.
On the linear convergence of random search for discretetime LQR.
IEEE Control Syst. Lett.,
5(3):989994,
July 2021.
Keyword(s): Datadriven control,
Gradient descent,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

D. Ding,
X. Wei,
Z. Yang,
Z. Wang,
and M. R. Jovanovic.
Fast multiagent temporaldifference learning via homotopy stochastic primaldual optimization.
IEEE Trans. Automat. Control,
2020.
Note: Submitted; also arXiv:1908.02805.
Keyword(s): Convex optimization,
Distributed temporaldifference learning,
Multiagent systems,
Primaldual algorithms,
Reinforcement learning,
Stochastic optimization.
[bibtexentry]

D. Ding,
C.Y. Wei,
K. Zhang,
and M. R. Jovanovic.
Independent policy gradient for largescale Markov potential games: sharper rates, function approximation, and gameagnostic convergence.
In Proceedings of the 39th International Conference on Machine Learning,
volume 162 of Proceedings of Machine Learning Research,
Baltimore, MD,
pages 51665220,
2022.
Keyword(s): Multiagent reinforcement learning,
Independent reinforcement learning,
Policy gradient methods,
Markov potential games,
Function approximation,
Gameagnostic convergence.
[bibtexentry]

D. Ding,
X. Wei,
Z. Yang,
Z. Wang,
and M. R. Jovanovic.
Provably efficient safe exploration via primaldual policy optimization.
In 24th International Conference on Artificial Intelligence and Statistics,
volume 130,
Virtual,
pages 33043312,
2021.
Keyword(s): Safe reinforcement learning,
Constrained Markov decision processes,
Safe exploration,
Proximal policy optimization,
Nonconvex optimization,
Online mirror descent,
Primaldual method.
[bibtexentry]

H. Mohammadi,
M. Soltanolkotabi,
and M. R. Jovanovic.
On the lack of gradient domination for linear quadratic Gaussian problems with incomplete state information.
In Proceedings of the 60th IEEE Conference on Decision and Control,
Austin, TX,
pages 11201124,
2021.
Keyword(s): Datadriven control,
Gradient descent,
Gradientflow dynamics,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
PolyakLojasiewicz inequality,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

H. Mohammadi,
M. Soltanolkotabi,
and M. R. Jovanovic.
Learning the modelfree linear quadratic regulator via random search.
In Proceedings of Machine Learning Research, 2nd Annual Conference on Learning for Dynamics and Control,
volume 120,
Berkeley, CA,
pages 19,
2020.
Keyword(s): Datadriven control,
Gradient descent,
Gradientflow dynamics,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
PolyakLojasiewicz inequality,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

H. Mohammadi,
M. Soltanolkotabi,
and M. R. Jovanovic.
Random search for learning the linear quadratic regulator.
In Proceedings of the 2020 American Control Conference,
Denver, CO,
pages 47984803,
2020.
Keyword(s): Datadriven control,
Gradient descent,
Gradientflow dynamics,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
PolyakLojasiewicz inequality,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]

D. Ding,
X. Wei,
Z. Yang,
Z. Wang,
and M. R. Jovanovic.
Fast multiagent temporaldifference learning via homotopy stochastic primaldual method.
In Optimization Foundations for Reinforcement Learning Workshop, 33rd Conference on Neural Information Processing Systems,
Vancouver, Canada,
2019.
Keyword(s): Convex optimization,
Distributed temporaldifference learning,
Multiagent systems,
Primaldual algorithms,
Reinforcement learning,
Stochastic optimization.
[bibtexentry]

S. HassanMoghaddam,
M. R. Jovanovic,
and S. Meyn.
Datadriven proximal algorithms for the design of structured optimal feedback gains.
In Proceedings of the 2019 American Control Conference,
Philadelphia, PA,
pages 58465850,
2019.
Keyword(s): Datadriven feedback design,
Largescale systems,
Nonsmooth optimization,
Proximal algorithms,
Reinforcement learning,
Sparsitypromoting optimal control,
Structured optimal control.
[bibtexentry]

H. Mohammadi,
A. Zare,
M. Soltanolkotabi,
and M. R. Jovanovic.
Global exponential convergence of gradient methods over the nonconvex landscape of the linear quadratic regulator.
In Proceedings of the 58th IEEE Conference on Decision and Control,
Nice, France,
pages 74747479,
2019.
Keyword(s): Datadriven control,
Global exponential stability,
Gradient descent,
Gradientflow dynamics,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
Reinforcement learning.
[bibtexentry]

H. Mohammadi,
M. Soltanolkotabi,
and M. R. Jovanovic.
Modelfree linear quadratic regulator.
In K. G. Vamvoudakis,
Y. Wan,
F. Lewis,
and D. Cansever, editors, Handbook of Reinforcement Learning and Control.
Springer International Publishing,
2021.
Note: Doi:10.1007/9783030609900.
Keyword(s): Datadriven control,
Gradient descent,
Gradientflow dynamics,
Linear quadratic regulator,
Modelfree control,
Nonconvex optimization,
Optimization,
Optimal control,
PolyakLojasiewicz inequality,
Random search method,
Reinforcement learning,
Sample complexity.
[bibtexentry]
Back to MJ's Publications
Disclaimer:
This material is presented to ensure timely dissemination of
scholarly and technical work. Copyright and all rights therein
are retained by authors or by other copyright holders.
All person copying this information are expected to adhere to
the terms and constraints invoked by each author's copyright.
In most cases, these works may not be reposted
without the explicit permission of the copyright holder.
Last modified: Sun Oct 23 23:45:07 2022
Author: mihailo.
This document was translated from BibT_{E}X by
bibtex2html