Back to MJ's Publications

Publications about 'Nonconvex optimization'
Theses
  1. D. Ding. Provable reinforcement learning for constrained and multi-agent control systems. PhD thesis, University of Southern California, 2022. Keyword(s): Constrained Markov decision processes, Constrained nonconvex optimization, Function approximation, Game-agnostic convergence, Multi-agent reinforcement learning, Multi-agent systems, Natural policy gradient, Policy gradient methods, Proximal policy optimization, Primal-dual algorithms, Reinforcement learning, Safe exploration, Safe reinforcement learning, Sample complexity, Stochastic optimization. [bibtex-entry]


Journal articles
  1. I. K. Ozaslan, H. Mohammadi, and M. R. Jovanovic. Computing stabilizing feedback gains via a model-free policy gradient method. IEEE Control Syst. Lett., 7:407-412, July 2023. Keyword(s): Data-driven control, Gradient descent, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


  2. D. Ding, K. Zhang, J. Duan, T. Basar, and M. R. Jovanovic. Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs. J. Mach. Learn. Res., 2022. Note: Submitted; also arXiv:2206.02346. Keyword(s): Constrained Markov decision processes, Constrained nonconvex optimization, Function approximation, Natural policy gradient, Policy gradient methods, Primal-dual algorithms, Sample complexity. [bibtex-entry]


  3. H. Mohammadi, A. Zare, M. Soltanolkotabi, and M. R. Jovanovic. Convergence and sample complexity of gradient methods for the model-free linear-quadratic regulator problem. IEEE Trans. Automat. Control, 67(5):2435-2450, May 2022. Keyword(s): Data-driven control, Gradient descent, Gradient-flow dynamics, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Polyak-Lojasiewicz inequality, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


  4. H. Mohammadi, M. Soltanolkotabi, and M. R. Jovanovic. On the linear convergence of random search for discrete-time LQR. IEEE Control Syst. Lett., 5(3):989-994, July 2021. Keyword(s): Data-driven control, Gradient descent, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


Conference articles
  1. H. Mohammadi, M. Soltanolkotabi, and M. R. Jovanovic. On the lack of gradient domination for linear quadratic Gaussian problems with incomplete state information. In Proceedings of the 60th IEEE Conference on Decision and Control, Austin, TX, pages 1120-1124, 2021. Keyword(s): Data-driven control, Gradient descent, Gradient-flow dynamics, Model-free control, Nonconvex optimization, Optimization, Optimal control, Polyak-Lojasiewicz inequality, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


  2. D. Ding, K. Zhang, T. Basar, and M. R. Jovanovic. Natural policy gradient primal-dual method for constrained Markov decision processes. In Proceedings of the 34th Conference on Neural Information Processing Systems, volume 33, Vancouver, Canada, pages 8378-8390, 2020. Keyword(s): Constrained Markov decision processes, Constrained nonconvex optimization, Natural policy gradient, Policy gradient methods, Primal-dual algorithms. [bibtex-entry]


  3. H. Mohammadi, M. Soltanolkotabi, and M. R. Jovanovic. Learning the model-free linear quadratic regulator via random search. In Proceedings of Machine Learning Research, 2nd Annual Conference on Learning for Dynamics and Control, volume 120, Berkeley, CA, pages 1-9, 2020. Keyword(s): Data-driven control, Gradient descent, Gradient-flow dynamics, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Polyak-Lojasiewicz inequality, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


  4. H. Mohammadi, M. Soltanolkotabi, and M. R. Jovanovic. Random search for learning the linear quadratic regulator. In Proceedings of the 2020 American Control Conference, Denver, CO, pages 4798-4803, 2020. Keyword(s): Data-driven control, Gradient descent, Gradient-flow dynamics, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Polyak-Lojasiewicz inequality, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]


  5. H. Mohammadi, A. Zare, M. Soltanolkotabi, and M. R. Jovanovic. Global exponential convergence of gradient methods over the nonconvex landscape of the linear quadratic regulator. In Proceedings of the 58th IEEE Conference on Decision and Control, Nice, France, pages 7474-7479, 2019. Keyword(s): Data-driven control, Global exponential stability, Gradient descent, Gradient-flow dynamics, Model-free control, Nonconvex optimization, Optimization, Optimal control, Reinforcement learning. [bibtex-entry]


  6. H. Mohammadi, M. Razaviyayn, and M. R. Jovanovic. On the stability of gradient flow dynamics for a rank-one matrix approximation problem. In Proceedings of the 2018 American Control Conference, Milwaukee, WI, pages 4533-4538, 2018. Keyword(s): Nonconvex optimization, Stability of nonlinear systems, Matrix approximation, Gradient flow dynamics. [bibtex-entry]


  7. S. Hassan-Moghaddam, X. Wu, and M. R. Jovanovic. Edge addition in directed consensus networks. In Proceedings of the 2017 American Control Conference, Seattle, WA, pages 5592-5597, 2017. Keyword(s): Alternating direction method of multipliers, Consensus, Directed networks, Nonconvex optimization, Sparsity-promoting optimal control. [bibtex-entry]


Book chapters
  1. H. Mohammadi, M. Soltanolkotabi, and M. R. Jovanovic. Model-free linear quadratic regulator. In K. G. Vamvoudakis, Y. Wan, F. Lewis, and D. Cansever, editors, Handbook of Reinforcement Learning and Control. Springer International Publishing, 2021. Note: Doi:10.1007/978-3-030-60990-0. Keyword(s): Data-driven control, Gradient descent, Gradient-flow dynamics, Linear quadratic regulator, Model-free control, Nonconvex optimization, Optimization, Optimal control, Polyak-Lojasiewicz inequality, Random search method, Reinforcement learning, Sample complexity. [bibtex-entry]



Back to MJ's Publications




Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.




Last modified: Sun Oct 23 23:45:07 2022
Author: mihailo.


This document was translated from BibTEX by bibtex2html