Back to MJ's Publications

Publications of X. Wei
Conference articles
  1. D. Ding, X. Wei, Z. Yang, Z. Wang, and M. R. Jovanovic. Provably efficient generalized Lagrangian policy optimization for safe multi-agent reinforcement learning. In Proceedings of 5th Annual Conference on Learning for Dynamics and Control, volume 211 of Proceedings of Machine Learning Research, Philadelphia, PA, pages 315-332, 2023. Keyword(s): Constrained Markov games, Method of Lagrange multipliers, Minimax optimization, Multi-agent reinforcement learning, Primal-dual policy optimization. [bibtex-entry]

  2. D. Ding, X. Wei, Z. Yang, Z. Wang, and M. R. Jovanovic. Provably efficient safe exploration via primal-dual policy optimization. In 24th International Conference on Artificial Intelligence and Statistics, volume 130, Virtual, pages 3304-3312, 2021. Keyword(s): Safe reinforcement learning, Constrained Markov decision processes, Safe exploration, Proximal policy optimization, Non-convex optimization, Online mirror descent, Primal-dual method. [bibtex-entry]

  3. D. Ding, X. Wei, H. Yu, and M. R. Jovanovic. Byzantine-resilient distributed learning under constraints. In Proceedings of the 2021 American Control Conference, New Orleans, LA, pages 2260-2265, 2021. Keyword(s): Byzantine primal-dual optimization, Constrained optimization, Distributed optimization, Robust statistical learning. [bibtex-entry]

  4. D. Ding, X. Wei, and M. R. Jovanovic. Distributed robust statistical learning: Byzantine mirror descent. In Proceedings of the 58th IEEE Conference on Decision and Control, Nice, France, pages 1822-1827, 2019. Keyword(s): Byzantine mirror descent, Distributed optimization, Dual averaging, Robust statistical learning. [bibtex-entry]

  5. D. Ding, X. Wei, Z. Yang, Z. Wang, and M. R. Jovanovic. Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual method. In Optimization Foundations for Reinforcement Learning Workshop, 33rd Conference on Neural Information Processing Systems, Vancouver, Canada, 2019. Keyword(s): Convex optimization, Distributed temporal-difference learning, Multi-agent systems, Primal-dual algorithms, Reinforcement learning, Stochastic optimization. [bibtex-entry]

Back to MJ's Publications


This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Last modified: Tue Jan 23 11:32:51 2024
Author: mihailo.

This document was translated from BibTEX by bibtex2html