An Abelian Theorem for a Markov Decision Process in a System of Interacting Objects with Unknown Random Disturbance Law

José Daniel López-Barrientos, José Manuel Mendoza-Madrid, Paola Friné González-Vega

Research output: Contribution to journalArticlepeer-review

Abstract

This paper studies a mean-field approach for Markov decision processes in a class of systems of a large number of objects that interact with each other according to an observable -but unknown- law for the central controller. The central controller acts under the ergodic cost criterion with Borei state and control spaces, bounded costs, and compact action space. We depart from the characterization of the discounted optimal strategies, and then, by means of an Abelian theorem, we study the existence of average cost optimal stationary policies in the original model. We also analyze the performance of the mean-field limit optimal policies in the original model.

Original languageEnglish
Pages (from-to)763-782
Number of pages20
JournalPure and Applied Functional Analysis
Volume9
Issue number3
StatePublished - 1 Jan 2024
Externally publishedYes

Keywords

  • Abelian theorems
  • Discounted and ergodic performance criteria
  • Mean-field theory
  • Robustness of estimation

Fingerprint

Dive into the research topics of 'An Abelian Theorem for a Markov Decision Process in a System of Interacting Objects with Unknown Random Disturbance Law'. Together they form a unique fingerprint.

Cite this