TY - JOUR
T1 - An Abelian Theorem for a Markov Decision Process in a System of Interacting Objects with Unknown Random Disturbance Law
AU - López-Barrientos, José Daniel
AU - Mendoza-Madrid, José Manuel
AU - González-Vega, Paola Friné
N1 - Publisher Copyright:
© 2024, Yokohama Publications. All rights reserved.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - This paper studies a mean-field approach for Markov decision processes in a class of systems of a large number of objects that interact with each other according to an observable -but unknown- law for the central controller. The central controller acts under the ergodic cost criterion with Borei state and control spaces, bounded costs, and compact action space. We depart from the characterization of the discounted optimal strategies, and then, by means of an Abelian theorem, we study the existence of average cost optimal stationary policies in the original model. We also analyze the performance of the mean-field limit optimal policies in the original model.
AB - This paper studies a mean-field approach for Markov decision processes in a class of systems of a large number of objects that interact with each other according to an observable -but unknown- law for the central controller. The central controller acts under the ergodic cost criterion with Borei state and control spaces, bounded costs, and compact action space. We depart from the characterization of the discounted optimal strategies, and then, by means of an Abelian theorem, we study the existence of average cost optimal stationary policies in the original model. We also analyze the performance of the mean-field limit optimal policies in the original model.
KW - Abelian theorems
KW - Discounted and ergodic performance criteria
KW - Mean-field theory
KW - Robustness of estimation
UR - http://www.scopus.com/inward/record.url?scp=85208424688&partnerID=8YFLogxK
M3 - Artículo
AN - SCOPUS:85208424688
SN - 2189-3756
VL - 9
SP - 763
EP - 782
JO - Pure and Applied Functional Analysis
JF - Pure and Applied Functional Analysis
IS - 3
ER -