Cui, Kai ; Dayanıklı, Gökçe ; Laurière, Mathieu ; Geist, Matthieu ; Pietquin, Olivier ; Koeppl, Heinz (2024)
Learning Discrete-Time Major-Minor Mean Field Games.
38th AAAI Conference on Artificial Intelligence. Vancouver, Canada (20.02.2024 - 27.02.2024)
doi: 10.26083/tuprints-00028687
Conference or Workshop Item, Secondary publication, Postprint
Text
Cui_et_al_2024_Learning_Discrete-Time_Major-Minor_Mean_Field_Games.pdf Copyright Information: In Copyright. Download (901kB) |
Item Type: | Conference or Workshop Item |
---|---|
Type of entry: | Secondary publication |
Title: | Learning Discrete-Time Major-Minor Mean Field Games |
Language: | English |
Date: | 16 December 2024 |
Place of Publication: | Darmstadt |
Year of primary publication: | 2024 |
Place of primary publication: | Menlo Park, Calif. |
Publisher: | AAAI |
Journal or Publication Title: | Proceedings of the AAAI Conference on Artificial Intelligence |
Volume of the journal: | 38 |
Issue Number: | 9 |
Book Title: | Proceedings of the 38th AAAI Conference on Artificial Intelligence |
Event Title: | 38th AAAI Conference on Artificial Intelligence |
Event Location: | Vancouver, Canada |
Event Dates: | 20.02.2024 - 27.02.2024 |
DOI: | 10.26083/tuprints-00028687 |
Corresponding Links: | |
Origin: | Secondary publication service |
Abstract: | Recent techniques based on Mean Field Games (MFGs) allow the scalable analysis of multi-player games with many similar, rational agents. However, standard MFGs remain limited to homogeneous players that weakly influence each other, and cannot model major players that strongly influence other players, severely limiting the class of problems that can be handled. We propose a novel discrete time version of major-minor MFGs (M3FGs), along with a learning algorithm based on fictitious play and partitioning the probability simplex. Importantly, M3FGs generalize MFGs with common noise and can handle not only random exogeneous environment states but also major players. A key challenge is that the mean field is stochastic and not deterministic as in standard MFGs. Our theoretical investigation verifies both the M3FG model and its algorithmic solution, showing firstly the well-posedness of the M3FG model starting from a finite game of interest, and secondly convergence and approximation guarantees of the fictitious play algorithm. Then, we empirically verify the obtained theoretical results, ablating some of the theoretical assumptions made, and show successful equilibrium learning in three example problems. Overall, we establish a learning framework for a novel and broad class of tractable games. |
Status: | Postprint |
URN: | urn:nbn:de:tuda-tuprints-286878 |
Classification DDC: | 000 Generalities, computers, information > 004 Computer science 600 Technology, medicine, applied sciences > 621.3 Electrical engineering, electronics |
Divisions: | 18 Department of Electrical Engineering and Information Technology > Self-Organizing Systems Lab |
Date Deposited: | 16 Dec 2024 14:02 |
Last Modified: | 16 Dec 2024 14:03 |
URI: | https://tuprints.ulb.tu-darmstadt.de/id/eprint/28687 |
PPN: | |
Export: |
View Item |