Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games
Sublinear function
DOI:
10.48550/arxiv.2306.13673
Publication Date:
2023-01-01
AUTHORS (5)
ABSTRACT
The congestion game is a powerful model that encompasses range of engineering systems such as traffic networks and resource allocation. It describes the behavior group agents who share common set $F$ facilities take actions subsets with $k$ facilities. In this work, we study online formulation games, where participate in repeatedly observe feedback randomness. We propose CongestEXP, decentralized algorithm applies classic exponential weights method. By maintaining on facility level, regret bound CongestEXP avoids dependence size possible sets, i.e., $\binom{F}{k} \approx F^k$, scales only linearly $F$. Specifically, show attains upper $O(kF\sqrt{T})$ for every individual player, $T$ time horizon. On other hand, exploiting growth enables to achieve fast convergence rate. If strict Nash equilibrium exists, can converge policy almost exponentially $O(F\exp(-t^{1-\alpha}))$, $t$ number iterations $\alpha \in (1/2, 1)$.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....