Suphx: Mastering Mahjong with Deep Reinforcement Learning
0301 basic medicine
FOS: Computer and information sciences
03 medical and health sciences
Artificial Intelligence (cs.AI)
Computer Science - Artificial Intelligence
DOI:
10.48550/arxiv.2003.13590
Publication Date:
2020-01-01
AUTHORS (10)
ABSTRACT
Artificial Intelligence (AI) has achieved great success in many domains, and game AI is widely regarded as its beachhead since the dawn of AI. In recent years, studies on have gradually evolved from relatively simple environments (e.g., perfect-information games such Go, chess, shogi or two-player imperfect-information heads-up Texas hold'em) to more complex ones multi-player hold'em StartCraft II). Mahjong a popular worldwide but very challenging for research due playing/scoring rules rich hidden information. We design an Mahjong, named Suphx, based deep reinforcement learning with some newly introduced techniques including global reward prediction, oracle guiding, run-time policy adaptation. Suphx demonstrated stronger performance than most top human players terms stable rank rated above 99.99% all officially ranked Tenhou platform. This first time that computer program outperforms Mahjong.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....