Failure Diagnosis in Microservice Systems: A Comprehensive Survey and Analysis

Microservices
DOI: 10.1145/3715005 Publication Date: 2025-01-23T15:59:04Z
ABSTRACT
Widely adopted for their scalability and flexibility, modern microservice systems present unique failure diagnosis challenges due to independent deployment dynamic interactions. This complexity can lead cascading failures that negatively impact operational efficiency user experience. Recognizing the critical role of fault in improving stability reliability systems, researchers have conducted extensive studies achieved a number significant results. survey provides an exhaustive review 98 scientific papers from 2003 present, including thorough examination elucidation fundamental concepts, system architecture, problem statement. It also includes qualitative analysis dimensions, providing in-depth discussion current best practices future directions, aiming further its development application. In addition, this compiles publicly available datasets, toolkits, evaluation metrics facilitate selection validation techniques practitioners.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (171)
CITATIONS (3)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....