Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms

FOS: Computer and information sciences Computer Science - Cryptography and Security 02 engineering and technology Computer Science - Information Retrieval H.3.8 H.5.4 0202 electrical engineering, electronic engineering, information engineering I.9.4 Cryptography and Security (cs.CR) H.3.11 Information Retrieval (cs.IR) H.3.8; H.5.4; H.3.11; I.9.4
DOI: 10.48550/arxiv.1902.04843 Publication Date: 2019-01-01
ABSTRACT
In many software applications, logs serve as the only interface between the application and the developer. However, navigating through the logs of long-running applications is often challenging. Logs from previously successful application runs can be leveraged to automatically identify errors and provide users with only the logs that are relevant to the debugging process. We describe a privacy preserving framework which can be employed by Platform as a Service (PaaS) providers to utilize the user logs generated on the platform while protecting the potentially sensitive logged data. Further, in order to accurately and scalably parse log lines, we present a distributed log parsing algorithm which leverages Locality Sensitive Hashing (LSH). We outperform the state-of-the-art on multiple datasets. We further demonstrate the scalability of Delog on publicly available Thunderbird log dataset with close to 27,000 unique patterns and 211 million lines.<br/>11 pages, 9 Tables, 7 figures<br/>
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....