Explaining the GWSkyNet-Multi machine learning classifier predictions for gravitational-wave events

Gravitational wave sources FOS: Physical sciences Astronomy and Astrophysics General Relativity and Quantum Cosmology (gr-qc) Astrophysics 530 01 natural sciences General Relativity and Quantum Cosmology QB460-466 Space and Planetary Science 0103 physical sciences Convolutional neural networks Gravitational wave astronomy Astrophysics - Instrumentation and Methods for Astrophysics Instrumentation and Methods for Astrophysics (astro-ph.IM)
DOI: 10.48550/arxiv.2308.12357 Publication Date: 2024-03-01
ABSTRACT
Abstract GWSkyNet-Multi is a machine learning model developed for the classification of candidate gravitational-wave events detected by the LIGO and Virgo observatories. The model uses limited information released in the low-latency Open Public Alerts to produce prediction scores indicating whether an event is a merger of two black holes (BHs), a merger involving a neutron star (NS), or a non-astrophysical glitch. This facilitates time-sensitive decisions about whether to perform electromagnetic follow-up of candidate events during LIGO-Virgo-KAGRA (LVK) observing runs. However, it is not well understood how the model is leveraging the limited information available to make its predictions. As a deep learning neural network, the inner workings of the model can be difficult to interpret, impacting our trust in its validity and robustness. We tackle this issue by systematically perturbing the model and its inputs to explain what underlying features and correlations it has learned for distinguishing the sources. We show that the localization area of the 2D sky maps and the computed coherence versus incoherence Bayes factors are used as strong predictors for distinguishing between real events and glitches. The estimated distance to the source is further used to discriminate between binary BH mergers and mergers involving NSs. We leverage these findings to show that events misclassified by GWSkyNet-Multi in LVK’s third observing run have distinct sky areas, coherence factors, and distance values that influence the predictions and explain these misclassifications. The results help identify the model’s limitations and inform potential avenues for further optimization.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....