NFDI4DS | UHH-SEMS - Publication Details

Identifying suspicious URLs

0202 electrical engineering, electronic engineering, information engineering 02 engineering and technology

DOI: 10.1145/1553374.1553462 Publication Date: 2009-06-16T13:34:36Z

Abstract Supplemental Material References Cited by

AUTHORS (4)

Justin Ma

Lawrence K. Saul

Stefan Savage

Geoffrey M. Voelker

ABSTRACT

This paper explores online learning approaches for detecting malicious Web sites (those involved in criminal scams) using lexical and host-based features of the associated URLs. We show that this application is particularly appropriate for online algorithms as the size of the training data is larger than can be efficiently processed in batch and because the distribution of features that typify malicious URLs is changing continuously. Using a real-time system we developed for gathering URL features, combined with a real-time source of labeled URLs from a large Web mail provider, we demonstrate that recently-developed online algorithms can be as accurate as batch techniques, achieving classification accuracies up to 99% over a balanced data set.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES (22)

CITATIONS (300)

EXTERNAL LINKS

OPENAIRE - Products CROSSREF - Publications

PlumX Metrics

Identifying suspicious URLs

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....