Prophiler: a fast filter for the large-scale detection of malicious web pages - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Prophiler: a fast filter for the large-scale detection of malicious web pages

Résumé

Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and, subsequently, for creating large-scale botnets. In a drive-by-download exploit, an attacker embeds a malicious script (typically written in JavaScript) into a web page. When a victim visits this page, the script is executed and attempts to compromise the browser or one of its plugins. To detect drive-by-download exploits, researchers have developed a number of systems that analyze web pages for the presence of malicious code. Most of these systems use dynamic analysis. That is, they run the scripts associated with a web page either directly in a real browser (running in a virtualized environment) or in an emulated browser, and they monitor the scripts' executions for malicious activity. While the tools are quite precise, the analysis process is costly, often requiring in the order of tens of seconds for a single page. Therefore, performing this analysis on a large set of web pages containing hundreds of millions of samples can be prohibitive. One approach to reduce the resources required for performing large-scale analysis of malicious web pages is to develop a fast and reliable filter that can quickly discard pages that are benign, forwarding to the costly analysis tools only the pages that are likely to contain malicious code. In this paper, we describe the design and implementation of such a filter. Our filter, called Prophiler, uses static analysis techniques to quickly examine a web page for malicious content. This analysis takes into account features derived from the HTML contents of a page, from the associated JavaScript code, and from the corresponding URL. We automatically derive detection models that use these features using machine-learning techniques applied to labeled datasets. To demonstrate the effectiveness and efficiency of Prophiler, we crawled and collected millions of pages, which we analyzed for malicious behavior. Our results show that our filter is able to reduce the load on a more costly dynamic analysis tools by more than 85%, with a negligible amount of missed malicious pages.

Domaines

Informatique
Fichier principal
Vignette du fichier
www2011_Prophiler_a_fast_filter_for_the_large_scale_detection_of_malicious_web_pages.pdf (670.13 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00727271 , version 1 (03-09-2012)

Identifiants

Citer

Davide Canali, Marco Cova, Giovanni Vigna, Christopher Kruegel. Prophiler: a fast filter for the large-scale detection of malicious web pages. Proceedings of the 20th international conference on World wide web, Mar 2011, Hyderabad, India. pp.197--206, ⟨10.1145/1963405.1963436⟩. ⟨hal-00727271⟩

Collections

EURECOM
197 Consultations
1624 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More