Quantifying Paedophile Queries in a Large P2P System

Matthieu Latapy 1 Clémence Magnien 1 Raphaël Fournier 1
1 ComplexNetworks
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : Increasing knowledge of paedophile activity in P2P systems is a crucial societal concern, with important consequences on child protection, policy making, and internet regulation. Because of a lack of traces of P2P exchanges and rigorous analysis methodology, however, current knowledge of this activity remains very limited. We consider here a widely used P2P system, eDonkey, and focus on two key statistics: the fraction of paedophile queries entered in the system and the fraction of users who entered such queries. We collect hundreds of millions of keyword-based queries; we design a paedophile query detection tool for which we establish false positive and false negative rates using assessment by experts; with this tool and these rates, we then estimate the fraction of paedophile queries in our data. We conclude that approximately 0.25% of queries are paedophile. Our statistics are by far the most precise and reliable ever obtained in this domain.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-00650336
Contributor : Raphaël Fournier-S'Niehotta <>
Submitted on : Friday, December 9, 2011 - 6:25:01 PM
Last modification on : Thursday, March 21, 2019 - 2:16:45 PM

Identifiers

Citation

Matthieu Latapy, Clémence Magnien, Raphaël Fournier. Quantifying Paedophile Queries in a Large P2P System. IEEE International Conference on Computer Communications INFOCOM (Mini-Conference), Apr 2011, Shanghai, China. pp.401-405, ⟨10.1109/INFCOM.2011.5935191⟩. ⟨hal-00650336⟩

Share

Metrics

Record views

82