Poster: Towards a Publicly Available Framework to Process Traceroutes with MetaTrace - Archive ouverte HAL Accéder directement au contenu
Poster De Conférence Année : 2023

Poster: Towards a Publicly Available Framework to Process Traceroutes with MetaTrace

Résumé

The objective of this research is to contribute towards the development of an open-source framework for processing large-scale traceroute datasets. By providing such a framework, we aim to benefit the community by saving time in everyday traceroute analysis and enabling the design of new scalable reactive measurements, where prior traceroute measurements are leveraged to make informed decisions for future ones. It is important to clarify that our goal is not to surpass proprietary solutions like BigQuery, which are utilized by CDNs for processing billions of traceroutes. These proprietary solutions are not freely accessible to the public, whereas our focus is on creating an open and freely available framework for the wider community. Our contributions include (1) sharing the ideas and thinking process behind building MetaTrace, which efficiently utilizes Click-House features for traceroute processing; and (2) providing an open-source implementation of MetaTrace. We evaluated MetaTrace using two types of queries: predicate queries for filtering traceroutes based on conditions, and aggregate queries for computing metrics on traceroutes. Our results show that MetaTrace is significantly faster compared to alternative solutions. For predicate queries, it outperforms a multiprocessed Rust solution by a factor of 552 and is 3.4 times faster than ClickHouse without MetaTrace optimizations. For aggregate queries, MetaTrace processes 202 million traceroutes in 11 seconds, with its performance scaling linearly with traceroute volume. Notably, on a single server, MetaTrace can perform a predicate query on a 6-year dataset of 6 billion traceroutes in just 240 seconds. Furthermore, MetaTrace is resource-efficient, making it accessible for research groups with limited resources to conduct Internetscale traceroute studies.
Fichier principal
Vignette du fichier
metatrace-extended.pdf (325.16 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04218315 , version 1 (26-09-2023)

Identifiants

  • HAL Id : hal-04218315 , version 1

Citer

Matthieu Gouel, Omar Darwich, Maxime Mouchet, Kevin Vermeulen. Poster: Towards a Publicly Available Framework to Process Traceroutes with MetaTrace. ACM Internet Measurement Conference (IMC 2023), Oct 2023, Montreal (Canada), Canada. 2023. ⟨hal-04218315⟩
61 Consultations
49 Téléchargements

Partager

Gmail Facebook X LinkedIn More