A comparison of big remote sensing data processing with Hadoop MapReduce and Spark - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

A comparison of big remote sensing data processing with Hadoop MapReduce and Spark

I. Chebbi
  • Fonction : Auteur
Wadii Boulila
N. Mellouli
  • Fonction : Auteur
M. Lamolle
  • Fonction : Auteur
I.R. Farah
  • Fonction : Auteur

Résumé

The continuous generation of huge amount of remote sensing (RS) data is becoming a challenging task for researchers due to the 4 Vs characterizing this type of data (volume, variety, velocity and veracity). Many platforms have been proposed to deal with big data in RS field. This paper focus on the comparison of two well-known platforms of big RS data namely Hadoop and Spark. We start by describing the two platforms Hadoop and Spark. The first platform is designed for processing enormous unstructured data in a distributed computing environment. It is composed of two basic elements : 1) Hadoop Distributed file system for storage, and 2) Mapreduce and Yarn for parallel processing, scheduling the jobs and analyzing big RS data. The second platform, Spark, is composed by a set of libraries and uses the resilient distributed data set to overcome the computational complexity. The last part of this paper is devoted to a comparison between the two platforms.
Fichier non déposé

Dates et versions

hal-01930009 , version 1 (21-11-2018)

Identifiants

Citer

I. Chebbi, Wadii Boulila, N. Mellouli, M. Lamolle, I.R. Farah. A comparison of big remote sensing data processing with Hadoop MapReduce and Spark. 2018 4th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP), Mar 2018, Sousse, France. ⟨10.1109/ATSIP.2018.8364497⟩. ⟨hal-01930009⟩
25 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More