CART algorithm for spatial data: Application to environmental and ecological data

Liliane Bel; Denis Allard; J.M. Laurent; R. Cheddadi; Avner Bar-Hen

doi:10.1016/j.csda.2008.09.012

Journal article, Computational Statistics and Data Analysis, Year: 2009

Liliane Bel

Role: Author
PersonId: 15267
IdHAL: liliane-bel
ORCID: 0000-0003-0613-220X
IdRef: 031129196

Mathématiques et Informatique Appliquées

Denis Allard

Role: Author

Biostatistique et Processus Spatiaux

J.M. Laurent

Role: Author

Institut des Sciences de l'Evolution de Montpellier

R. Cheddadi

Role: Author
PersonId: 756162
ORCID: 0000-0001-5652-8718
IdRef: 033612668

Institut des Sciences de l'Evolution de Montpellier

Avner Bar-Hen

Role: Author
PersonId: 182343
IdHAL: avner-bar-hen
ORCID: 0000-0002-4449-8117
IdRef: 103914579

Mathématiques et Informatique Appliquées

Mathématiques Appliquées Paris 5

Abstract

Most statistical learning techniques, such as Classification And Regression Trees (CART), assume independent samples when computing classification rules. This assumption is convenient for estimating the quantities involved in the algorithm and for assessing the asymptotic properties of estimators. In many environmental or ecological applications, however, the data under study are a sample of regionalized variables, which can be modeled as random fields with spatial dependence. When the sampling scheme is very irregular, a direct application of supervised classification algorithms leads to biased discriminant rules due, for example, to the possible oversampling of some areas. The CART algorithm is adapted to the case of spatially dependent samples, focusing on environmental and ecological applications. Two approaches are considered. The first takes the irregularity of the sampling into account by weighting the data according to their spatial pattern, using two existing methods based on Voronoï tessellation and on a regular grid, and one original method based on kriging. The second uses spatial estimates of the quantities involved in the construction of the discriminant rule at each step of the algorithm. These methods are tested on simulations and on a classical dataset to highlight their advantages and drawbacks. They are then applied to an ecological dataset to explore the relationship between pollen data and the presence/absence of tree species, which is an important question for climate reconstruction based on paleoecological data.
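To make the first approach concrete, here is a minimal sketch of the regular-grid weighting idea in Python. It is an illustration under stated assumptions, not the authors' implementation: the function names `cell_declustering_weights` and `weighted_gini`, the cell size, and the toy coordinates are all hypothetical. Each point is weighted inversely to the number of samples sharing its grid cell, and the weights then enter an impurity measure of the kind CART uses to score splits (the Gini index is assumed here).

```python
from collections import Counter
from math import floor


def cell_declustering_weights(points, cell_size):
    """Weight each point by 1 / (occupied cells * points in its cell).

    Assumption: a square grid anchored at the origin. Weights sum to 1,
    and every occupied cell carries the same total weight.
    """
    cells = [(floor(x / cell_size), floor(y / cell_size)) for x, y in points]
    counts = Counter(cells)
    n_occupied = len(counts)
    return [1.0 / (n_occupied * counts[c]) for c in cells]


def weighted_gini(labels, weights):
    """Gini impurity with class proportions computed from sample weights."""
    total = sum(weights)
    if total == 0:
        return 0.0
    mass = Counter()
    for label, w in zip(labels, weights):
        mass[label] += w
    return 1.0 - sum((m / total) ** 2 for m in mass.values())


# Toy data: three clustered samples of class 1, one isolated sample of class 0.
points = [(0.1, 0.1), (0.2, 0.3), (0.3, 0.2), (2.5, 2.5)]
labels = [1, 1, 1, 0]
weights = cell_declustering_weights(points, cell_size=1.0)

print(weights)
print(weighted_gini(labels, [1.0] * len(labels)))  # equal weights
print(weighted_gini(labels, weights))              # declustered weights
```

In this toy case the three clustered points share one cell, so together they receive the same total weight as the single isolated point, and the oversampled area no longer dominates the class proportions. The Voronoï and kriging weightings mentioned in the abstract would replace only the weight computation; they are not sketched here.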

Keywords

regression trees algorithm ecology statistics

Domains

Computation [stat.CO]

Contributor: Archive Ouverte ProdInra

https://hal.science/hal-01197560

Submitted on: Friday, 11 September 2015, 20:06:04

Last modified on: Friday, 19 April 2024, 16:18:56

Dates and versions

hal-01197560, version 1 (11-09-2015)

Identifiers

HAL Id: hal-01197560, version 1
DOI: 10.1016/j.csda.2008.09.012
PRODINRA: 51016
WOS: 000265571000024

Cite

Liliane Bel, Denis Allard, J.M. Laurent, R. Cheddadi, Avner Bar-Hen. CART algorithm for spatial data: Application to environmental and ecological data. Computational Statistics and Data Analysis, 2009, 53 (8), pp.3082-3093. ⟨10.1016/j.csda.2008.09.012⟩. ⟨hal-01197560⟩

Collections

CIRAD AGROPARISTECH EPHE CNRS INRA ISEM MAP5 MIA-PARIS AGROPOLIS PSL B3ESTE UNIV-MONTPELLIER INRAE UP-SCIENCES MATHNUM INRAEPACA
