Oblique decision trees for spatial pattern detection: optimal algorithm and application to malaria risk. - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue BMC Medical Research Methodology Année : 2005

Oblique decision trees for spatial pattern detection: optimal algorithm and application to malaria risk.

Résumé

BACKGROUND: In order to detect potential disease clusters where a putative source cannot be specified, classical procedures scan the geographical area with circular windows through a specified grid imposed to the map. However, the choice of the windows' shapes, sizes and centers is critical and different choices may not provide exactly the same results. The aim of our work was to use an Oblique Decision Tree model (ODT) which provides potential clusters without pre-specifying shapes, sizes or centers. For this purpose, we have developed an ODT-algorithm to find an oblique partition of the space defined by the geographic coordinates. METHODS: ODT is based on the classification and regression tree (CART). As CART finds out rectangular partitions of the covariate space, ODT provides oblique partitions maximizing the interclass variance of the independent variable. Since it is a NP-Hard problem in RN, classical ODT-algorithms use evolutionary procedures or heuristics. We have developed an optimal ODT-algorithm in R2, based on the directions defined by each couple of point locations. This partition provided potential clusters which can be tested with Monte-Carlo inference. We applied the ODT-model to a dataset in order to identify potential high risk clusters of malaria in a village in Western Africa during the dry season. The ODT results were compared with those of the Kulldorff' s SaTScan. RESULTS: The ODT procedure provided four classes of risk of infection. In the first high risk class 60%, 95% confidence interval (CI95%) [52.22-67.55], of the children was infected. Monte-Carlo inference showed that the spatial pattern issued from the ODT-model was significant (p < 0.0001). Satscan results yielded one significant cluster where the risk of disease was high with an infectious rate of 54.21%, CI95% [47.51-60.75]. Obviously, his center was located within the first high risk ODT class. Both procedures provided similar results identifying a high risk cluster in the western part of the village where a mosquito breeding point was located. CONCLUSION: ODT-models improve the classical scanning procedures by detecting potential disease clusters independently of any specification of the shapes, sizes or centers of the clusters.
Fichier principal
Vignette du fichier
1471-2288-5-22.pdf (283.75 Ko) Télécharger le fichier
Loading...

Dates et versions

inserm-00090279 , version 1 (29-08-2006)

Identifiants

Citer

Jean Gaudart, Belco Poudiougou, Stéphane Ranque, Ogobara K. Doumbo. Oblique decision trees for spatial pattern detection: optimal algorithm and application to malaria risk.. BMC Medical Research Methodology, 2005, 5, pp.22. ⟨10.1186/1471-2288-5-22⟩. ⟨inserm-00090279⟩
89 Consultations
318 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More