Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, Epiciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Identifying potential significant factors impacting zero-inflated proportions data

Abstract : Managing epidemics requires to investigate potential impact of risk and protective factors on epidemiological links. Here we focus on links defined by inferred probabilities (transmission links in Equine Influenza, similarity measures of COVID-19 dynamics between different countries). The specific nature of these epidemiological data (zero-inflated, correlated, continuous and bounded) does not allow to use classical supervised methods like linear regression or decision tree to identify impacting factors on the response variable. In this article we propose a by block-permutation-based methodology (i) to identify factors (discrete or continuous) that are potentially significant, (ii) to define a performance indicator to quantify the percentage of correlation explained by the significant factors subset. The methodology is illustrated on simulated data and on the above-mentioned epidemics.
Document type :
Preprints, Working Papers, ...
Complete list of metadata
Contributor : Mélina Ribaud Connect in order to contact the contributor
Submitted on : Friday, January 29, 2021 - 3:43:56 PM
Last modification on : Friday, November 19, 2021 - 4:01:29 PM


Files produced by the author(s)


  • HAL Id : hal-02936779, version 3



Melina Ribaud, Edith Gabriel, Joseph Hughes, Samuel Soubeyrand. Identifying potential significant factors impacting zero-inflated proportions data. 2021. ⟨hal-02936779v3⟩



Record views


Files downloads