DropLasso: A robust variant of Lasso for single cell RNA-seq data

Abstract : Single-cell RNA sequencing (scRNA-seq) is a fast growing approach to measure the genome-wide transcriptome of many individual cells in parallel, but results in noisy data with many dropout events. Existing methods to learn molecular signatures from bulk transcriptomic data may therefore not be adapted to scRNA-seq data, in order to automatically classify individual cells into predefined classes. We propose a new method called DropLasso to learn a molecular signature from scRNA-seq data. DropLasso extends the dropout regularisation technique, popular in neural network training, to esti- mate sparse linear models. It is well adapted to data corrupted by dropout noise, such as scRNA-seq data, and we clarify how it relates to elastic net regularisation. We provide promising results on simulated and real scRNA-seq data, suggesting that DropLasso may be better adapted than standard regularisa- tions to infer molecular signatures from scRNA-seq data.
Complete list of metadatas

Cited literature [44 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01716704
Contributor : Jean-Philippe Vert <>
Submitted on : Sunday, June 2, 2019 - 11:00:52 PM
Last modification on : Sunday, June 9, 2019 - 1:36:09 AM

File

techreport_modified.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01716704, version 2
  • ARXIV : 1802.09381

Citation

Beyrem Khalfaoui, Jean-Philippe Vert. DropLasso: A robust variant of Lasso for single cell RNA-seq data. 2019. ⟨hal-01716704v2⟩

Share

Metrics

Record views

42

Files downloads

134