How does Seurat handle sparsity in scRNAseq data #5011
-
Recently, it's been suggested that zero-inflated models are not required for scRNAseq UMI data (Valentine, 2020; Cao et al., 2021). In other words, the zeros in UMI data are sufficiently described by non-zero-inflated models, such as the Poisson or negative binomial. Does this mean that SCTransform, which is based on the negative binomial model, is sufficient to handle scRNAseq data sparsity? In a similar vein, given that something like LogNormalize does not explicitly model zeros, why is it still popular? Thanks for helping me understand!
-
As you point out through the linked references, the negative binomial is sufficient (and necessary) for explaining the number of zeros. LogNormalize is meant to achieve the same objectives (1. correct for sequencing depth via the scaling step, and 2. reduce the impact of outliers via the log step), but it can dampen biological variance at the same time (some slides here). Its popularity probably comes from ease of implementation (no explicit statistical estimation required).
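To make the two steps concrete, here is a minimal sketch of a LogNormalize-style transform (not Seurat's actual implementation, which operates on sparse matrices in R): divide each cell's counts by its total depth, multiply by a scale factor (10,000 is a common default), then apply log1p. Note that zeros pass through unchanged, which is why this approach involves no explicit modeling of sparsity.

```python
import numpy as np

def log_normalize(counts, scale_factor=10_000):
    """counts: genes x cells matrix of raw UMI counts."""
    depth = counts.sum(axis=0, keepdims=True)   # per-cell sequencing depth
    scaled = counts / depth * scale_factor      # step 1: depth correction
    return np.log1p(scaled)                     # step 2: dampen outliers

# toy example: 3 genes x 2 cells
counts = np.array([[10, 0],
                   [0, 5],
                   [90, 45]], dtype=float)
norm = log_normalize(counts)  # zero entries remain exactly zero
```

Because a zero count maps to log1p(0) = 0 regardless of depth, the transform simply passes sparsity along rather than modeling it, which is consistent with the point above that no zero-inflation machinery is needed.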