Inconsistent filtered genes before and after scv.utils.merge function #471

jipeifeng · 2020-11-14T05:10:08Z

jipeifeng
Nov 14, 2020

Hi:
Thanks for the helpful tools.
I have came across an issue at the preprocessing step, particularly at the filter_and_normalize function.

Test 1:
I merged all the loom files and run the filter step:

Test 2:
I merged all the velocity output files (loom files), merged the loom files into an already existing AnnData object, and run the filter step:

And you can see that the number of filtered genes are inconsistent with each other. And would you please point what is the difference. Moreover, should I merge the Anndata object with scvelo object (using scv.utils.merge function) before or after performing the preprocessing step (scv.pp.filter_and_normalize).
Thank you very much !

WeilerP · 2021-05-30T08:19:47Z

WeilerP
May 30, 2021
Maintainer

You should first merge, then preprocess the data.

Regarding the different results after filtering: Let's say you detected gene A 5 times (in unspliced) in your first dataset but 0 times in the second. Suppose you are now filtering only based on unspliced counts via

scv.filter_and_normalize(adata, min_counts_u=5)

Filtering the two datasets individually, gene A will not be filtered out in the first dataset. It is, however, filtered out in the second one. If you then merge the two datasets using an inner merge, you will not find gene A in the final AnnData object.
Merging the two datasets prior to filtering, will lead to gene A not being filtered as it is detected at least 5 times.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistent filtered genes before and after scv.utils.merge function #471

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Inconsistent filtered genes before and after scv.utils.merge function #471

jipeifeng Nov 14, 2020

Replies: 1 comment

WeilerP May 30, 2021 Maintainer

jipeifeng
Nov 14, 2020

WeilerP
May 30, 2021
Maintainer