Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

InferCNV from STdeconvolve data #53

Open
rstagnit opened this issue Apr 9, 2024 · 2 comments
Open

InferCNV from STdeconvolve data #53

rstagnit opened this issue Apr 9, 2024 · 2 comments

Comments

@rstagnit
Copy link

rstagnit commented Apr 9, 2024

Hi @bmill3r ,

Thank you for developing this tool. Deconvolution and annotation have worked well for me thus far. I am now looking to attempt to do CNV analysis on the deconvolved cell types (I have already done them on the non-deconvolved samples) as a form of comparison and achieving more granularity. I am struggling with creating the initial infercnv Object. I'm not sure your familiarity with the package, so I'll briefly detail what is needed.

In order to create an infercnv object, I need a raw_counts_matrix which is the matrix of genes (rows) vs. cells (columns) containing the raw counts. If I am correct, I can use the corpus for my raw counts matrix so that isn't an issue.

You then need an annotations_file which for me is typically a matrix of the barcodes (rows) vs Seurat clusters (columns).
The goal would be to have the annotations file be the barcodes corresponding to each cell type. I know the results$theta are proportions of each cell type for each spot, so I am having difficulty trying to circumvent that issue either by making each barcode correspond to the cell type of the highest proportion, or another idea if you have one.

I greatly appreciate any assistance you can give.

Thanks,
Rob

@bmill3r
Copy link
Collaborator

bmill3r commented Apr 13, 2024

Hi @rstagnit,

Thanks for your question and your interest in using STdeconvolve. I am not familiar with infercnv but if I understand correctly, you are interested in performing CNV analysis on the multi-cellular spots. If so, then in terms of the inputs you are asking for, using the raw counts from the corpus seems reasonable. For cell type annotations, for infercnv, it only allows for one cell type label per capture location, correct? If so, then using the deconvolved cell type at the highest proportion per spot is probably the simplest strategy. What is the distribution of deconvolved cell type proportions for your spots? Do you have a sense of the multi-cellular resolution of your data? For example, are your spots large enough to contain only a few cells, or are they big enough to contain many cells? Thinking about these characteristics of your data might help determine if using the highest proportion is reasonable.

Sorry I can't be more helpful,
Brendan

@floriankreten
Copy link

Hi @rstagnit , did you make any progress? The approach sounds quite interesting. That said, the output of stdeconvolve is not suitable for a follow-up analysis with InferCNV, since it does not provide the necessary "raw-count-estimate" (except that you find out which spots can be integrated into your analysis, since they contain a large proportion of cell-type X). You might want to try other deconvolutional tools.

BR Florian

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants