You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been trying to use your pipeline to align samples that have Drosophila spike-ins. Rather than doing sequential alignment, I generated a combined mouse and Drosophila genome w/ the dmel chromosomes in the format "dm6_{chrom}". I didn't recover any signal along the Dmel genome. The problem seems to be that when you filter out rRNA and chrM, you also pass it through grep '_' -v here:
I edited those lines to remove the grep '_' -v section while still removing the rRNA and chrM reads, and it seems to have fixed the problem. However, I was wondering why that was there. In the mm10 annotation I'm using, none of the chromosomes have '_' in them.
I would also maybe recommend documenting that behavior, as this seems to be a relatively common way of doing spike normalization.
The text was updated successfully, but these errors were encountered:
I have a quick follow-up question that you might already have figured out. I’m also trying to normalize using the Drosophila spike-in, but I ran into a bit of a problem. Since the pipeline seems to remove unaligned reads from the BAM file, I’m having trouble getting the total reads (here the aligned reads are shown as total reads). Could you kindly guide me on how you calculated the scale factor (normalizing factor)? I have human-aligned and drosophila-aligned reads from the pipeline. I’d really appreciate your help!
Hi,
I've been trying to use your pipeline to align samples that have Drosophila spike-ins. Rather than doing sequential alignment, I generated a combined mouse and Drosophila genome w/ the dmel chromosomes in the format "dm6_{chrom}". I didn't recover any signal along the Dmel genome. The problem seems to be that when you filter out rRNA and chrM, you also pass it through
grep '_' -v
here:proseq2.0/proseq2.0.bsh
Line 873 in c3260bd
proseq2.0/proseq2.0.bsh
Line 1166 in c3260bd
I edited those lines to remove the
grep '_' -v
section while still removing the rRNA and chrM reads, and it seems to have fixed the problem. However, I was wondering why that was there. In the mm10 annotation I'm using, none of the chromosomes have '_' in them.I would also maybe recommend documenting that behavior, as this seems to be a relatively common way of doing spike normalization.
The text was updated successfully, but these errors were encountered: