-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Supported opposite-strand protocols (for metagene) #23
Comments
Hi @sarahhcarl , Thank you for your thoughts. I agree that re-aligning and reprocessing is more work than it is worth, especially for large datasets, and especially when dUTP-based and other reverse-strand sequencing methodologies are common. I have written kind of a long post in response (below); the TLDR is, I have been thinking about how to do this for a while, and am taking suggestions on how best to support this. Please read on if you have thoughts! Thanks & best, There are a couple of reasons we haven't included a strand-flipping function yet — even though I actually started working on it two years ago (!), without finishing it — but the main theme is that we haven't figured out the best way to solve the problem, "best" meaning:
If you have thoughts on how best to do this, I welcome your feedback. Here's what I have been considering:
|
Hi Josh, Thanks for the thoughtful reply. After thinking about it some more and reading the For example, it seems like one critical point in the
What about making an additional mapping parameter that would allow this filtering step to either choose reads on the same strand, reads on the opposite strand, or all reads regardless of strand? I don't know how many functions would need this to be implemented (how much code would be duplicated), but to me it seems like a straightforward solution that also supports unstranded protocols. Cheers, |
Hi Sarah, Thank you for your thoughts! I have given this a little bit of work and have a working implementation. I moved the logic you highlighted above into the mapping functions ( I just need to write a few more tests to make sure there are no hidden surprises, and make a few other changes to keep up with updates to Cheers, |
Hi Sarah- |
Hi Josh,
At the moment I don't need it urgently, so I don't think you need to rush -
although maybe other users would find it helpful. Thanks for touching base
again though!
Cheers,
Sarah
…On Wed, Jun 13, 2018 at 2:59 AM Joshua Griffin Dunn < ***@***.***> wrote:
Hi Sarah-
I just wanted to let you know that this hasn't fallen off the radar. I
haven't had a chance to get back to my tests due to work pressures. If it's
still useful, I can push the feature branch containing my implementation up
here, with the caveat that until I'm done building the test datasets &
rewriting the unit tests, the implementation won't be proven
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#23 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ADhDIgN90dwnTJTt0PVP0yR83kQs5Ivjks5t8GPhgaJpZM4So5nV>
.
|
Hi Plastid team,
I have found a lot of the functionality provided by plastid very useful, but I got frustratingly stuck today when I was trying to run metagene count on data originating from a stranded protocol that produces reads on the opposite of the original strand. I know in your FAQ you suggest reverse-complementing the reads and re-aligning them, but honestly I feel like this is a lot more work than necessary, especially for a big project with many samples.
This issue was almost a deal-breaker for me, but then I realized I could get the same effect by simply reversing the strand of all of my window annotations in my ROI file. I did it with sed, but it would seem to be a pretty straightforward feature to add as a parameter in the actual command, either at the stage of generating the ROI or on the fly during counting. Such a feature would be a relatively easy way to expand usage of plastid by supporting a large number of users with this type of data.
Thanks!
Sarah
The text was updated successfully, but these errors were encountered: