-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support needed arguments in pipeline script #16
Comments
Certainly. I am working on this functionality tonight and should have something soon. |
Awesome! Thanks Vincent. |
All - I have had multiple issues trying to implement this vcf filterer.
Sorry for the slow progress, did not anticipate the amount of difficulty getting this calls to work. |
@vlaufer, for number 1, you most likely need to specify "shell=True" in the call to subprocess.call(). See here: https://docs.python.org/3/library/subprocess.html#replacing-os-system For number 2, perhaps you could drop the current script in a gist so that we can take a look? |
I will find time either after work or weekend for any remaining works. |
@seandavi - I tried both shell="True" and shell="False" I think the issue centers on escaping characters correctly and the syntax required by snpSift. I have been pushing the script to this github repo, it is in the main folder under the name vcf_filter_v0.1.py @eweitz - I do not think there needs to be an option for delivering FASTQ or BAM to the front end at all, but I may be completely missing something. We currently have no way to annotate anything other than a VCF, and also again unless I am missing something we would need separate FASTQ or BAM --> JSON converters. It seems to me that the pipeline has to go FASTQ-->BAM-->VCF-->JSON to make sense... @dauss75 - I think there is actually a lot left to do. We need to get the liftover tool in place, so that no matter what comes in, we output the same format of VCF. We will also need to re-download all the SnpSift databases etc. on wherever the next VM is. |
@vlaufer, agreed -- with the cavaet that users may upload data in FASTQ, BAM or VCF, so we can bypass the FASTQ-to-BAM or BAM-to-VCF conversion in some cases. The Per your comment, I assume that the annotated and filtered VCF file will then need to be converted to JSON prior to being passed into @cjav's Django and @ohjuarez's Solr services. The Django view layer gets its data from @ohjuarez's Solr API (see here) formatted as JSON. The JSON is then converted into a Python dictionary, which gets passed onto a Django template that converts the dict to the HTML the user sees. |
@eweitz - good. I think I just misinterpreted which script you were talking about taking those arguments. we are on the same page. Thanks for bearing with me as I get used to the issue functionality of github. Kindly, Vincent |
@dauss75, @vlaufer, @chris-owen, my Django code will need to call the back-end pipeline you developed last week in order to annotate and filter the user's uploaded NGS data file (e.g., an unannotated VCF file). Could you please add support in your pipeline's entry-point script for the following, and provide an example of how to call it from the command line?
The call from my script will include the following arguments:
Here's how I imagine I would call the script you have at the beginning of your pipeline:
The text was updated successfully, but these errors were encountered: