Issue passing an output directory path in a blob container as a command-line argument #643

Open
mdanner93 opened this issue Apr 19, 2023 · 0 comments

mdanner93 commented Apr 19, 2023

Hi everyone,

I set up CromwellOnAzure, but I have a problem passing a directory path as an output location in a workflow,
e.g. when running a GenomicsDBImport task on Azure:

version 1.0

workflow GenomicsDBImport {
    input {
    File input_patient
    File input_mother 
    File input_father
    File bed_file
    Array[File] reads
    String output_dir
    String gatk_docker = "broadinstitute/gatk:latest"
    String gatk_path = "/gatk/gatk"
    }
    

    call DBImport { 
        input:
            input_patient = input_patient,
            input_mother = input_mother,
            input_father = input_father,
            bed_file = bed_file,
            reads = reads,
            output_dir = output_dir,
            docker = gatk_docker,
            gatk_path = gatk_path
    }
}

task DBImport {
    input {
    # Command Parameters
    File input_patient
    File input_mother 
    File input_father
    File bed_file
    Array[File] reads
    String gatk_path
    String output_dir
    
    # Runtime Parameters
    String docker
    Int addtional_disk_space_gb = 10
    Int machine_mem_gb = 64
    Int preemptible_attempts = 3
    
    Int disk_space_gb = 40 + addtional_disk_space_gb
    }

    command {
        ~{gatk_path} --java-options "-Xmx32g -Xms32g" GenomicsDBImport \
        -V ~{input_patient} \
        -V ~{input_mother} \
        -V ~{input_father} \
        --genomicsdb-workspace-path ~{output_dir} \
        --read-index ~{sep="," reads} \
        -L ~{bed_file} --merge-input-intervals true \
        --max-num-intervals-to-import-in-parallel 70000 \
        --verbosity DEBUG
    }

    runtime {
        docker: docker
        memory: machine_mem_gb + " GB"
        disk: disk_space_gb + " GB"
        preemptible: true
        maxRetries: preemptible_attempts
    }

    output {
        String output_db = "~{output_dir}"
    }

}

Leads to the following error:

A USER ERROR has occurred: Error creating GenomicsDB workspace: https://mydatalake.blob.core.windows.net/genomicsdb/test/genomicsdb

I am really looking forward to your help.

Many thanks in advance!
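
For reference, here is a rough sketch of one workaround that might apply. It is not a confirmed fix. It assumes the error comes from GATK being handed an https:// blob URL as the workspace path, which has not been verified here. The idea is to create the workspace in the task's local working directory, tar it, and declare the tarball as a File output so that the execution engine copies it to storage. The task name DBImportLocalWorkspace and the workspace_name input are hypothetical; the GATK arguments are otherwise unchanged from the task above.

```wdl
version 1.0

# Sketch only. Assumption: writing the workspace to local disk and exporting it
# as a tarball avoids passing a blob URL to --genomicsdb-workspace-path.
task DBImportLocalWorkspace {
    input {
        # Command parameters (same as the original task, except workspace_name)
        File input_patient
        File input_mother
        File input_father
        File bed_file
        Array[File] reads
        String gatk_path
        # Hypothetical: name of a directory created inside the working directory
        String workspace_name = "genomicsdb"

        # Runtime parameters
        String docker
        Int machine_mem_gb = 64
        Int disk_space_gb = 50
    }

    command <<<
        set -euo pipefail

        # Same GATK call as before, but the workspace is a local relative path
        ~{gatk_path} --java-options "-Xmx32g -Xms32g" GenomicsDBImport \
            -V ~{input_patient} \
            -V ~{input_mother} \
            -V ~{input_father} \
            --genomicsdb-workspace-path ~{workspace_name} \
            --read-index ~{sep="," reads} \
            -L ~{bed_file} --merge-input-intervals true \
            --max-num-intervals-to-import-in-parallel 70000 \
            --verbosity DEBUG

        # Bundle the workspace directory into a single file for the output section
        tar -cf ~{workspace_name}.tar ~{workspace_name}
    >>>

    runtime {
        docker: docker
        memory: machine_mem_gb + " GB"
        disk: disk_space_gb + " GB"
        preemptible: true
    }

    output {
        # A File output is copied out by the engine; a String would only echo the path
        File output_db = "~{workspace_name}.tar"
    }
}
```

A downstream task would then take output_db as a File input and untar it before pointing GATK at the extracted directory. Disk size may need to be increased, since the workspace and its tarball both sit on local disk at the same time.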

@mdanner93 mdanner93 changed the title Not able to add and use new containers Problems of using Directory Type as an output path Apr 20, 2023
@mdanner93 mdanner93 changed the title Problems of using Directory Type as an output path Issue passing a directory path as an command line argument and output Apr 20, 2023
@mdanner93 mdanner93 changed the title Issue passing a directory path as an command line argument and output Issue passing an output directory path to blob container as an command line argument Apr 21, 2023