Unable to use publishDir to publish a large directory #4862
Unanswered
sbalgobin94
asked this question in
Q&A
Replies: 1 comment 3 replies
-
Hi Samantha, a few probing questions:
|
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am using publishDir to copy an output directory (that has subdirectories) from my Cellranger process to a specified directory. For smaller output directories, this works, but not for a larger one (>50G total). I am using the Slurm executor and using a shared file system.
What I've tried so far:
I have cleared the cache
Changed my OpenJDK to Corretto (it was previously JetBrains)
Tried another file system
Tried other runs of the same size from the same directory
Tried smaller runs (successful)
Tried to gzip the output folder, doesn't help
Here is my nextflow script:
Here is my config file:
This is the ending portion of a log file for a run in which publishDir was SUCCESSFUL:
However, I do not see this ending portion for the runs whose results do not get published as indicated in this log file. These are the last few lines and that's it:
There is nothing in .command.err.
It looks like the publishDir thread pool is not being created for these larger runs. I'm at a loss for where to go from here. Even if I used saveAs, there are different files in each subdirectory and I want to keep things organized. I want to keep the entire folder from the Cellranger process so that the user can navigate to it whenever they need to.
Beta Was this translation helpful? Give feedback.
All reactions