Skip to content

Commit

Permalink
Add more information on pseudo pooling
Browse files Browse the repository at this point in the history
  • Loading branch information
kathryn1995 committed Mar 28, 2024
1 parent bdc0615 commit d93475e
Showing 1 changed file with 16 additions and 0 deletions.
16 changes: 16 additions & 0 deletions docs/parameters.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,8 @@ has_children: false

# Modifying Parameters

## DADA2

DADA2 infers amplicon sequences exactly and can be tuned depending on your needs. DADA2 is run in the DADA2 module of the pipeline (`DADA2_ANALYSIS`). Below are parameters that you can set to control your output.

|Parameter|Description|
Expand All @@ -24,3 +26,17 @@ Below is an example of how you may use the above parameters on the command line:
```bash
nextflow run main.nf --readDIR /wynton/scratch/data --outDIR /wynton/scratch/results -profile sge,apptainer --target v4 -config conf/custom.config --omega_a 1e-120 --band_size 16 --pool pseudo
```

## Further Information

The default parameter setting for pooling in DADA2 is set to False, meaning that all inference is performed on individual samples. Pseudo pooling, on the other hand, involves an initial round of DADA2 clustering followed by pooling the alleles called for each sample, which are then entered as priors for a second round of pooling. This process aims to 'rescue' low abundance alleles that DADA2 identified as errors if they appear at a higher abundance in another sample in the run. However, while this approach may enhance sensitivity in detecting variants, it also carries the risk of reintroducing false positives, potentially due to common sequencing errors and low-level contamination.

Through extensive analysis using multiple mixture controls from various runs at different sites, we found that activating pseudo pooling increased the sensitivity of our calls. To address the challenge of additional false positives, we adjusted the omega_a value from the default of 1e-40 to 1e-120.

For more detailed information on pseudo pooling, please refer to the documentation available (here)[https://benjjneb.github.io/dada2/pseudo.html]. If you have any questions or need guidance on selecting the most suitable settings for your specific needs, please feel free to reach out to the UCSF team.






0 comments on commit d93475e

Please sign in to comment.