Consider outputting two TSV files instead of one from combine_peptide_annotations.R #16

taylorreiter · 2024-02-26T19:04:53Z

As currently written, the left join with peptide_predictions duplicates all of the information from the left-joined dataframes for any peptide that appears more than once in peptide_predictions. Outputting two TSV files would avoid this: one with the peptide predictions (in which peptide_id is non-unique) and one with the joined metadata dataframes (in which peptide_id is unique). A single file might be easier to work with in practice, though. We should assess this after running the pipeline a few times. If there is relatively little duplication in peptide_id values, separating the two files might not matter in practice.
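
To make the trade-off concrete, here is a minimal sketch of the duplication and of the two-file alternative. It is written in Python with only the standard library as a stand-in; the actual script is R, and this is not its implementation. Everything except `peptide_id` and `peptide_predictions` (the `tool`, `length`, and `charge` columns, the sample rows, and the helper functions) is a hypothetical placeholder.

```python
import csv
import io

# Hypothetical stand-in for peptide_predictions: peptide_id is NOT unique here.
predictions = [
    {"peptide_id": "pep1", "tool": "toolA"},
    {"peptide_id": "pep1", "tool": "toolB"},
    {"peptide_id": "pep2", "tool": "toolA"},
]

# Hypothetical stand-in for the joined metadata: one entry per peptide_id.
metadata = {
    "pep1": {"length": "12", "charge": "2"},
    "pep2": {"length": "30", "charge": "-1"},
}


def left_join(left_rows, right_by_id):
    """Left join on peptide_id: each left row receives a copy of its metadata."""
    return [{**row, **right_by_id.get(row["peptide_id"], {})} for row in left_rows]


def to_tsv(rows, fieldnames):
    """Serialize a list of dicts to a TSV string with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, delimiter="\t", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Option 1: a single combined file. Metadata for pep1 is written twice.
combined = left_join(predictions, metadata)
one_file = to_tsv(combined, ["peptide_id", "tool", "length", "charge"])

# Option 2: two files. Metadata is written once per peptide_id.
predictions_tsv = to_tsv(predictions, ["peptide_id", "tool"])
metadata_rows = [{"peptide_id": pid, **cols} for pid, cols in metadata.items()]
metadata_tsv = to_tsv(metadata_rows, ["peptide_id", "length", "charge"])

# Rough measure of how much the single file repeats metadata rows.
duplicated_rows = len(combined) - len(metadata_rows)
```

A count like `duplicated_rows`, computed on real pipeline output, is one way to decide whether the split is worth the extra file: if it stays near zero, the single combined TSV costs little.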

taylorreiter · 2024-03-18T18:43:32Z

done in #23

taylorreiter mentioned this issue Feb 26, 2024

Add rule to combine peptide annotation outputs #12

Merged

4 tasks

taylorreiter mentioned this issue Mar 15, 2024

Bug fixes from first run #23

Merged

5 tasks

taylorreiter closed this as completed Mar 18, 2024
