Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bad result because low quality Hi-C library? #119

Open
ptranvan opened this issue Nov 15, 2020 · 1 comment
Open

Bad result because low quality Hi-C library? #119

ptranvan opened this issue Nov 15, 2020 · 1 comment

Comments

@ptranvan
Copy link

Hi, Thanks for your software. I ran SALSA but unfortunately I didn't have satisfying result.

I applied the Arima mapping pipeline and got this statistics before SALSA:

perl $STATS $REP_DIR/$REP_LABEL.bam > $REP_DIR/$REP_LABEL.bam.stats

cat $REP_DIR/$REP_LABEL.bam.stats

All     50585700
All intra       47453287
All intra 1kb   3798461
All intra 10kb  2498813
All intra 15kb  2338248
All intra 20kb  2223609
All inter       3132413

My opinion is that I don't have enough PE for All intra 20kb . I am not sure but what do you think about this library ? ( I have a very good contig assembly of 1.3G species, 750 contigs, busco: 98%)

@ptranvan ptranvan changed the title What is a good Hi-C library? Bad result because low quality Hi-C library? Nov 15, 2020
@ghuryejay
Copy link
Collaborator

What's the input N50 of the assembly? Also, you would want to look at ALL_INTER as those are the inter-contig links that will be used for scaffolding. I do agree that it might be an artifact of the bad library.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants