You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The second cell generates two dictionaries for storing details of clades:
"clade_analyses_CC_d", for one mutation clades; and
"clade_analyses_AB_d", for two mutation clades.
Under "A/B analysis" in the function clade_analysis_updated, two mutation clades are tested against topology requirements. For each run, the largest two mutation clade is identified in the list "clade_sizes" and tested against size criteria. Its index is stored as "max2mutCladeLoc". The length of the corresponding entry in the list "subclade_sizes" (i.e. len(subclade_sizes[max2mutCladeLoc]) or, if there is only one clade, len(subclade_sizes[0]) ) is then used to check if that clade has enough direct descendants to be considered a polytomy.
The list "clades_sizes" is assigned values with the code:
The latter causes the polytomy test to use the number of subclades of a one mutation clade that has nothing to do with the two mutation clade being tested.
This seems to be a straightforward copy-paste error that can be corrected by replacing "CC" with "AB". However, the correction will significantly increase the proportion of two mutation clades that meet the topology requirements (since polytomies and large clades sizes are correlated), so that the Bayes' factors are substantially reduced. For the Main analysis, the Bayes' factor is reduced by a factor of 6.
I've made a branch, "bugfixes", with the correction, but you may prefer to implement the fix independently, since it is small.
A published correction may be necessary.
The text was updated successfully, but these errors were encountered:
The second cell generates two dictionaries for storing details of clades:
Under "A/B analysis" in the function clade_analysis_updated, two mutation clades are tested against topology requirements. For each run, the largest two mutation clade is identified in the list "clade_sizes" and tested against size criteria. Its index is stored as "max2mutCladeLoc". The length of the corresponding entry in the list "subclade_sizes" (i.e. len(subclade_sizes[max2mutCladeLoc]) or, if there is only one clade, len(subclade_sizes[0]) ) is then used to check if that clade has enough direct descendants to be considered a polytomy.
The list "clades_sizes" is assigned values with the code:
The list "subclade_sizes" is assigned values with the code:
The latter causes the polytomy test to use the number of subclades of a one mutation clade that has nothing to do with the two mutation clade being tested.
This seems to be a straightforward copy-paste error that can be corrected by replacing "CC" with "AB". However, the correction will significantly increase the proportion of two mutation clades that meet the topology requirements (since polytomies and large clades sizes are correlated), so that the Bayes' factors are substantially reduced. For the Main analysis, the Bayes' factor is reduced by a factor of 6.
I've made a branch, "bugfixes", with the correction, but you may prefer to implement the fix independently, since it is small.
A published correction may be necessary.
The text was updated successfully, but these errors were encountered: