adding proteins to existing clusters #24

Closed Answered by micans
nohayoussef asked this question in Q&A
Hi, the normal procedure is indeed to start over and re-cluster everything. This is also what you want most of the time: the clustering then takes into account how everything relates to everything else, rather than depending on, or extending from, a foothold in a subset of the data.
Computationally this can be quite painful, of course. In your case, the step from 800 to 2000 is a big one; it would be different if it were from 2000 to 2001.
Second, the mcl software has no real support for performing such an extension step. It would be quite fiddly and involve a lot of choices. I do believe it is something people have done in the past (mapping new genomes onto existing clusters), but t…
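
To make the "lot of choices" concrete, here is a minimal sketch of one possible extension step, written in Python. It is not part of mcl and not a method described in this thread. The function name `assign_to_clusters`, the input layout, and the rule used (assign each new protein to the existing cluster with the highest summed similarity) are all assumptions made for illustration. Other rules, such as best single hit or an average normalised by cluster size, would be equally defensible and can give different answers.

```python
from collections import defaultdict


def assign_to_clusters(clusters, similarities, min_score=0.0):
    """Map new proteins onto an existing clustering (hypothetical helper).

    clusters:     dict mapping cluster id -> set of existing protein ids
    similarities: iterable of (new_id, existing_id, score) triples,
                  e.g. parsed from a similarity search of new vs. old proteins
    min_score:    summed score a cluster must exceed to accept a new protein

    Returns a dict mapping each new protein id to a cluster id, or to None
    when no existing cluster collects enough similarity.
    """
    # Reverse index: which cluster does each existing protein belong to?
    member_to_cluster = {
        member: cid for cid, members in clusters.items() for member in members
    }

    # Sum the similarity of every new protein to every existing cluster.
    totals = defaultdict(lambda: defaultdict(float))
    new_ids = set()
    for new_id, existing_id, score in similarities:
        new_ids.add(new_id)
        cid = member_to_cluster.get(existing_id)
        if cid is not None:  # ignore hits to proteins that were never clustered
            totals[new_id][cid] += score

    assignment = {}
    for new_id in new_ids:
        per_cluster = totals.get(new_id)
        if not per_cluster:
            assignment[new_id] = None
            continue
        # Ties are broken by whichever cluster was encountered first.
        best_cid, best_total = max(per_cluster.items(), key=lambda kv: kv[1])
        assignment[new_id] = best_cid if best_total > min_score else None
    return assignment


if __name__ == "__main__":
    existing = {"c1": {"a", "b"}, "c2": {"c"}}
    hits = [("x", "a", 0.5), ("x", "b", 0.4), ("x", "c", 0.8)]
    print(assign_to_clusters(existing, hits))
```

Note that a scheme like this never merges, splits, or creates clusters, which is exactly the dependence on a foothold in a subset of the data that a full re-clustering avoids.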


This discussion was converted from issue #23 on November 14, 2023 15:37.