-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flexidot computational time #6
Comments
Hi again |
Hi @paolo002 |
Hi @crimBubble Thanks for your reply. |
Hi @paolo002, thank you for giving FlexiDot a try. We are a bit unsure about where the problem actually resides. Do you get some kind of error message? FlexiDot should give a warning, if parameters are incorrect or files are missing. Maybe you can post the command-line output, so that we can see if there is something wrong. After the run is finished, you should expect "Thank you for using FlexiDot!" to be printed. We regularly use FlexiDot on SMRT reads (up to 50-100 kb) with reasonable run times. Depending on the repetitiveness, it should take something on the minute scale. To just check the command-line/ tool performance, we recommend that you crop your sequence to something short, maybe 5000 bp, and test, if you get the expected output files. Usually, output files comprise text and image files. The dotplot image itself is the last one generated. Alternatively, you can try to analyse your sequences with a longer word size to rule out memory issues, e.g. For your information, we are currently preparing the next FlexiDot release which clearly shortens the runtime for long, repetitive sequences. We are testing it at the moment with the most common commands, and if we do not run into any trouble, it should be online this week. In the future, we would like to parallelize FlexiDot. However, we did not yet find a satisfying library that works cross-platform. In general, we would not recommend using FlexiDot for pseudochromosomes or other sequences in the Mbp scale, as it would simply take too long. Especially for small word sizes this might also raise memory issues. For long sequences, we recommend the use of longer word sizes, maybe with mismatches. There are other tools (such as dgenies), which perform better on super-long sequences. Best wishes, Kathrin and Tony |
Hi Kathryn and Tony I just tried it right now and I increased the word size value -k to 20 as you suggested and it worded. I got the output within few minutes. By the way, my sequence is 600kb so this kind of size should be fine right? |
Hi |
Hi @paolo002 and @crimBubble, we just uploaded the new FlexiDot version. Maybe you want to try, how it performs on your data? It won't address the memory issue, but it will be faster. Best, Kathrin & Tony |
Hi, I'm enjoying using Flexidot, but I had similar issues at first in that it took a long time to run. Then I realized I'd missed the -t option: when I specify that the sequences are protein it runs much quicker (surprisingly it runs to completion even with the wrong sequence type). I guess it might be useful if Flexidot detects whether the -t option is correct for the sequences it reads, and throws an error if not. Thanks for the nice tool! |
Hi
Thanks a lot for developing this toolkit looks really amazing.
However, may I know the computational time and threads needed to obtain the graphs?
Because I am trying it but it seems to be taking long time to even perform a first calculation.
Does it need to be run on a server?
Best Regards
PL
The text was updated successfully, but these errors were encountered: