-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HIBF creates a very large index #370
Comments
Hey there! VersionThe version you are using has some major refactorings. That's also why the The results should be the same (unit tests are fine), but I haven't benchmarked the performance yet. EDIT: One bug that I just encountered, and that will be fixed soon is that LayoutIt looks like it will use We will have to investigate why the estimation of the size ( Building RAMThe memory usage looks way too high. This might be due to the Index SizeWhether the HIBF is smaller than the IBF depends on the data and When using 3 levels, this might get worse, depending on the data. Questions/Suggestions
|
An update on this, without specifying |
Hi
I have been trying to build an index of a large collection of microbial genomes (102999) using HIBF and the resulting index is way larger than when I create the same index using IBF.
The raptor version I used:
The layout stat file:
The
prepare
andlayout
andbuild
commands I used:The final index is ~1Tb, and these are the timings of building the index, where it had a peak memory usage of ~3Tb:
The IBF index is ~750G and required a fraction of the memory to build the index. Shouldn't the HBIF be smaller than the IBF index? Any suggestions are much appreciated :-)
Thanks
Antonio
The text was updated successfully, but these errors were encountered: