FlatGFA: Set up for in-place mutation #155

sampsyo · 2024-03-18T11:50:11Z

There is no actual externally useful change here, but this changes the file format to allow for "gaps," i.e., to have more space for elements in the GFA representation than we have actual values. This way, we can feasibly mutate graphs in place (adding and removing elements), up to the limit of the pre-allocated regions.

To make this work, I used the tinyvec crate's SliceVec type, which works like Vec but is backed by a fixed-size slice (which may, in our case, come from an mmap'd file). Some type and lifetime trickery was required to make our backing stores polymorphic over whether they use SliceVec or Vec, but now they are. (The new Pool abstraction covers both cases.)

So far, this is hooked up for reading only. Next, I will add pre-allocated file emission, which will require guessing the necessary sizes.

Just a more organized way to treat vectors as pools, for now...

sampsyo added 10 commits March 17, 2024 18:45

Store separate capacities in Toc

5654b42

Some comments

ef0f3da

Cool new Pool trait

e6b9ad1

Just a more organized way to treat vectors as pools, for now...

Split off Pool module

553fd07

Parameterize the pool type

3780823

Start using SliceVec

a4bac8b

Lifetime heroics for a SliceStore

98e90e8

Attempt to read SliceVecs

a72fefa

Optionally use SliceVec loader

58a197c

Test -m flag

12329aa

sampsyo merged commit f73db4c into main Mar 18, 2024
3 checks passed

sampsyo deleted the polbin-prealloc branch March 18, 2024 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FlatGFA: Set up for in-place mutation #155

FlatGFA: Set up for in-place mutation #155

sampsyo commented Mar 18, 2024

FlatGFA: Set up for in-place mutation #155

FlatGFA: Set up for in-place mutation #155

Conversation

sampsyo commented Mar 18, 2024