Skip to content

Commit

Permalink
Clarify reference/named paths
Browse files Browse the repository at this point in the history
  • Loading branch information
jltsiren committed Nov 9, 2023
1 parent 32c9dc2 commit 58e22d6
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion SERIALIZATION.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# GBZ file format

GBZ version 1, GBWTGraph version 3. Updated 2022-01-31.
GBZ version 1, GBWTGraph version 3. Updated 2023-11-09.

Based on Simple-SDS version 0.2.0, GBWT version 5, and Metadata version 2.

Expand Down Expand Up @@ -68,6 +68,10 @@ GBWT path name fields map to GFA W-line fields in the following way:

If the GBWT metadata does not contain sample (contig) names, the integer identifier of the sample (contig) is used instead.

**Note:** The reference paths mentioned here correspond to generic named paths in VG terminology.
VG promotes some samples to reference status by setting GBWT tag `reference_samples` with a space-separated list of sample names (e.g. `GRCh38 CHM13`) as the value.
In GFA format, it encodes the same information in header tag `RS` (e.g. `RS:Z:GRCh38 CHM13`).

## GBWTGraph

**GBWTGraph** represents a bidirected sequence graph induced by a set of paths.
Expand Down

0 comments on commit 58e22d6

Please sign in to comment.