Definitions and terminology #23
Replies: 5 comments 26 replies
-
I think we also want to define recombination nodes, coalescent nodes, and CA nodes. Note that we may have a node in a tree that is none of these too. |
Beta Was this translation helpful? Give feedback.
-
Just a thought - would a short overview of other types of phylogenetic networks be within the scope of this paper (e.g. hybridisation networks, tree-child networks, galled trees, level-k networks etc)? I think the differences between these things can seem subtle, and it might be nice to try to collect together definitive descriptions of these similar structures (that may or may not fall under our definition of ARG). |
Beta Was this translation helpful? Give feedback.
-
We previously argued that there was a major distinction between an "edge annotated ARG" (e.g. tree sequence) and a "node annotated ARG" (e.g. the Griffiths-type graph), but @jeromekelleher devised a way to use edge annotations to represent the Griffiths graph. Really, the main distinction is between a local encoding of the breakpoints (the standard for node annotated args), and a global encoding, which is formed when we run the algorithm on the local encoding). We could do with some terms to describe these two forms of ARG encoding, I think. We toyed with the idea of using implicit vs explicit ISTR. |
Beta Was this translation helpful? Give feedback.
-
Yan is right that it's getting very confusing having the same TLA meaning at least three different things. We've moved on a bit from the original takes, so here's some recent thoughts.
Technically, the difference really lies in whether we associated genome coordinate information with nodes (EARG) or edges (GARG), but talking about edge and node annotations feels terribly dry, and distinguishing between events and genomes is much more concrete (and conceptually important I'd argue, but probably not worth getting into that). How we talk about the stochastic process is another issue. I guess we could just call this the ARG (which is reasonable, this this is what it originally meant), if we consistently use EARG and GARG elsewhere? |
Beta Was this translation helpful? Give feedback.
-
We could also do with a terminology for the nodes at and below a recombination. I suggest we call the 2 parent nodes the "recombining nodes" and the child the "recombinant node". Or is the similarity in names too confusing? |
Beta Was this translation helpful? Give feedback.
-
We should be precise about what we mean by various bits of terminology, so here's my effort to lay things out. It's not obvious how this will fit into the narrative of the paper at the moment, but lets see if we can agree on the terms and their definitions here before putting them in.
Genome
The complete set of genetic material that a child inherits from one parent. (What do we say about chromosomes here?)
Individual
A single organism, carrying one or more genomes depending on ploidy. E.g. a diploid individual carries two genomes.
Pedigree
A DAG in which nodes represent individuals and edges represent parent-child relationships between individual organisms.
ARG
A DAG in which nodes represent genomes and edges represent the inheritance of genetic material relative to some set of sample genomes, which has additional annotations to encode the flow of ancestral material within those genomes, so that the trees relating genomes are uniquely defined. The annotations can be either associated with nodes or edges.
Node annotated ARG
An ARG in which nodes with > 1 parent have been annotated to indicate the flow of ancestral material from the child to its parents. In the classical Griffith's approach, recombination nodes have two parents and are annotated with a breakpoint. Conventionally, we then assume that the node has inherited genetic material to the left of this breakpoint from the one parent, and the remainder from the other parent.
Edge annotated ARG
An ARG in which edges are annotated with the set of intervals containing ancestral material.
I'm not sure how formal we want to be about all this, but I think this is the set of concepts we need to work with?
Beta Was this translation helpful? Give feedback.
All reactions