Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (4 seq) & BA.5.6 + S:K444Q, S:E1202Q, M:H125Y (12 seq) #1110

Closed
ryhisner opened this issue Sep 24, 2022 · 6 comments
Labels
BA.5 low priority For issues that are low priority for now

Comments

@ryhisner
Copy link

Description #1

Sub-lineage of: BA.5.2.20
Earliest sequence: 2022-7-25, Philippines — EPI_ISL_14918878
Most recent sequence: 2022-9-7, USA, California — EPI_ISL_15091380
Countries circulating: USA (2), Philippines (1)
Number of Sequences: 3
GISAID Query: Spike_K444Q, NS8_W45L, NSP13_T127N
CovSpectrum Query: Nextcladepangolineage:BA.5.2.20* & S:K444Q
Substitutions on top of BA.5.2.20:
Spike: K444Q (+ F186V in the two USA seq)
ORF8: W45L
Nucleotide: A22892C, G28027T (plus T12457C & T22118G in two USA seq)

USHER Tree
https://nextstrain.org/fetch/raw.githubusercontent.com/ryhisner/jsons/main/subtreeAuspice1_genome_412e7_f3e350%E2%80%94BA.5.2.20%2BK444Q%2BORF8W45L.json

image

Evidence
S:K444Q has emerged in singlets numerous times in the past few months, but these two microlineages are the first to contain multiple sequences. According to the Bloom Lab RBD ACE2 Affinity/RBD Expression Heat Map on a BA.2 background, K444Q has the highest ACE2 affinity and the highest RBD expression (a proxy measure for stability) of any K444 mutation. (I’ve transcribed the values for the change in ACE2 binding and RBD expression for each K444X mutation in the image below.) https://jbloomlab.github.io/SARS-CoV-2-RBD_DMS_Omicron/RBD-heatmaps/

image

Furthermore, according to the deep mutational scanning (DMS) done by Yunlong Cao, “Imprinted SARS-CoV-2 humoral immunity induces converging Omicron RBD evolution,” K444Q is second only to K444E in evading antibodies from breakthrough BA.2- and BA.5-infection sera.

image

It's of course possible that other factors not accounted for by DMS and laboratory measurements of ACE2 affinity and RBD expression might make K444Q less advantageous in the real world than it appears in experiments, but I think it's a mutation worth monitoring in any case. The S:K444 residue appears to be the most important escape residue against a variety of different polyclonal sera from vaccination, infection, and vaccination + infection. Below is the Bloom Lab RBD Escape Calculator's map of escape mutations for those infected one with pre-Omicron SARS-CoV-2 then by BA.2.

image

Genomes

Genomes EPI_ISL_14918878, EPI_ISL_15002185, EPI_ISL_15091380

Description #2

Sub-lineage of: BA.5.6
Earliest sequence: 2022-8-31, USA, New York — EPI_ISL_15015729, EPI_ISL_15065199 (separate patients)
Most recent sequence: 2022-9-6, USA, New Jersey — EPI_ISL_15065199
Countries circulating: USA (3)
Number of Sequences: 3
GISAID Query: Spike_K444Q, Spike_E1202Q, M_H125Y
CovSpectrum Query: Nextcladepangolineage:BA.5.6* & S:K444Q & M:H125Y
Substitutions on top of BA.5.6:
Spike: K444Q, E1202Q
M: H125Y
Nucleotide: C19269T, T21357C, A22892C, G25166C, C26895T

USHER Tree
https://nextstrain.org/fetch/raw.githubusercontent.com/ryhisner/jsons/main/subtreeAuspice1_genome_205f_f41060%E2%80%94BA.5.6%2BK444Q%2BE1202Q%2BMH125Y.json

image

Evidence
See above for evidence on the K444Q mutation. These three sequences also have S:E1202Q, a somewhat convergent S2 mutation, and M:H125Y, which is notable for being identified as one of only two non-spike mutations that occurred more than twice in the patients examined by @thomaspeacock and Nick Loman in their study of immunocompromised patients who were repeatedly sequenced during chronic infections, “Recurrent SARS-CoV-2 mutations in immunodeficient patients.” https://academic.oup.com/ve/article/8/2/veac050/6661028

image

M:H125Y frequency was 100-fold higher among patients in their study than in the UK sequences at the time (occurring in 1 in 7 patients in their study compared to 1 in 721 patients in the COG-UK dataset). In fact, M:H125Y was the third most frequent mutation (tied with deletions in the S:138 region) found in this study, behind only S:E484K and E:T30I.

image

M:H125Y has occurred repeatedly in highly mutated singlets, and was also found in the B.1.1.482, B.1.616, AV.1, B.1.177.72, B.1.1.269 (which had the most M:H125Y sequences with 555), and AY.4.16 variants, but this is the first time, to my knowledge, that it has been detected in multiple patients in any Omicron lineage.

Genomes

Genomes EPI_ISL_14914401, EPI_ISL_15015729, EPI_ISL_15065199
@ryhisner ryhisner changed the title Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L and BA.5.6 with S:K444Q, S:E1202Q, M:H125Y Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (3 seq) and BA.5.6 with S:K444Q, S:E1202Q, M:H125Y (3 seq) Sep 24, 2022
@FedeGueli
Copy link
Contributor

4 sequences as today for lineage 1: Spike_K444Q, NS8_W45L, NSP13_T127N

@ryhisner
Copy link
Author

Yes, and the newest one also has S:F186V and is geographically distant from the other three. It's from Massachusetts (USA), and the other three have been from the Philippines, Texas (USA), and Pennsylvania (USA).

@ryhisner
Copy link
Author

Three new sequences for the S:K444Q + S:E1202Q + M:H125Y lineage were uploaded today from New York state, making a total of seven sequences now. EPI_ISL_15296998, EPI_ISL_15312328, EPI_ISL_15312346

image

@corneliusroemer
Copy link
Contributor

@ryhisner please update the numbers in the issue title, as it seems to have increased. I classify this as low priority for now because it's a) small and b) likely less competitive than BQ.1.1 and XBB

@corneliusroemer corneliusroemer added the low priority For issues that are low priority for now label Oct 11, 2022
@ryhisner ryhisner changed the title Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (3 seq) and BA.5.6 with S:K444Q, S:E1202Q, M:H125Y (3 seq) Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (4 seq) and BA.5.6 with S:K444Q, S:E1202Q, M:H125Y (10 seq) Oct 18, 2022
@ryhisner
Copy link
Author

The S:K444Q + S:E1201Q + M:H125Y lineage is now up to 12 sequences, including the first outside of New York (excepting one from neighboring New Jersey): one sequence each from New Hampshire; Washington, DC; and Quebec, Canada. It remains mostly a curiosity but has now at shown some geographical spread and appears to be growing in numbers, if slowly.
Genomes

Genomes EPI_ISL_14914401, EPI_ISL_15015729, EPI_ISL_15065199, EPI_ISL_15222535, EPI_ISL_15296998, EPI_ISL_15312328, EPI_ISL_15312346, EPI_ISL_15324098, EPI_ISL_15351444, EPI_ISL_15393750, EPI_ISL_15409949, EPI_ISL_15410029

@ryhisner ryhisner changed the title Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (4 seq) and BA.5.6 with S:K444Q, S:E1202Q, M:H125Y (10 seq) Sublineages of BA.5.2.20 with S:K444Q, ORF8:W45L (4 seq) & BA.5.6 + S:K444Q, S:E1202Q, M:H125Y (12 seq) Oct 18, 2022
@corneliusroemer
Copy link
Contributor

GISAID query just yields 4 for me. Do you have a current Usher tree?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BA.5 low priority For issues that are low priority for now
Projects
None yet
Development

No branches or pull requests

4 participants