Filter cattle-outbreak using GenoFLU B3.13 #140

Merged: 1 commit into master from james/B3.13 on Feb 24, 2025


jameshadfield
Member

The previous approach relied on broad filtering (a minimum date of 2024 and a region of North America), a hardcoded exclude list, and a clock filter. As the diversity of sequences increased, the clock filter became less effective and ultimately dropped all the desired strains. See #133 for more.

We now use GenoFLU constellations and can relax the date and region filters accordingly. This relaxation didn't pull in any non-North-American samples, but it did add one B3.13 genome from 2023: 'A/Goose/USA/23-038138-001-original/2023'.
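As a rough illustration of genotype-based selection, here is a minimal Python sketch. It assumes the metadata is a TSV with a `strain` column and a `genoflu` column holding the GenoFLU genotype; the column names, the helper `filter_by_genotype`, and the two `A/example/...` rows are hypothetical and are not taken from this PR's actual rules.

```python
import csv
import io


def filter_by_genotype(metadata_tsv, genotype, column="genoflu"):
    """Return strain names whose genotype column equals `genotype`.

    `column` and the `strain` field are assumed names, not confirmed
    against the real metadata used by the build.
    """
    reader = csv.DictReader(metadata_tsv, delimiter="\t")
    return [row["strain"] for row in reader if row.get(column) == genotype]


# Illustrative metadata: only the Goose strain name comes from the PR text;
# the other rows, and all dates, are made up for the example.
example = io.StringIO(
    "strain\tdate\tregion\tgenoflu\n"
    "A/Goose/USA/23-038138-001-original/2023\t2023-XX-XX\tNorth America\tB3.13\n"
    "A/example/cattle/1/2024\t2024-04-01\tNorth America\tB3.13\n"
    "A/example/duck/2/2024\t2024-05-01\tNorth America\tD1.1\n"
)

kept = filter_by_genotype(example, "B3.13")
print(kept)
```

Because selection keys on the genotype rather than on date or region, the 2023 genome is kept without any special-casing, which is the behaviour described above.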

These changes to filtering will also apply to the D1.1 builds, but testing indicates no changes.

The segment-level approach is not addressed here, but could be similarly adjusted to use the GenoFLU matching on the segment level. Specifically, https://github.com/nextstrain/avian-flu/pull/138/files adds the segment-level annotations and the expanded constellation is: B3.13 = PA:ea1, HA:ea1, PB1:am4, MP:ea1, NA:ea1, PB2:am2.2, NP:am8, NS:am1.1
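A segment-level check against this constellation could look roughly like the sketch below. The mapping is copied from the constellation listed above; the function name `segment_matches` and the idea of passing a per-segment annotation string are assumptions for illustration, not the interface added in the linked PR.

```python
# B3.13 constellation as listed in this PR description.
B3_13_CONSTELLATION = {
    "PA": "ea1",
    "HA": "ea1",
    "PB1": "am4",
    "MP": "ea1",
    "NA": "ea1",
    "PB2": "am2.2",
    "NP": "am8",
    "NS": "am1.1",
}


def segment_matches(segment, annotation, constellation=B3_13_CONSTELLATION):
    """Return True if `annotation` is the expected lineage for `segment`.

    Unknown segments never match, so a missing annotation cannot slip through.
    """
    expected = constellation.get(segment.upper())
    return expected is not None and expected == annotation


print(segment_matches("pb2", "am2.2"))  # True
print(segment_matches("HA", "am4"))     # False
```

Matching per segment means a genome could be included in, say, the PB2 build whenever its PB2 annotation is am2.2, whether or not the whole genome is classified as B3.13.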

Closes #133

@jameshadfield
Member Author

Going to merge this now as the changes are relatively minor, the direction's been agreed in Slack, and it allows our automated builds to resume.

@jameshadfield jameshadfield merged commit 8d5cbba into master Feb 24, 2025
6 checks passed
@jameshadfield jameshadfield deleted the james/B3.13 branch February 24, 2025 22:55
@jameshadfield jameshadfield mentioned this pull request Feb 24, 2025
3 tasks
Development

Successfully merging this pull request may close these issues.

Fix cattle-outbreak strain selection