Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix cattle-outbreak strain selection #133

Closed
jameshadfield opened this issue Feb 20, 2025 · 1 comment · Fixed by #140
Closed

Fix cattle-outbreak strain selection #133

jameshadfield opened this issue Feb 20, 2025 · 1 comment · Fixed by #140
Labels
bug Something isn't working

Comments

@jameshadfield
Copy link
Member

In the process of fixing #131 (via #132) the genome tree was completely wrong due to filtering out all the strains we actually want! (Screenshot below). The strains were dropped by the clock filter in augur refine. Now that GenoFLU metadata is (or soon will be) available via #126 we should try restricting to B3.13.

Image
@jameshadfield
Copy link
Member Author

The automated run produced (essentially) this tree over the weekend. I've disabled the GitHub action and re-uploaded the https://next.nextstrain.org/avian-flu/h5n1-cattle-outbreak/genome@2025-02-20 dataset as the latest.

jameshadfield added a commit that referenced this issue Feb 24, 2025
The previous approach relied on broad filtering -- minimum date of 2024,
region of North America -- a hardcoded exclude list and a clock filter.
As the diversity of sequences increased the clock-filter became less
effective and ultimately dropped all the desired strains. See
<#133> for more.

We now use GenoFLU constellations and can relax the date and region
filters accordingly. This relaxation didn't result in any
non-North-American samples included but did add one B3.13 genome from
2023: 'A/Goose/USA/23-038138-001-original/2023'.

These changes to filtering will also apply to the D1.1 builds, but
testing indicates no changes.

The segment-level approach is not addressed here, but could be similarly
adjusted to use the GenoFLU matching on the segment level. Specifically,
<https://github.com/nextstrain/avian-flu/pull/138/files> adds the
segment-level annotations and the expanded constellation is:
B3.13 = PA:ea1, HA:ea1, PB1:am4, MP:ea1, NA:ea1, PB2:am2.2, NP:am8, NS:am1.1

Closes #133
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant