Reconstructing SARS-CoV-2 lineages from mixed wastewater sequencing data

This is the repository for running NMF on wastewater sequencing data to determine lineage definitions from mixed samples. Read more in our Scientific Reports paper:

https://www.nature.com/articles/s41598-024-70416-4

Instructions

This describes the workflow for a typical SARS-CoV-2 run (other viruses are similar)

Download or create a run containing Gromstole "coverage" and "mapped" csvs into the data/sars-cov-2/runs directory. A thin wrapper for running the Gromstole alignment on fastqs is provided in the preprocess directory but may require some changes.
Set the virus, number of lineages, fasta name, and runs (including the new run) variables in find_lineages.py. On the first run, all subsequent steps should be uncommented, however the mutation frequency matrix and learned nmf vectors are saved so subsequent runs can comment out the generation steps if the data is unchanged.
Run python find_lineages.py. This will create the mutations frequency matrix, learn the NMF vectors, and create a fasta: data/sars-cov-2/[fasta_name].fasta where fasta_name is specified in find_lineages.py.
(Optional) Analyze the data using a tool like pangolin or nextclade to determine which lineages have been discovered.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
figures		figures
outbreak_lineages		outbreak_lineages
preprocess		preprocess
.gitignore		.gitignore
README.md		README.md
find_lineages.py		find_lineages.py
get_outbreak_lineages.py		get_outbreak_lineages.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reconstructing SARS-CoV-2 lineages from mixed wastewater sequencing data

Instructions

About

Releases

Packages

Languages

Ellmen/derived-wastewater-lineages

Folders and files

Latest commit

History

Repository files navigation

Reconstructing SARS-CoV-2 lineages from mixed wastewater sequencing data

Instructions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages