-
Notifications
You must be signed in to change notification settings - Fork 7
min_hash_distance
Young edited this page Feb 9, 2024
·
5 revisions
Mash dist isn't really built to identify contamination (which is why there are optional Kraken2 and Blobtools subworkflows), but it can quickly look through a large number of references for similar hits. As such, mash dist is being used here to identity possible sequences rather than attempting to characterize the reads. There is a plan to use mash screen instead of mash dist in the future, but the estimated date of completion for that is still very loose.
---
min hash distance
---
flowchart LR
fastq --> mash
contigs --> mash
mash --> A[other subworkflows]
There is a way to supply a custom mash reference file. Instructions are found on a different page in this wiki.
# number of mash dist hits to save
params.mash_max_hits = 25
# specifying a different mash reference file
params.mash_db = ""
-
- amrfinderplus
- bbduk
- blastn
- blobtools_*
- core_genome_evaluation
- circulocov
- datasets_*
- drprg
- elgato
- emmtyper
- fastani
- fastp
- fastqc
- heatcluster
- iqtree2
- kaptive
- kleborate
- kraken2
- mash_*
- mashtree
- mlst
- multiqc
- mykrobe
- panaroo
- pbptyper
- phytreeviz
- plasmidfinder
- prokka
- quast
- seqsero2
- serotypefinder
- shigatyper
- snp_dists
- spades