-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can I use paired-end reads with singlesketch
?
#571
Comments
I am skeptical that singlesketch takes stdin - @bluegenes? What happens when you put the reads in an intermediate file and then sketch that file? |
feels like #535 might help out here. A short term fix if you really want to avoid creating an intermediate file might be to use a named pipe - https://stackoverflow.com/questions/4113986/example-of-using-named-pipes-in-linux-shell-bash |
manysketch can also explicitly take in paired end, and you can do it one pair at a time. |
(that is, you can create a single-row CSV for each pair of files and it will work just fine) |
I am only concerned about the amount of total diskspace the workflow would require to run manysketch in that fashion. |
The docs say yes -- @mr-eyes? It's possible #523 broke something, but all tests are passing... |
if you run |
Ok looking into it a little more, stdin should work, but there are no tests in place to check it. Best to confirm the file is complete with @ctb's strategy above first, then we can look into the potential issue |
Everything looks right to me for this command.
|
Ok I think needletail has |
I am currently waiting on this command to finish using
|
I am finding the same result with Attempt 1:
attempt 2:
|
I just tried combining all the fastq files into a single intermediate:
And I am still finding only 1 hash:
|
When using a standard fastq file I am still getting a single hash back:
|
@ccbaumler can you run |
Nice, I should have thought of that! For the
And for the
|
yep -- those look like fine signatures, this is an issue with the manifest from |
Thanks for helping me figure that out. Is there anyway of manually updating a sourmash signature manifest? |
can you unzip one of those files and use less or cat to put the If you're only doing one or two zips, you could edit that file and then rezip all the files together. Not an easy fix for many |
|
ok, thinking about it, this is probably an issue for all our recent databases. @ctb it's probably not too hard to update the manifests using sourmash python layer zip foo, right? |
no problem at all! |
I am attempting to use sourmash sketch for thousands of pair-end fastq files, but it is currently the bottleneck of the workflow. I tried doing something like:
This has resulted in sketches of 1 hash:
Can I use singlesketch with paired end reads in another way?
The text was updated successfully, but these errors were encountered: