-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rule plot_stats fails with "OverflowError: value too large to convert to npy_uint32" #36
Comments
Sorry, I have no quick and easy ideas for a fix. One way that should make this script work better on such large datasets could be to exchange As this is not such a long script, and not too complicated, switching the library used for handling the dataframes should not be overly complicated. But unless you already know |
Also, another caveat: switching to |
Thanks for that. I will definitely take a look into |
Fingers crossed! 😅 As a more general solution, some meaningful way of (programmatically) stratifying samples might make sense, i.e. having some sort of annotation column in We don't currently have the capacity to provide something like this, but are always welcoming pull requests and will try to review and merge those quickly. And if you are looking for a more actively maintained snakemake workflow for variant calling, we are putting a lot of effort into this one: |
Hey everyone,
I am trying to run this pipeline with 144 samples so the resulting files are quite big. I managed to get it almost to the end, but the last rule (plots_stats) fails with
OverflowError: value too large to convert to npy_uint32
. I guess, I just have to many rows in mycalls.tsv.gz
to be handled. The complete error log is:any ideas?
The text was updated successfully, but these errors were encountered: