0 count of count_diffs and count_comparisons and all in count_missing #104

Open
hrluo93 opened this issue Apr 17, 2024 · 11 comments
Labels
help wanted Extra attention is needed

Comments

@hrluo93

hrluo93 commented Apr 17, 2024

Hi,

I generated an all-sites VCF with:
bcftools mpileup -f reference.fa -b bamlist.txt -r chr10 | bcftools call -m -Oz -f GQ -o chr10-bcf.vcf.gz

Then I ran pixy:
pixy --stats pi dxy fst --populations poplist2.txt --window_size 50000 --vcf chr10-bcf.vcf.gz --n_cores 2 --output_prefix chr10gipixy > chr10pixy.log

Every statistic came out as NA: count_diffs and count_comparisons were both 0, and all sites were reported in count_missing.

I have uploaded a small chr18 file that shows the same result.

chr10pixy.log
chr18-bcf.filter.vcf.gz

poplist2.txt

Have I missed a step?

Best!
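
When every site lands in count_missing, one quick sanity check is to count how many genotype calls in the VCF are actually missing. The sketch below does that with awk on a tiny inline VCF; the function names `count_missing_gt` and `toy_vcf` and the sample data are made up for illustration, and it assumes GT is the first FORMAT subfield. If most calls are present in the file, the input is unlikely to be the reason for an all-missing result.

```shell
# Count called vs. missing genotypes in VCF text read from stdin.
# Assumption: GT is the first colon-separated subfield of each sample column.
count_missing_gt() {
  awk -F'\t' '
    /^#/ { next }                      # skip header lines
    {
      for (i = 10; i <= NF; i++) {     # sample columns start at field 10
        split($i, f, ":")              # f[1] holds the GT string
        if (f[1] ~ /\./) missing++     # "./.", ".|." or "." count as missing
        else called++
      }
    }
    END { printf "called=%d missing=%d\n", called, missing }
  '
}

# Toy three-site, two-sample VCF (made-up data, for illustration only).
toy_vcf() {
  printf '##fileformat=VCFv4.2\n'
  printf '#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2\n'
  printf 'chr18\t1\t.\tA\t.\t50\t.\t.\tGT\t0/0\t./.\n'
  printf 'chr18\t2\t.\tC\tT\t50\t.\t.\tGT:PL\t0/1:30,0,40\t1/1:60,9,0\n'
  printf 'chr18\t3\t.\tG\t.\t50\t.\t.\tGT\t./.\t./.\n'
}

toy_vcf | count_missing_gt
# prints: called=3 missing=3
```

On a real file the same function could be fed with `gzip -dc chr18-bcf.filter.vcf.gz | count_missing_gt`, since bgzip output is readable by gzip.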

@smallfishcui

Same here. Did you figure out the reason?

thanks,
Cui

@JeffWeinell

I also have this problem!

@ksamuk
Owner

ksamuk commented Jul 10, 2024

This is a bug, and the current solution is to roll back to pixy 1.2.5 (all calculations are unchanged relative to the current version).
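
One possible way to apply the rollback is a separate conda environment with pixy pinned to 1.2.5. This is only a sketch: it assumes conda is installed and that pixy comes from the conda-forge channel, the helper name `make_pixy_125_env` and the environment name are hypothetical, and the exact version string offered by the channel may carry a suffix, which is why the helper lists the available builds first.

```shell
# Hypothetical helper: create a conda environment with pixy pinned to 1.2.5.
# Assumptions: conda is on PATH and pixy is installed from conda-forge.
make_pixy_125_env() {
  env_name="${1:-pixy-1.2.5}"          # environment name is arbitrary
  conda search -c conda-forge pixy     # list the builds the channel actually offers
  conda create -y -n "$env_name" -c conda-forge "pixy=1.2.5"
  conda list -n "$env_name" pixy       # confirm which version was installed
}
```

Calling `make_pixy_125_env` and then `conda activate pixy-1.2.5` would leave the existing installation untouched, so the newer version stays available for comparison.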

@hrluo93
Author

hrluo93 commented Jul 10, 2024

> Same here. Did you figure out the reason?
>
> thanks, Cui

Hi,
In the end I used ANGSD to calculate Dxy.

@hrluo93
Author

hrluo93 commented Jul 10, 2024

> This is a bug, and the current solution is to roll back to pixy 1.2.5 (all calculations are unchanged relative to the current version).

Thanks for your reply!
I am looking forward to a pipeline for handling hemizygous sex-chromosome regions.

@hrluo93
Author

hrluo93 commented Jul 10, 2024

> I also have this problem!

It seems to be a bug.

@JeffWeinell

Downgrading to pixy 1.2.5 solved it for me, thanks!

@smallfishcui

Thanks for the answers! Downgrading to pixy 1.2.5 worked for me as well!

Cui

@smallfishcui

> > This is a bug, and the current solution is to roll back to pixy 1.2.5 (all calculations are unchanged relative to the current version).
>
> Thanks for your reply! I am looking forward to a pipeline for handling hemizygous sex-chromosome regions.

Would ploidy be a problem with ANGSD?

@hrluo93
Author

hrluo93 commented Jul 11, 2024

> > > This is a bug, and the current solution is to roll back to pixy 1.2.5 (all calculations are unchanged relative to the current version).
> >
> > Thanks for your reply! I am looking forward to a pipeline for handling hemizygous sex-chromosome regions.
>
> Would ploidy be a problem with ANGSD?

Hi!

I guess hemizygous regions would be a problem for both pixy and ANGSD. If your species is diploid, or tetraploid but treated as two diploid subgenomes, it should not be a problem.

For ANGSD, I guess the problem is that all non-SNP sites are treated as invariant sites. That is not a problem for my species because of its low proportion (~10%) of repeat sequences. But if the target species had a high proportion of complex repeats, it would be a problem. See also this opinion (https://x.com/jrossibarra/status/1753102333331042622).
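
The concern about uncalled sites being treated as invariant can be made concrete with a little arithmetic. The sketch below uses made-up numbers and a hypothetical function name; it only illustrates how the denominator changes the estimate, using the count_diffs / count_comparisons ratio that pixy reports, and it is not a description of what ANGSD does internally.

```shell
# Toy numbers (all made up): a 1000-site window in which only 600 sites were
# genotyped, sampled from 2 diploid individuals (4 chromosomes).
pi_with_and_without_missing() {
  awk 'BEGIN {
    n_chrom   = 4
    pairs     = n_chrom * (n_chrom - 1) / 2   # pairwise comparisons per site: 6
    window    = 1000                          # sites in the window
    genotyped = 600                           # sites with genotype calls
    diffs     = 36                            # pairwise differences summed over the window

    # Denominator counts only sites that were actually genotyped.
    printf "pi_genotyped_only=%.4f\n", diffs / (genotyped * pairs)
    # Denominator treats the 400 uncalled sites as if they were invariant.
    printf "pi_missing_as_invariant=%.4f\n", diffs / (window * pairs)
  }'
}

pi_with_and_without_missing
# prints:
# pi_genotyped_only=0.0100
# pi_missing_as_invariant=0.0060
```

With these numbers the estimate drops from 0.0100 to 0.0060 purely because of how the 400 uncalled sites are counted, so the size of the effect scales with the fraction of the window that could not be genotyped.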

@smallfishcui

> > > > This is a bug, and the current solution is to roll back to pixy 1.2.5 (all calculations are unchanged relative to the current version).
> > >
> > > Thanks for your reply! I am looking forward to a pipeline for handling hemizygous sex-chromosome regions.
> >
> > Would ploidy be a problem with ANGSD?
>
> Hi!
>
> I guess hemizygous regions would be a problem for both pixy and ANGSD. If your species is diploid, or tetraploid but treated as two diploid subgenomes, it should not be a problem.
>
> For ANGSD, I guess the problem is that all non-SNP sites are treated as invariant sites. That is not a problem for my species because of its low proportion (~10%) of repeat sequences. But if the target species had a high proportion of complex repeats, it would be a problem. See also this opinion (https://x.com/jrossibarra/status/1753102333331042622).

Thanks for sharing! It seems pixy treats monomorphic sites much more realistically, then. As far as I know, ANGSD uses the site frequency spectrum to calculate pi and theta. I just wonder how much its results differ from pixy's, especially for polyploid data. And is it okay to compare pi between tetraploids and diploids? If someone can help, it would be very much appreciated.

thanks,
Cui
