Identify what factors from the World Bank API database relate to the vaccination percentage for countries around the world.
Kaggle CSV of world vaccination progress cribbed from the Our World in Data Github Repository: https://www.kaggle.com/gpreda/covid-world-vaccination-progress
API which pulls from World Bank for Global Development Indicators:
https://data.worldbank.org/
ISO conversion for country names:
https://gist.github.com/tadast/8827699
All data was extracted with pandas and merged into one working frame for analysis.
After performing ANOVA testing on each Indicator's quartile:
Level of confidence: 95%
H0: All mean vaccination percents over quartiles are equal
Ha: There is at least one mean vaccination percent that is not equal to another
Population Density was the only indicator which rejected the null hypothesis, which would indicate that there is a statistically significant difference in quartiles with a 95% level of confidence.
Other indicators would require more data to reach any further conclusions.
- More indicators can be compared, using a more robust API structure to check for multiyear data and filter for missing values.