Socioeconomic Privilege and Political Ideology are Associated with Racial Disparity in COVID-19 Vaccination: Methods and Materials

This public repository contains the materials for reproducing the results described in Agarwal et al. (2021) Socioeconomic Privilege and Political Ideology Are Associated with Racial Disparity in COVID-19 Vaccination and additional supplementary analyses.

Content Description

Materials for reproducibility include:

COVID-19 vaccination rate data and Python code to reproduce the data collection, including:

a) The notebook 1.CountyVaccine_Automation includes the code to collect the county-level vaccination information by race from the States whose vaccination data is oragnized in a downlable table. In this notebook, the Python code can automatically scrape the data. The States include： Illinois, Texas, Pennsylvania, Indiana, and Virginia.

b) The notebook 1.CountyVaccine_Tableau is designed to collect the county-level vaccination information by race from the States whose vaccination information is present in a Tableau Dashboard format. In this notebook, the Python code can also automatically scrape the data. The States include：New York, Wisconsin, Ohio, South Carolina, and Oregon.

c) The notebook 1.CountyVaccine_Manual is developed to collect the county-level racial vaccination information from the States whose vaccination information needs to be collected manually before running the code. These States include: California, Tennessee, North Carolina, West Virginia, Maine, and New Jersey. The instructions on manual collections are documented here.
Data and Python code to merge information from the various sources cited in our Supplementary Information (SI) Appendix.
Python code for cleaning the data.
Clean data and code to reproduce our main regression analyses (reported in main text) and robustness checks (reported in SI Appendix) as well as additional supplementary analyses reported here.

Below, we also provide additional summary statistics, exploratory data analysis, and full results for the robustness checks described in the SI appendix.

Main Regression

Below, we provide the source code for regression table presented in Agarwal et al. (2021).

You can get the stata code to do this regression by:

python statacode.py --task main_regression 
python statacode.py --task main_regression_originalX

or directly check the data and stata code in the folder StataCode/main_regression code and StataCode/main_regression_originalX code.

Detailed Regression Results and Robustness Checks

Different Age Group Controls

Below, we provide the source codes to produce regression tables by controlling for proportion of population above age 75 and disparities in the proportion of population above age 75 for the White and Black population. We add this variable as a control to account for the fact that older adults were prioritized early on in the vaccine rollout. In addition, we add additional control variables to account for the population that was eligible for the vaccines. Based on available demographic data, we approximate the vaccine eligible population by controlling for the proportion of population ages 15-74 in one set of analyses and the proportion of population ages 20-74 in a second set of analyses.

You can get the stata code to do this robustness check by:

python statacode.py --task check_age_all 
python statacode.py --task check_age_above15 
python statacode.py --task check_age_above20

or directly check the data and stata code in the folder StataCode/check_age_all code, StataCode/check_age_above15 code, and StataCode/check_age_above20 code.

Different Disparity Operationalizations

As additional robustness checks, we also model alternative operationlizations of disparity. Specifically, we model a ratio-based definition (White Vaccination Rate/Black Vaccination Rate), the log of that ratio, and an outcome that scales the absolute disparity by the overall vaccination rate of the White and Black populations in a given county.

You can get the stata code to do this robustness check by:

python statacode.py --task check_disparity_types

or directly check the data and stata code in the folder StataCode/check_disparity_types code.

Different Dates and Full Vaccination Rate Types

We compiled data from multiple time points (March 27, April 07, and May 20, 2021) to compare against our main findings based on data from April 19, 2021. In addition, we checked the same model using full vaccination data from May 20, 2021 to explore whether our pattern of findings still hold.

You can get the stata code to do this robustness check by:

python statacode.py --task diff_dates

or directly check the data and stata code in the folder StataCode/diff_dates code.

Residential Mobility

Some regions saw large rates of residential mobility (people moving in or out) during the course of the pandemic. To account for this, we collected data on areas that saw the greatest movement during the pandemic based on data from 75,000 moves (HireAHelper Migration Report, 2021). The list includes 10 cities with the greatest net increase in movement and the 10 cities with the greatest net decrease in movement, some of which are not represented in the counties included in our analysis. We exclude the 12 relevant counties represented in our data。

You can get the stata code to do this robustness check by:

python statacode.py --task residential_mobility

or directly check the data and stata code in the folder StataCode/residential_mobility code.

Recent Positive Rate per COVID-19 Test

We include a variable measuring recent positivity rate (April 12-April 19).

You can get the stata code to do this robustness check by:

python statacode.py --task recent_positive_rate

or directly check the data and stata code in the folder StataCode/recent_positive_rate code.

Avoid Collinearity by Droping `Hesitancy` Variable

We reviewed the Variable Inflation Rate (VIF) for our main regression model, finding that vaccine hesitancy had a VIF that exceeds the suggested cut-off value of 10. To assess how much the multicollinearity may have an impact on our findings, we try models excluding vaccine hesitancy.

You can get the stata code to do this robustness check by:

python statacode.py --task avoid_collinearity

or directly check the data and stata code in the folder StataCode/avoid_collinearity code.

Subsample Analysis (6 States and 10 States)

We split our sample into two subgroups, those that provide estimates for non-Hispanic White vaccination rates specifically (six states) and those that do not (ten states).

You can get the stata code to do this regression by:

python statacode.py --task subsample_analysis_6_and_10_states

or directly check the data and stata code in the folder StataCode/subsample_analysis_6_and_10_states code.

Vaccination Rate on Whole White Population

In this robustness check, we treat all the State in the same way in terms of calculate the COVID-19 White Vaccination Rate: Reported-CvdVax-White / Total-White-Population. Then we ran the models with different covariates.

You can get the stata code to do this robustness check by:

python statacode.py --task vax_rate_on_all_white

or directly check the data and stata code in the folder StataCode/vax_rate_on_all_white code.

Name		Name	Last commit message	Last commit date
Latest commit History 152 Commits
CountyVaccine		CountyVaccine
DataMerge		DataMerge
StataCode		StataCode
StataReg		StataReg
.gitignore		.gitignore
1.CountyVaccine_Automation.ipynb		1.CountyVaccine_Automation.ipynb
1.CountyVaccine_Manual.ipynb		1.CountyVaccine_Manual.ipynb
1.CountyVaccine_Tableau.ipynb		1.CountyVaccine_Tableau.ipynb
2.DataClean.ipynb		2.DataClean.ipynb
2.DataMerge.ipynb		2.DataMerge.ipynb
3.StataCode.ipynb		3.StataCode.ipynb
4.Visualization.ipynb		4.Visualization.ipynb
README.md		README.md
plotcoef.do		plotcoef.do
statacode.py		statacode.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Socioeconomic Privilege and Political Ideology are Associated with Racial Disparity in COVID-19 Vaccination: Methods and Materials

Table of Contents

Content Description

Main Regression

Detailed Regression Results and Robustness Checks

Different Age Group Controls

Different Disparity Operationalizations

Different Dates and Full Vaccination Rate Types

Residential Mobility

Recent Positive Rate per COVID-19 Test

Avoid Collinearity by Droping `Hesitancy` Variable

Subsample Analysis (6 States and 10 States)

Vaccination Rate on Whole White Population

About

Releases 1

Packages

Contributors 4

Languages

CHIDS-UMD/Covid19-Vaccination-Race-Disparity-Publish

Folders and files

Latest commit

History

Repository files navigation

Socioeconomic Privilege and Political Ideology are Associated with Racial Disparity in COVID-19 Vaccination: Methods and Materials

Table of Contents

Content Description

Main Regression

Detailed Regression Results and Robustness Checks

Different Age Group Controls

Different Disparity Operationalizations

Different Dates and Full Vaccination Rate Types

Residential Mobility

Recent Positive Rate per COVID-19 Test

Avoid Collinearity by Droping Hesitancy Variable

Subsample Analysis (6 States and 10 States)

Vaccination Rate on Whole White Population

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 4

Languages

Avoid Collinearity by Droping `Hesitancy` Variable

Packages