This project provides an in-depth analysis of the COVID-19 pandemic's impact across South Asia and the Middle East. It explores trends, vaccination progress, statistical relationships, and predictive insights, offering visualizations and actionable insights.
- COVID-19 Trends: Analyze total cases, positivity rates, and ICU patient trends.
- Vaccination Progress: Investigate the relationship between GDP, population density, and vaccination rates.
- Statistical Testing: Perform hypothesis testing (t-test) to compare vaccination progress across regions.
- Predictive Modeling: Use linear regression to predict total cases based on key factors.
- 📈 Time-Series Trends: Visualize total cases, vaccination progress, and positivity rates with interactive plots.
- 🔗 Automated Updates: Weekly updates of COVID-19 data using GitHub Actions.
- 📊 Correlation Analysis: Uncover relationships between GDP, vaccination progress, and population density.
- 🔍 Predictive Modeling: Linear regression to identify factors influencing total cases.
- 🔬 Hypothesis Testing: Compare vaccination progress between South Asia and the Middle East.
- 📂 Interactive Analysis: Use dropdown selectors for country-specific insights.
- Source: Our World in Data
- File:
data/owid-covid-data.csv
- Description:
- Global COVID-19 statistics: total cases, deaths, vaccination data.
- Economic and demographic data: GDP per capita, population density.
git clone https://github.com/yourusername/COVID-19-Regional-Analysis.git
cd COVID-19-Regional-Analysis
It’s recommended to use a virtual environment for dependency isolation:
python -m venv venv
Activate the virtual environment:
- Windows:
venv\Scripts\activate
- macOS/Linux:
source venv/bin/activate
pip install -r requirements.txt
Navigate to the notebook
directory and start the Jupyter Notebook:
cd notebook
jupyter notebook
Open covid-analysis.ipynb
in your browser.
This project includes a GitHub Actions Workflow to automate weekly updates of the dataset. The dataset updates every Sunday at midnight UTC. Ensure the workflow is enabled in the repository's Actions tab.
COVID-19-Regional-Analysis/
├── data/ # Dataset directory
│ ├── owid-covid-data.csv # COVID-19 data file
│ └── last_updated.txt # Metadata for last update timestamp
├── notebook/ # Jupyter notebooks for analysis
│ └── covid-analysis.ipynb # Main notebook
├── scripts/ # Python scripts for automation
│ └── update_data.py # Automates data updates
├── .github/workflows/ # GitHub Actions workflows
│ └── update_data.yml # Workflow for weekly data updates
├── images/ # Example visualizations
├── requirements.txt # Python dependencies
└── README.md # Project documentation
-
Vaccination Progress:
- Middle Eastern countries exhibit stronger correlation with GDP than South Asia.
- Socio-economic constraints contribute to slower vaccination progress in South Asia.
-
Positivity Rates:
- South Asia experiences spikes in positivity rates due to testing delays.
- Middle East demonstrates consistent positivity rate trends.
-
Regression Analysis:
- GDP per capita and vaccination progress are significant predictors of total cases.
- Population density has a weaker correlation.
-
Hypothesis Testing:
- Significant differences exist in vaccination progress between the regions (p-value < 0.05).
- Extend analysis to additional regions for global comparisons.
- Integrate advanced predictive models (e.g., Random Forest, XGBoost).
- Include government stringency index and healthcare capacity metrics.
- Develop a fully interactive dashboard with Plotly Dash.
Contributions are welcome! Feel free to:
- Fork the repository.
- Submit pull requests for new features or fixes.
- Report any issues or suggestions.