COVID-19 Worldwide

Exploratory Data Analysis

June 1, 2020

Project Overview

Note: This EDA was conducted towards the end of May 2020, so the data may be outdated if viewed at a later date.

Our end goal in this notebook is to analyze the current spread of COVID-19 and visualize the number of cases by country. We also look at the number of new cases by country, which gives us a picture of how well the virus is being contained. Lastly, we look into the regions that have the most cases and examine how the virus has spread state by state.

We use two datasets in this project. The first is a dataset of COVID-19 cases worldwide and the second consists of state-wise information of cases in the US.

The datasets are first processed so that missing values and unwanted features are removed. Next, we apply visualizations such as geo-plotting to understand the spread of cases worldwide.
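As a rough illustration of the cleaning step, here is a minimal sketch using pandas. The column names (`country`, `date`, `cases`, `source_url`) and the helper `clean_cases` are hypothetical and chosen only for this example; the actual datasets and the notebook's own cleaning code may differ.

```python
import pandas as pd


def clean_cases(df, keep_columns, required_columns):
    """Keep only the wanted features, then drop rows missing required values."""
    trimmed = df[keep_columns]
    cleaned = trimmed.dropna(subset=required_columns)
    return cleaned.reset_index(drop=True)


# Tiny made-up table standing in for the real dataset (all values are illustrative).
raw = pd.DataFrame({
    "country": ["A", "B", None, "D"],
    "date": ["2020-05-01", "2020-05-01", "2020-05-01", "2020-05-01"],
    "cases": [10, None, 5, 7],
    "source_url": ["u1", "u2", "u3", "u4"],  # an unwanted feature in this sketch
})

cleaned = clean_cases(
    raw,
    keep_columns=["country", "date", "cases"],
    required_columns=["country", "cases"],
)
print(cleaned)
```

Dropping rows is only one way to handle missing values; depending on the column, filling gaps (for example with zero or the previous day's value) may be more appropriate.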



COVID-19 Cases by Country



Further, we look at how cases have increased over time in each country, so that we can judge the rate of spread from the shape of the curve and determine whether it is exponential, linear, etc.
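Reading the shape off a plot is subjective, so one rough way to back it up is to compare how well a straight line fits the raw counts versus the logarithm of the counts. The sketch below is an assumed heuristic, not the method used in the notebook; the function names and the sample series are made up for illustration, and it expects strictly positive cumulative counts.

```python
import math


def r_squared(xs, ys):
    """Squared correlation of (xs, ys), i.e. R^2 of a least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        return 0.0  # a flat series carries no information about shape
    return (sxy * sxy) / (sxx * syy)


def curve_shape(cumulative_cases):
    """Label a series 'exponential' if a line fits log(cases) better than cases."""
    days = list(range(len(cumulative_cases)))
    linear_fit = r_squared(days, cumulative_cases)
    log_fit = r_squared(days, [math.log(c) for c in cumulative_cases])
    return "exponential" if log_fit > linear_fit else "linear"


# Illustrative series only: one doubles every day, one grows by a fixed amount.
doubling = [1, 2, 4, 8, 16, 32]
steady = [10, 20, 30, 40, 50, 60]
print(curve_shape(doubling), curve_shape(steady))
```

Comparing fits on different scales is only a coarse indicator, and real case curves often switch between regimes, so it is best applied to a short recent window rather than the whole series.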

Number of Cases with respect to Time

Examining our second dataset, we use geo-plotting and other visualizations to understand how COVID-19 has spread across the USA.

State-wise Number of Cases in the USA

Other metrics we use to understand the spread of the virus include the positivity rate and the death rate. These allow us to understand how severe the situation is in particular countries and states.
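The text above does not define these metrics, so the sketch below assumes the common definitions: positivity rate as positive tests divided by total tests, and death rate as deaths divided by confirmed cases. The function names and the numbers are hypothetical and used only to show the arithmetic.

```python
def positivity_rate(positive_tests, total_tests):
    """Share of tests that came back positive (assumed definition)."""
    if total_tests <= 0:
        return None  # undefined when no tests were reported
    return positive_tests / total_tests


def death_rate(deaths, confirmed_cases):
    """Deaths as a share of confirmed cases (assumed definition)."""
    if confirmed_cases <= 0:
        return None  # undefined when no cases were reported
    return deaths / confirmed_cases


# Made-up figures for a single region, purely for illustration.
print(f"Positivity rate: {positivity_rate(50, 1000):.1%}")
print(f"Death rate: {death_rate(3, 100):.1%}")
```

Both ratios depend heavily on how much testing a region does, so they are more useful for comparing trends within a region than for ranking regions against each other.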

Head over to the link below to check out the full Python notebook, which contains a more in-depth exploration of the data along with insights from other metrics to better understand the spread of COVID-19.