Data Science



Below you will find projects related to Data Science, Machine Learning and Python programming. To learn more about these projects, you can view a summary of the projects by clicking on the 'Learn More' button.

If you wish to dive deeper into the technical aspects of a particular project, click on the 'GitHub' button on the corresponding project card to be directed to it's respective GitHub page.




           

PROJECTS



An Exploratory Data Analysis of COVID-19 in India

This project explores the current situation of the COVID-19 pandemic in India and then delves deeper to answer a few questions regarding the severity of spread in different states as well as India as a whole.

Simple Logistic Regression using Advertising Data

This project contains an initial EDA of data from a hypothetical advertisement company and analyzes how different features impact the probability of a customer clicking on their advertisement.

K Nearest Neighbors for the Classification of Anonymized Data

This project contains a brief analysis of the anonymized dataset using some basic statistical tools and charts. The primary objective is to use the K Nearest Neighbors algorithm to correctly classify our data.

An Exploratory Data Analysis of COVID-19 Worldwide

Our end goal in this notebook is to analyze the current spread of COVID and visualizes the number of cases country-wise. We also look at the amount of new cases country-wise which will give us a picture of how well the virus is being contained. Lastly, we will look into regions that have the most cases and further examine how the virus is spread state-wise.

Linear Regression on E-commerce Dataset

This project is representative of an e-commerce company that sells clothing online but also has in-store style and clothing advice sessions. Customers come in to the store, have sessions/meetings with a personal stylist, then they can go home and order either on a mobile app or website for the clothes that they want.

Telecommunication Consultancy Data Analysis

Our objective in this project is to build a dashboard to understand the user's demographic characteristics based on their mobile usage, geolocation, and mobile device properties. Doing so will help developers, brand advertisers and other telecom companies around the world to pursue data-driven marketing efforts which are relevant to their users and catered to their preferences.

Exploratory Data Analysis of Car Sales Dataset

The projects objective is to examine a dataset which consists of car sale information over the last few decades. We seek to examine this data and give actionable insights to car manufacturers to increase their sales in the future.

Comparison of Tree-Based Models for Loan Default Prediction

Our goal with this project is to compare the performance of different tree-based models on our LendingClub data in order to try and predict the probability of loan repayment.

Classification of Species in Iris Dataset Using Support Vector Machines

In this project, we will be using the famous Iris flower dataset. We will be using a Support Vector Machine to classify data as one of three species of Iris. To optimize our model we will use GridSearch to experiment with our parameters qnd hopefully be able to improve our models accuracy.

Detection of COVID-19 and Viral Pneumonia from Chest X-Rays

This project attemps to do image classification using Resnet18 to predict whether a person has COVID-19 or Viral Pneumonia based on their chest X-ray.

Scraping and EDA of hotel details from booking.com using Python and Selenium

Using Python and Selenium, we scrape hotel details from booking.com in this project and perform minor EDA of this data.

Multi-digit number detection using Convolutional Neural Networks

In this project, we attempt to build a multi-digit classifier using Convolutional Neural Networks and OpenCV for image data.

Customer retention data EDA and Prediction

This project focuses on predicting whether a user would stop using a particular service or not. We perform EDA on the data and proceed to test different models on it to see which gives us the best accuracy.



Back to top