Scraping and EDA of hotel details from Booking.com

using Python and Selenium

October 27, 2020

Project Overview

Using Python and Selenium, we scrape hotel details from booking.com in this project and perform minor EDA of this data.

We first create the webscarper using selenium. This is done so that when we run our code block, it automatically opens up the browser, heads to the website and begin the scraping process.

Heatmap of extracted numerical features


After the scraping process is done, we will do simple Exploratory Data Analysis on this data to gain some insights in to this data. While doing this, it is also important to check if our data is in the appropriate format. If it is not, we will be converting it into a format that is more easily understandible.

Distribution of 'Star Rating' feature


We then go over some more simple patterns we can find in the data and try and infer what they mean on a surface level.

Head on over to the link below to check out the entire python notebook which contains the entire step by step process of coding the entire web-scraper, manipulating the data into an acceptable format and finally doing the simple EDA on it.