Hotel-booking-analysis-EDA-

所属分类:大数据
开发工具:Jupyter Notebook
文件大小:1957KB
下载次数:0
上传日期:2022-12-26 14:27:44
上 传 者sh-1993
说明:  酒店预订分析EDA-,分析酒店预订数据(EDA顶点项目)
(Hotel-booking-analysis-EDA-,Analyzing The Data of Hotel Booking(EDA-capstone project))

文件列表:
CAPSTONE_PROJECT_01.pdf (711521, 2022-12-26)
Hotel_Booking_Analysis_EDA_Capstone_Project.ipynb (671671, 2022-12-26)
Project_summary.pdf (161698, 2022-12-26)
Team_colab_notebook.ipynb (518900, 2022-12-26)
Technical_documentation.pdf (663392, 2022-12-26)

# Hotel-booking-analysis-EDA- Analyzing The Data of Hotel Booking(EDA-capstone project) ![image](https://user-images.githubusercontent.com/92503896/209557546-25d1***c0-4dfa-46ed-a573-172b7a3377f4.png) ## **Introduction:** Have you ever wondered the trends for hotel bookings? How long a person stays? How often people cancel? What the busiest months are? In this analysis I explore a large dataset to examine these questions. The data contains “booking due to arrive between the 1st of July of 2015 and the 31st of august 2017”. This dataset contains information on records for client stays at hotels. More specifically, it contains booking information for a city hotel and a resort hotel, and includes information such as when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of available parking spaces, among other things. For the purpose of this post, I only focused on some of the important variables to examine. ## **Problem Statement:** The main objective of this project is to explore and visualize the dataset from hotel booking data using basic exploratory data analysis techniques. This will be done by finding out the information such as when the booking was made, length of stay, the number of adults, children, and/or babies, and the number of repeated guests, among other things. ## **Feature Description:** The dataset has around 119390 observations in it with 32 columns and it is mix between categorical and numerical values * Hotel ○ Resort hotel ○ City hotel * is_canceled ○ 1: Cancelled ○ 0: Not cancelled * lead_time ○ No of days that elapsed between entering date of booking into property management system and arrival date * arrival_date_year ○ Year of arrival date (2015-2017) * arrival_date_month ○ Month of arrival date (Jan - Dec) * arrival_date_week_numberr ○ Week number of year for arrival date (1-53) * arrival_date_day_of_month ○ Day of arrival date * stays_in_weekend_nights ○ No of weekend nights (Sat/Sun) the guest stayed or booked to stay at the hotel * stays_in_week_nights ○ No of week nights (Mon - Fri) the guest stayed or booked to stay at the hotel * Adults * Children * Babies * meal ○ Type of meal booked. Undefined/SC – no meal package; ○ BB – Bed & Breakfast; ○ HB – Half board (breakfast and one other meal – usually dinner); ○ FB – Full board (breakfast, lunch and dinner) * country * market_segment (a group of people who share one or more common characteristics, lumped together for marketing purposes) ○ TA: Travel agents ○ TO: Tour operators * distribution_channel (A distribution channel is a chain of businesses or intermediaries through which a good or service passes until it reaches the final buyer or the end consumer) ○ TA: Travel agents ○ TO: Tour operators * is_repeated_guest (value indicating if the booking name was from repeated guest) ○ 1: Yes ○ 0: No * previous_cancellations ○ Number of previous bookings that were cancelled by the customer prior to the current booking * previous_bookings_not_canceled ○ Number of previous bookings not cancelled by the customer prior to the current booking * reserved_room_type ○ Code of room type reserved. Code is presented instead of designation for anonymity reasons. * assigned_room_type ○ Code for the type of room assigned to the booking. Sometimes the assigned room type differs from the reserved room type due to hotel operation reasons (e.g., overbooking) or by customer request. Code is presented instead of designation for anonymity reasons. * booking_changes ○ Number of changes/amendments made to the booking from the moment the booking was entered on the PMS until the moment of check-in or cancellation * deposit_type ○ Indication on if the customer made a deposit to guarantee the booking. This variable can assume three categories: No Deposit – no deposit was made; Non- Refund – a deposit was made in the value of the total stay cost; Refundable – a deposit was made with a value under the total cost of stay. * agent -ID of the travel agency that made the booking * company ○ ID of the company/entity that made the booking or responsible for paying the booking. * day_in_waiting_list ○ Number of days the booking was in the waiting list before it was confirmed to the customer. * customer_type: ○ Contract - when the booking has an allotment or other type of contract associated to it; ○ Group – when the booking is associated to a group; ○ Transient – when the booking is not part of a group or contract, and is not associated to other transient booking; ○ Transient-party – when the booking is transient, but is associated to at least other transient booking * adr (average daily rate) * required_car_parking_spaces ○ Number of car parking spaces required by the customer * total_of_special_requests ○ Number of special requests made by the customer (e.g. twin bed or high floor) * reservation_status ○ Cancelled – booking was cancelled by the customer; ○ Check-Out – customer has checked in but already departed; ○ No-Show – customer did not check-in and did inform the hotel of the reason why * reservation_status_date ○ Date at which the last status was set. ## **Steps involved doing this project:-** Loading Data Load python libraries Load dataset Cleaning dataset Analyzing and Visualizing the Data ## **Conclusion:** 1. Majority of the hotels booked are city hotel. 2. Majority of the guests are from Western Europe.So target this area for more customers. 3. 2016 showed the highest rate of hotel bookings.(data from 2015-2017) 4. We should also target months between May to Aug.Those are peak months due to the summer period. 5. No deposit policies lead to a higher cancellation rates. 6. Since there are very few repeated guests,focus should be on retaining the customers after their first visit. 7. The majority of reservations convert into successful transactions. 8. More bookings occured on weekdays vs weekends. 9. Most bookings came from independent,transient customers.

近期下载者

相关文件


收藏者