Data Engineer & BI Analyst

New York Airbnb Price Prediction
Goals:
Build a model to predict Airbnb rooms price in New York.
I will analyse the dataset from different perspectives, with the main focus on understanding what are the most important variables that determine the price of a listing.
Solutions:
-
Perform some descriptive analysis to get a picture of the data.
-
Build a prediction model for predicting price using the variables of choice with random forest regression.
-
Finally, I will try to understand the impact of the review name on the price.
-
Generate 10 most frequent words that appear in the listing column.
-
General and remove stopwords. ( English stopwords and other expressions I need to exclude to get some meaningful results)
-
Finally, test whether the regression model that I created in the previous step can be improved by including these 10 new columns as predictors.
-
Results:
The prediction model with 10 most frequent words has a better performance.
Click Here to see the dataset, some interesting findins and Jupyter Notebooks.