Combined assessment :: Text Mining / NLP

This is a comprehensive project covering the Data Visualization and Text Analytics courses.

The project uses the Airbnb dataset. Airbnb is an American company that operates an online marketplace for lodging, primarily home stays for vacation rentals, and tourism activities. Based in San Francisco, California, the platform is accessible via website and mobile app. Airbnb does not own any of the listed properties; instead, it profits by receiving a commission from each booking. The company was founded in 2008 by Brian Chesky, Nathan Blecharczyk and Joe Gebbia. Airbnb is a shortened version of its original name, AirBedandBreakfast.com. The company has been criticized for possibly driving up home rents and creating nuisances for those living near leased properties. The company is regulated by many jurisdictions, including the European Union and cities such as San Francisco and New York City. It is viewed as a competitive threat by the hotel industry.

How to get the dataset?

Each student will need to deploy a free MongoDB cluster. After deploying the cluster, the student will be able to load the “sample data”, which includes the Airbnb collection of documents. Instructions on how to deploy a free MongoDB cluster can be found here:

Mongo DB Setup for Airbnb data.pdf

After deploying the cluster and loading the data, the student will use the following R script to connect RStudio to the Airbnb data in MongoDB.

The R script template can be found here:

MongoDB Airbnb in RR
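
For orientation only, the connection step might look like the sketch below. This is not the course's R script template, which should be used for the actual project. It assumes the mongolite package is installed, that the Atlas sample data uses the database name sample_airbnb and the collection name listingsAndReviews (check the names in your own cluster), and that the connection string is stored in an environment variable called MONGODB_URI, a name chosen here purely for illustration.

```r
# Sketch only: NOT the course's R script template.
# Assumptions: the mongolite package is installed, the sample data is loaded,
# and the database/collection names below match those shown in your cluster.

load_airbnb <- function(connection_string, limit = 100) {
  # Open a handle to the Airbnb collection in the sample database.
  airbnb <- mongolite::mongo(
    collection = "listingsAndReviews",  # assumed collection name
    db = "sample_airbnb",               # assumed database name
    url = connection_string
  )
  # Pull a small subset of fields; find() returns a data frame.
  airbnb$find(
    query = "{}",
    fields = '{"name": 1, "description": 1, "property_type": 1}',
    limit = limit
  )
}

# MONGODB_URI is a hypothetical environment variable holding the connection
# string copied from the cluster's "Connect" dialog. Nothing runs if it is unset.
connection_string <- Sys.getenv("MONGODB_URI")
if (nzchar(connection_string)) {
  listings <- load_airbnb(connection_string)
  str(listings)
}
```

Keeping the connection string in an environment variable rather than in the script avoids submitting credentials together with the R file.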

Deliverables:

You were hired by Airbnb to analyze descriptions of postings. The Airbnb management team wants to know if there are any business insights that can be found in the postings' content (text content and numerical content).

You will explore the data to create visualizations. You are expected to create a data story using Tableau, R Shiny, or Power BI. You are also expected to create visualizations and perform text analysis in R.

Deliverable: For this project, you will submit 3 files:

1. A project report - details below *

2. Tableau screenshots, an R Shiny app, a Power BI file, or any other data visualization that you created.

3. R file(s) with code that structures the textual data from Airbnb and applies text mining frameworks.

* The written report must use the following format: double-spaced, 12-point Times New Roman font, and no longer than 3 pages. In the appendix of the report, your team will include screenshots of the visual assets and the R code that was used to structure the text data. Please include the following elements in your written report and make sure to export it to PDF format:

1. Executive summary with business insight.

2. Describe at least 4 visualizations created.

3. Describe the dashboard created.

4. Explain key findings and business insight from the text data. Analyze at least 3 text mining frameworks.
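
As a rough illustration of item 4, the sketch below applies three techniques that are commonly treated as text mining frameworks: token frequency, TF-IDF, and lexicon-based sentiment. Which frameworks count for this project is defined by the course materials, not by this example. The sketch assumes the dplyr and tidytext packages are installed, and the three listing descriptions are made up for illustration; in the project they would come from the Airbnb collection.

```r
# Sketch only: toy data and an assumed choice of three frameworks.
library(dplyr)
library(tidytext)

# Hypothetical stand-in for the Airbnb descriptions.
listings <- tibble(
  listing_id = c("a", "b", "c"),
  description = c(
    "Bright and clean apartment with a lovely terrace by the beach",
    "Quiet studio in a clean house, close to the metro",
    "Noisy street but a comfortable flat by the beach"
  )
)

# Structure the text: one row per word, with common stop words removed.
tokens <- listings %>%
  unnest_tokens(word, description) %>%
  anti_join(stop_words, by = "word")

# Framework 1: overall token frequency.
word_freq <- tokens %>%
  count(word, sort = TRUE)

# Framework 2: TF-IDF, which highlights words distinctive to each listing.
description_tf_idf <- tokens %>%
  count(listing_id, word) %>%
  bind_tf_idf(word, listing_id, n) %>%
  arrange(desc(tf_idf))

# Framework 3: sentiment, using the Bing lexicon bundled with tidytext.
listing_sentiment <- tokens %>%
  inner_join(get_sentiments("bing"), by = "word") %>%
  count(listing_id, sentiment)

print(word_freq)
print(description_tf_idf)
print(listing_sentiment)
```

Each resulting table is a tidy data frame, so it can be passed directly to a plotting library or exported for a dashboard.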

Important: Failure to include any of these required elements will result in an F grade. Submitting a report that is longer than 3 pages (excluding the appendix) will result in a one-letter grade deduction. If any of these elements are submitted after the deadline, there will be a one-letter grade deduction.

In the last session, students will be guided through the project definition and scope, and there will be a discussion about the expected product.

 

Team Project FAQs

1. Use the knowledge we discuss in class and the posted materials.

2. You may organize your presentation and paper however you wish, but be sure to emphasize the meaning you want to convey through your visualizations.