1 minute read

This is the final project for our Data Mining and Wrangling class in collaboration with Alejandro White, Josef Monje, MatMat Romero, Jasper Pangan and Cedric Corro. Findings were shared in an online public presentation.

Summary

Implementing a clustering method of restaurants provides a way to create a data-driven approach of choosing where to eat. As such, this study sought to find out whether the said method will show significant results especially when done in the city of Makati. This study as well aimed to determine what the cluster’s features are and how it can help someone know where to satisfy their palettes. Data was gathered via the Zomato API, yielding 2,375 data points pertaining to restaurants. Once mined and wrangled, dataset was reduced via multiple factor analysis. After which, the cleaned data was processed via k-means clustering. Internal validation methods were used to determine the most appropriate number of clusters with the dataset. After implementing the k-Means clustering method and internal validation checks, 4 clusters were chosen which were labeled as the following: namely, Quick Bites, Fast Food, Casual Dining and Full-Service Restaurants. The top features and top restaurants per cluster were taken to provide a brief idea of themes of each cluster. This can help several stakeholders in providing an unsupervised learning method as a way of making informed decisions on where to go to eat.