Participation in the Yelp Competition on Kaggle

DATASET

Training

Restaurants: 2000 Images: 234843

Test

Restaurants: 10000 Images: ~237000

DEPENDENCIES

Dependencies for SURF & Clustering (http://mulan.sourceforge.net/)

(To make Python 3 and OpenCV work with SURF, follow the awesome instructions in this [link](http://www.pyimagesearch.com/2015/07/16/where-did-sift-and-surf-go-in-opencv-3/).)

Dependencies for Classification

PROCEDURE

  • Load images
  • Find POI (points of interest; SIFT / SURF via OpenCV) & form POIVector
  • Cluster the POI in order to form PhotoVector
  • Cluster the images in order to form BusinessVector
  • Form training dataset from BusinessVector and labels
  • Classification & evaluation on training dataset
  • Predictions on test dataset
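
The two clustering steps above can be sketched roughly as follows. This is a minimal illustration, not the code of this repository: it assumes a bag-of-visual-words reading of the procedure, in which a PhotoVector is a histogram of POI clusters and a BusinessVector is a histogram of photo clusters. The function names (`kmeans`, `nearest`, `histogram`, `build_business_vectors`), the parameters `n_words` and `n_photo_clusters`, and the random stand-in descriptors are all hypothetical. In a real run the descriptors would come from SIFT / SURF (64 or 128 values each) and the k-means would come from a library rather than from pure Python.

```python
import random


def nearest(p, centroids):
    """Index of the centroid closest to p (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
    )


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on lists of floats; returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for i, group in enumerate(groups):
            if group:  # keep the old centroid if a cluster went empty
                centroids[i] = [sum(col) / len(group) for col in zip(*group)]
    return centroids


def histogram(items, centroids):
    """Normalised count of how many items fall into each cluster."""
    counts = [0.0] * len(centroids)
    for item in items:
        counts[nearest(item, centroids)] += 1.0
    total = sum(counts) or 1.0
    return [c / total for c in counts]


def build_business_vectors(photo_descriptors, photo_to_business,
                           n_words=4, n_photo_clusters=3):
    """photo_descriptors: {photo_id: [descriptor, ...]} (the POIVectors).
    photo_to_business: {photo_id: business_id}.
    Returns {business_id: BusinessVector}."""
    # Cluster all POI descriptors into "visual words".
    all_desc = [d for descs in photo_descriptors.values() for d in descs]
    words = kmeans(all_desc, n_words)
    # PhotoVector: histogram of visual words found in each photo.
    photo_vecs = {pid: histogram(descs, words)
                  for pid, descs in photo_descriptors.items()}
    # Cluster the photos themselves (assumed meaning of "clustering on images").
    photo_clusters = kmeans(list(photo_vecs.values()), n_photo_clusters)
    # BusinessVector: histogram of photo clusters over a business's photos.
    by_business = {}
    for pid, vec in photo_vecs.items():
        by_business.setdefault(photo_to_business[pid], []).append(vec)
    return {biz: histogram(vecs, photo_clusters)
            for biz, vecs in by_business.items()}


if __name__ == "__main__":
    rng = random.Random(42)
    # Six fake photos with five random 8-value descriptors each.
    photos = {p: [[rng.random() for _ in range(8)] for _ in range(5)]
              for p in range(6)}
    owner = {p: "business_%d" % (p % 2) for p in range(6)}
    for biz, vec in build_business_vectors(photos, owner).items():
        print(biz, vec)
```

Each BusinessVector has a fixed length (`n_photo_clusters`) regardless of how many photos a restaurant has, which is what allows the vectors to be paired with the labels and passed to a classifier in the following steps.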