Skip to content

Code repository for Advances in Data Science Project

Notifications You must be signed in to change notification settings

ashrithagoramane/INFO7390

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

INFO7390 - Yelp Business Success Prediction

Problem Statement

The challenge lies in leveraging Yelp data to gain insights into factors that contribute to the success of restaurants and predicting the success of new or existing restaurants based on various features and attributes.

Pre-requisites

We ensure that we have the following requirements on the machine

  • Python 3.8+
  • pip (Python package installer)

Project Overview

This project utilizes Yelp's extensive dataset to explore factors that influence the success of businesses. We aimed to develop a predictive model that can forecast the potential success of new and existing businesses based on various attributes available in the data. This model serves to aid business owners, investors, and analysts in making informed decisions.

Run the application locally

  1. Clone the repository:
    git clone https://github.com/ashrithagoramane/INFO7390.git 
  2. Create a virtual environment and source it
    python3 -m venv .venv
    source .venv/bin/activate
  3. Install the required packages (Optional)
    pip install -r requirements.txt

Data sources

Project features

  • Data Preprocessing: Handles missing values, data standardization, and outlier detection.
  • Exploratory Data Analysis (EDA): Visual and statistical analysis to identify patterns and anomalies.
  • Feature Engineering: Enhances features to improve model accuracy.
  • Model Training: Includes Linear Regression, Decision Trees, Random Forest, Gradient Boosting, and XGBoost.
  • Performance Evaluation: Uses metrics to evaluate and compare model performance.

References

Contributors

About

Code repository for Advances in Data Science Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published