mwytock0812/ny_subway

# Effect of Rain and Higher Ridership on NYC Subway System

## Table of Contents

  • bar_charts.py: Plot hourly subway entries.
  • final_revised.pdf: Summary report on the findings of the analysis.
  • gd_regression.py: Perform regression via gradient descent, using the presence of rain as a regression feature, and plot the residuals.
  • histograms.py: Plot histograms of the number of hourly entries on rainy and clear days.
  • queries.py: Query the dataframe for ridership statistics using the pandasql library.
  • shapiro_wilk.py: Test for normality with the Shapiro-Wilk test and verify with a Q-Q plot.
  • subway.txt: Raw turnstile data.
  • turnstile_data_master_with_weather.csv
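
As a rough illustration of the approach gd_regression.py is described as taking (regression via gradient descent with rain as a feature, followed by residuals), here is a minimal sketch. It is not the repository's code: the function names `gradient_descent` and `residuals`, the learning rate, the iteration count, and the six data points are all assumptions made up for this example, and it uses only the Python standard library rather than whatever the script itself imports.

```python
def gradient_descent(features, values, alpha=0.5, num_iterations=500):
    """Fit values ~ theta[0] + theta[1] * feature by batch gradient descent.

    Hypothetical helper for illustration; minimizes mean squared error.
    """
    m = len(values)
    theta = [0.0, 0.0]  # [intercept, rain coefficient]
    for _ in range(num_iterations):
        # Prediction error for every observation under the current theta.
        errors = [theta[0] + theta[1] * x - y for x, y in zip(features, values)]
        # Gradient of (1 / 2m) * sum(errors^2) with respect to each parameter.
        grad_intercept = sum(errors) / m
        grad_rain = sum(e * x for e, x in zip(errors, features)) / m
        theta[0] -= alpha * grad_intercept
        theta[1] -= alpha * grad_rain
    return theta


def residuals(features, values, theta):
    """Return observed minus predicted value for each observation."""
    return [y - (theta[0] + theta[1] * x) for x, y in zip(features, values)]


# Made-up toy data, NOT taken from the turnstile files:
# rain is a 0/1 indicator, entries are hourly entry counts.
rain = [0, 0, 0, 1, 1, 1]
entries = [100.0, 120.0, 110.0, 130.0, 150.0, 140.0]

theta = gradient_descent(rain, entries)
res = residuals(rain, entries, theta)
print("intercept:", round(theta[0], 3), "rain coefficient:", round(theta[1], 3))
```

With a single 0/1 feature, the fitted intercept approaches the mean of the clear-hour entries and the rain coefficient approaches the difference between the rainy and clear means, which gives a quick sanity check on the fit before inspecting the residuals.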

About

Final project for Udacity's Intro to Data Science course.
