#Effect of Rain and Higher Ridership on NYC Subway System
##Table of Contents
bar_charts.py
: Plot hourly entries to subway.

final_revised.pdf
: Summary report on findings from analysis

gd_regression.py
: Perform regression via gradient descent using presence of rain as regression feature. Plot residuals

histograms.py
: Plot histograms for number of hourly entries on rainy and clear days.

queries.py
: Query the dataframe for ridership statistics using pandasql library.

shapiro_wilk.py
: Use the Shapiro Wilk test for normality. Verify with q-q plot.

subway.txt
: Raw turnstile data

turnstile_data_master_with_weather.csv