Welcome to the General Assembly Data Science Handout page. Here I'll be assembling handouts, walkthroughs, and links so everyone has references to follow up with after class.
##Final Project Order
##General Resources
####Books
- Elements of Statistical Learning (great reference for the theory behind a lot of the techniques)
- Data Mining With R (good case studies and code examples)
- Machine Learning For Hackers (more code examples and walkthroughs)
##Lesson 1
###Slides
###Handouts
###Links
- Video Tutorials for Command Line Basics
- Command Line Data Manipulation
- Git Tutorial from Atlassian
- Git Tutorial from CodeSchool
##Lesson 2
###Slides
###Assignment
###Handouts
###Links
##Lesson 3
###Slides
###Handouts
###Links
##Lesson 4
###Assignment
###Slides
###Handouts
##Lesson 5
###Slides
###Handouts
###Links
- Logistic Regression R Tutorial from Stanford NLP
- Fast Logistic Regression: Mahout
- Fast Logistic Regression: Vowpal Wabbit
- Fast Logistic Regression: LIBLINEAR
- Odds Ratio Explanation
##Lesson 6
###Slides
###Handouts
- More basics for regression homework
- The shortest introduction to knitr
- More explanation for numbers in logistic regression
- Installing Python
- Lesson 6 R History
###Links
- Logistic Regression Walkthrough
- History of Bayes
- Mathematical Exploration of Bayes Theorem
- Naive Bayes v. Logistic Regression
##Lesson 7
###Handouts
###Links
- Scikit-learn User Guide
- Interactive Python Tutorial
- Introduction to Python
- Python Data Structures
- Introduction to Python
- Ranking NFL Teams w/Pandas, Sklearn
- Logistic Regression w/ Statsmodels - Well Switching in Bangladesh
- XKCD Style Plots in Python
###Slides
###Handouts
- A note on consistent columns for modeling in R
- vw example solution for regression homework
- Regression Homework Solution 1
- Regression Homework Solution 2
- Regression Homework Solution 3
###Assignment
###Handouts
###Links
- Blogpost on Insult Detection
- Github Code of Insult Detection Solution
- Choosing a ML Classifier
- [KMeans IPython Notebook](http://nbviewer.ipython.org/urls/raw.github.com/temporaer/tutorial_ml_gkbionics/master/2%2520-%2520KMeans.ipynb)
- Text Clustering in Sklearn
- Cloudera ML KMeans
###Slides
###Links
- A Tutorial on PCA
- Stanford PCA Tutorial
- PCA w/ R Code
- Aaron's PCA/3d/clustering post
- PCA in R with step-by-step method
- Alice's slides on recommenders
- Alice's data file (on Dropbox, or in this repo)
- Alice's recommender.py
###Slides
###Handouts
- MySQL Tutorial
##Lesson 15
###Slides
###Handouts