MAJOR Code Refactoring #5

njfritter · 2018-06-11T21:05:26Z

Had Raul review my project and he grilled me on a number of things. Here's what I need to do:

Redo formatting of scripts so indentation is consistent; some use 2-space indents
Remove all extraneous packages (e.g. using both the pandas and csv modules)
Reformat helper_functions: remove unnecessary functions and document each one's purpose and inputs
Redo tokenization section to account for Twitter-specific content

Will add on as I find more

njfritter · 2018-06-11T21:06:30Z

Preliminary Checklist:

Redo formatting of scripts so indentation is consistent; some use 2-space indents
Remove all extraneous packages (e.g. using both the pandas and csv modules)
Reformat helper_functions: remove unnecessary functions and document each one's purpose and inputs
Redo tokenization section to account for Twitter-specific content
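
For the tokenization item, here is a rough sketch of what Twitter-aware tokenization could look like. This is not the project's actual code: `TOKEN_PATTERN` and `tokenize_tweet` are hypothetical names, and the regex is an assumption about which tokens matter (URLs, @mentions, #hashtags, plain words).

```python
import re

# Hypothetical pattern: order matters, URLs are tried before plain words.
TOKEN_PATTERN = re.compile(
    r"""
    https?://\S+        # URLs, kept whole
    | [@#]\w+           # @mentions and #hashtags, kept with their prefix
    | \w+(?:'\w+)?      # plain words, allowing one apostrophe (e.g. don't)
    """,
    re.VERBOSE,
)


def tokenize_tweet(text):
    """Split a tweet into tokens, keeping URLs, mentions, and hashtags intact."""
    tokens = TOKEN_PATTERN.findall(text)
    # Lowercase everything except URLs, which can be case-sensitive.
    return [
        tok if tok.startswith(("http://", "https://")) else tok.lower()
        for tok in tokens
    ]
```

One known limitation of this sketch: punctuation directly after a URL (such as a trailing period) stays attached to the URL token.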

njfritter · 2018-06-11T21:08:25Z

Did everything above; still haven't gotten the tokenization down pat but things are looking good.

Here is a pull request detailing the work so far

njfritter · 2018-06-11T22:09:43Z

In order to continue, I will need a successful run where I can generate a tokenized dataset. From there I will look into removing stopwords.

Issue here

Moving to "On Hold"

Update: Issue complete, moving back to "In Progress"

njfritter · 2018-06-12T15:43:59Z

Next steps:

Make sure code can run on a different machine (or from a different directory location)

Make code to generate various actionable insights (exploratory analysis)
- Hashtag Frequency
- URL frequency
- N-gram frequency
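
The three frequency counts above could be sketched roughly like this, assuming each tweet has already been tokenized into a list of strings with hashtags prefixed by `#` and URLs starting with `http`. The function names are hypothetical and not taken from the repo.

```python
from collections import Counter


def hashtag_counts(token_lists):
    """Count tokens that start with '#' across all tweets."""
    return Counter(
        tok for tokens in token_lists for tok in tokens if tok.startswith("#")
    )


def url_counts(token_lists):
    """Count tokens that look like http(s) URLs across all tweets."""
    return Counter(
        tok
        for tokens in token_lists
        for tok in tokens
        if tok.startswith(("http://", "https://"))
    )


def ngram_counts(token_lists, n=2):
    """Count n-grams (as tuples) within each tweet, never across tweets."""
    counts = Counter()
    for tokens in token_lists:
        # zip over n shifted views of the token list yields consecutive n-grams
        counts.update(zip(*(tokens[i:] for i in range(n))))
    return counts
```

Calling `.most_common(10)` on any of the returned counters gives the top ten entries.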

Update: For this issue I will only be making sure the code can run on a different machine/directory location. The exploratory analysis points are moving to a different issue.
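
One possible way to keep scripts independent of the machine or working directory is to locate the project root at runtime instead of hard-coding paths. A minimal sketch, assuming the root is the nearest parent directory containing a marker file; `find_project_root` and the `README.md` marker are assumptions for illustration, not the project's actual approach.

```python
from pathlib import Path


def find_project_root(start, marker="README.md"):
    """Walk upward from `start` until a directory containing `marker` is found."""
    start = Path(start).resolve()
    for candidate in [start, *start.parents]:
        if (candidate / marker).exists():
            return candidate
    raise FileNotFoundError(f"No {marker!r} found in or above {start}")
```

A script could then call `find_project_root(__file__)` and build every data path from the result, so it no longer matters where the repo is cloned or which directory the script is launched from.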

njfritter · 2018-06-14T21:36:20Z

Will be doing exploratory analysis for the time being (issue here), moving to "On Hold"

njfritter · 2018-06-21T23:18:22Z

Update:

Changed directory structure so that raw, untouched data and processed data live in separate directories
Added directory for model objects
Updated READMEs
Still altering functions and helper_functions.py script (will likely split up by category)
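
For reference, a layout like the one described could be created with a small helper. The directory names in `LAYOUT` are placeholders chosen for illustration, not necessarily the names used in the repo, and `ensure_layout` is a hypothetical function.

```python
from pathlib import Path

# Placeholder names: raw data, processed data, and saved model objects.
LAYOUT = ("data/raw", "data/processed", "models")


def ensure_layout(root):
    """Create the expected directories under `root` if they do not exist yet."""
    root = Path(root)
    created = []
    for rel in LAYOUT:
        path = root / rel
        # parents=True builds intermediate directories; exist_ok avoids errors on reruns
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```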

njfritter self-assigned this Jun 11, 2018
