Commit

removed midterm due date from schedule, updated HW 3
kstreet13 committed Oct 23, 2024
1 parent 6c1b30c commit 2200ca0
Showing 2 changed files with 7 additions and 1 deletion.
6 changes: 6 additions & 0 deletions website/content/assignment/09-hw3.Rmd
@@ -16,3 +16,9 @@ A new dataset has been added to the data science data repository <https://github
1. Tokenize the abstracts and count the number of each token. Do you see anything interesting? Does removing stop words change what tokens appear as the most frequent? What are the 5 most common tokens for each search term after removing stopwords?
2. Tokenize the abstracts into bigrams. Find the 10 most common bigrams and visualize them with ggplot2.
3. Calculate the TF-IDF value for each word-search term combination (here you want the search term to be the "document"). What are the 5 tokens from each search term with the highest TF-IDF value? How are the results different from the answers you got in question 1?

+ ## Sentiment Analysis
+
+ 1. Perform a sentiment analysis using the NRC lexicon. What is the most common sentiment for each search term? What if you remove `"positive"` and `"negative"` from the list?
+ 2. Now perform a sentiment analysis using the AFINN lexicon to get an average positivity score for each abstract (hint: you may want to create a variable that indexes, or counts, the abstracts). Create a visualization that shows these scores grouped by search term. Are any search terms noticeably different from the others?
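
The hint in question 2 about indexing the abstracts can be sketched roughly as follows. This is a minimal illustration, not the course's solution: the column names `search_term`, `abstract`, and `abstract_id` are assumptions about the dataset, and `toy_lexicon` is a small hand-written stand-in with the same `word` and `value` columns that `get_sentiments("afinn")` returns, so the example runs without downloading the real lexicon.

```r
library(dplyr)
library(tidytext)

# Toy stand-in for the abstracts data; column names are hypothetical.
abstracts <- tibble(
  search_term = c("term_a", "term_a", "term_b"),
  abstract = c(
    "A good and effective treatment",
    "A bad outcome with terrible side effects",
    "Good results overall"
  )
)

# Illustrative stand-in for get_sentiments("afinn"); values are for demo only.
toy_lexicon <- tibble(
  word  = c("good", "effective", "bad", "terrible"),
  value = c(3, 2, -3, -3)
)

scores <- abstracts %>%
  mutate(abstract_id = row_number()) %>%      # index each abstract before tokenizing
  unnest_tokens(word, abstract) %>%           # one lowercase token per row
  inner_join(toy_lexicon, by = "word") %>%    # keep only tokens that have a score
  group_by(search_term, abstract_id) %>%
  summarise(mean_score = mean(value), .groups = "drop")

print(scores)
```

The index has to be created before `unnest_tokens()`, because afterwards each row is a token rather than an abstract. Note that abstracts containing no lexicon words drop out at the `inner_join()` step, so they will not appear in the resulting scores.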

2 changes: 1 addition & 1 deletion website/data/schedule.yaml
@@ -48,7 +48,7 @@ lessons:
reading: "07-reading"
assignment: ""

- - title: "(Midterm Project due) High Performance Computing, Cloud Computing"
+ - title: "High Performance Computing, Cloud Computing"
date: "Week 9 (10/25)"
class: "09-class"
reading: "09-reading"
