In this tutorial, we’ll use sentiment analysis on Twitter data about the latest movie titles to answer that age-old question: “Is that movie any good?” We’ll show how we built the solution using Apache Cassandra, Apache Spark, DataStax Enterprise Analytics, Python, and Jupyter notebooks. This is a great tutorial to attend if you are new to big data or want to learn more about Cassandra and Spark!
- Download the correct Docker Community Edition for your platform: https://store.docker.com/search?type=edition&offering=community
- Create a login to download
- Download Docker
- Allow 5 GB of memory per container
- Docker -> Preferences -> Advanced -> Memory
- cd YourDownloadPath/pydata
- docker-compose up -d (should take 6 minutes; this also starts DataStax Enterprise, Spark, and Jupyter)
- Once the download and startup are complete:
- Log in with the token found in the Jupyter logs
- docker logs pydata_jupyter_1
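
The last two steps can also be scripted. The following is a minimal sketch, not part of the tutorial's own materials: it assumes the Jupyter container logs contain a login URL with a `token=<hex string>` query parameter, which is how Jupyter normally prints it. The helper names `extract_token` and `read_container_logs` are hypothetical; only the container name `pydata_jupyter_1` comes from the steps above.

```python
import re
import subprocess
from typing import Optional

# Assumption: the token appears in the logs as "token=<hex digits>",
# typically inside a URL such as http://.../?token=...
TOKEN_PATTERN = re.compile(r"token=([0-9a-fA-F]+)")


def extract_token(log_text: str) -> Optional[str]:
    """Return the most recent token found in the log text, or None."""
    matches = TOKEN_PATTERN.findall(log_text)
    return matches[-1] if matches else None


def read_container_logs(container: str = "pydata_jupyter_1") -> str:
    """Run `docker logs <container>` and return its combined output."""
    result = subprocess.run(
        ["docker", "logs", container],
        capture_output=True,
        text=True,
        check=True,  # raise if the container does not exist
    )
    # `docker logs` replays the container's stderr separately from its
    # stdout, so both streams are searched.
    return result.stdout + result.stderr
```

With the containers running, `print(extract_token(read_container_logs()))` should print the token to paste into the Jupyter login page. If it prints `None`, Jupyter may not have finished starting yet.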