Skip to content

A quickstart installation of spark on ubuntu vm with pyspark

License

Notifications You must be signed in to change notification settings

sigviper/spark-quickstart

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

A very simple installation of Apache Spark to get you going on your data engineering/data science route.

install

Prerequisites

  • ubuntu/debian installation with ~1GB disk space, >4GB RAM and few vcores
  • working sudo password

Installation adds required system packages and installs spark to /opt/spark Script will also install pyspark via pip

Run bash scripts/install.sh

run master and worker

bash scripts/start.sh

Script will output master and worker URLs for you, but master is also running at http://127.0.0.1:8080/

About

A quickstart installation of spark on ubuntu vm with pyspark

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages