Spark-MySQL(/AuroraDB)-Scala benchmarking

MySQL is pretty fast with Spark. It's actually not bad, MKAY!

WRITES

Time taken : 15 seconds to insert 1000000 records

Roughly 66,666 inserts a second. Not bad, MKAY!
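The inserts-per-second figure is just records divided by seconds. Here is a minimal sketch of that arithmetic, plus a tiny timing helper you could wrap around any block of code. The names `WriteThroughput`, `perSecond` and `timed` are made up for illustration and are not taken from this repo's source.

```scala
object WriteThroughput {
  // Whole records per second, rounded down (integer division).
  def perSecond(records: Long, seconds: Long): Long = {
    require(seconds > 0, "seconds must be positive")
    records / seconds
  }

  // Runs `body` once and returns its result together with the elapsed whole seconds.
  def timed[A](body: => A): (A, Long) = {
    val start = System.nanoTime()
    val result = body
    val elapsedSeconds = (System.nanoTime() - start) / 1000000000L
    (result, elapsedSeconds)
  }

  def main(args: Array[String]): Unit = {
    val records = 1000000L
    val seconds = 15L // the figure reported above, not re-measured here
    println(s"Time taken : $seconds seconds to insert $records records")
    println(s"Roughly ${perSecond(records, seconds)} inserts a second")
  }
}
```

To time something yourself, wrap it like `val (_, secs) = WriteThroughput.timed { /* your write goes here */ }` and pass `secs` to `perSecond`.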

READS

Time Taken : 6 seconds to read 100 records
Time Taken : 6 seconds to read 1000 records
Time Taken : 5 seconds to read 100000 records
Time Taken : 10 seconds to read 1000000 records
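Turning the read timings into records per second makes one thing stand out: the time barely moves between 100 and 100,000 records. A plausible reading (an assumption, not something measured here) is that small reads are dominated by fixed overhead such as job startup and connection setup. A minimal sketch of the arithmetic, with `ReadThroughput` as a made-up name:

```scala
object ReadThroughput {
  // (records read, seconds taken), copied from the READS figures above.
  val measurements: Seq[(Long, Long)] = Seq(
    100L     -> 6L,
    1000L    -> 6L,
    100000L  -> 5L,
    1000000L -> 10L
  )

  // Whole records per second, rounded down (integer division).
  def perSecond(records: Long, seconds: Long): Long = {
    require(seconds > 0, "seconds must be positive")
    records / seconds
  }

  def main(args: Array[String]): Unit =
    measurements.foreach { case (records, seconds) =>
      println(s"$records records in $seconds s = ${perSecond(records, seconds)} records/s")
    }
}
```

So the 1,000,000-record read is the only one that says much about actual read throughput; the smaller ones mostly measure the fixed cost.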

MOTIVATION - why MySQL?

Amazon AuroraDB is a new, cost-effective, MySQL-compatible database engine for Amazon RDS that is claimed to be 5X faster than regular MySQL. So whatever throughput numbers are shown here, you can conceivably multiply them by 5*

You can read about AuroraDB here

* common sense required

MY ENV

1 Spark cluster

Mac:
Processor Name          Intel Core i5
Processor Speed         2.4 GHz
Number of Processors    1
Total Number of Cores   2
L2 Cache (per Core)     256 KB
L3 Cache                3 MB
Memory                  16 GB

RUN THE BENCHMARKS YOURSELF GURLFRIEND

docker build -t spark-mysql:1.0.0 .
docker run -it spark-mysql:1.0.0

Bye ✌️

akeebismail/spark-mysql-scala
