johannes-jansson/health-etl-playground

Health ETL playground

A small ETL pipeline for brushing up on my SQL and pandas skills.

Extract pulls mood, sleep and workout data from a Postgres database on localhost and a CSV file. Transform combines and massages the data. Load dumps the result into another Postgres database. The create_source function initializes mocked source databases from the SQL files in sql/.
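The three steps can be sketched roughly as below. This is a minimal illustration and not the code in etl.py: it uses in-memory SQLite databases as stand-ins for the two Postgres databases so it runs without a server, and the table names (mood_sleep, daily_health), the column names and the CSV contents are all made up for the example.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical CSV export; the real file name and columns are not given in this README.
WORKOUT_CSV = """date,workout_minutes
2023-01-01,30
2023-01-02,0
2023-01-03,45
"""


def create_source(conn):
    """Create a mocked source table (stand-in for the SQL files in sql/)."""
    conn.executescript(
        """
        CREATE TABLE mood_sleep (date TEXT PRIMARY KEY, mood INTEGER, sleep_hours REAL);
        INSERT INTO mood_sleep VALUES
            ('2023-01-01', 3, 7.5),
            ('2023-01-02', 2, 5.0),
            ('2023-01-03', 4, 8.0);
        """
    )


def extract(conn, csv_file):
    """Read one frame from the source database and one from the CSV file."""
    db_df = pd.read_sql_query("SELECT date, mood, sleep_hours FROM mood_sleep", conn)
    csv_df = pd.read_csv(csv_file)
    return db_df, csv_df


def transform(db_df, csv_df):
    """Join the two sources on date and derive a 0/1 workout flag."""
    merged = db_df.merge(csv_df, on="date", how="inner")
    merged["worked_out"] = (merged["workout_minutes"] > 0).astype(int)
    return merged


def load(df, conn):
    """Write the combined frame to the target database, replacing old data."""
    df.to_sql("daily_health", conn, if_exists="replace", index=False)


def main():
    # SQLite in memory is an assumption made for the sketch; the project uses Postgres.
    source = sqlite3.connect(":memory:")
    target = sqlite3.connect(":memory:")
    create_source(source)
    db_df, csv_df = extract(source, io.StringIO(WORKOUT_CSV))
    load(transform(db_df, csv_df), target)
    return target


if __name__ == "__main__":
    target = main()
    print(pd.read_sql_query("SELECT * FROM daily_health", target))
```

Keeping each step as a function that takes a connection means the same shape should carry over to Postgres by passing a different connection object, though the connection setup itself is not shown here.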

Create a venv, install the dependencies from requirements.txt and run python etl.py. Modify main if you need to create the target or source databases.

Inspired by/stolen from https://medium.com/datadriveninvestor/complete-data-analytics-solution-using-etl-pipeline-in-python-edd6580de24b
