Health ETL playground

A small ETL pipeline for brushing up on my SQL and pandas skills.

Extract pulls mood, sleep and workout data from a Postgres database on localhost and a CSV file. Transform combines and massages the data. Load dumps the result into another Postgres database. The create_source function sets up mocked source databases from the SQL files in sql/.
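A rough sketch of how the three stages could fit together. This is not the code in etl.py: it swaps Postgres for in-memory SQLite so it runs without a database server, and the table names, column names and mock data are all made up for illustration. Only create_source is a name taken from this project; its signature here is a guess.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical mock data. The real schema lives in the project's SQL files.
SOURCE_SQL = """
CREATE TABLE mood (day TEXT PRIMARY KEY, mood INTEGER);
INSERT INTO mood VALUES ('2021-01-01', 3), ('2021-01-02', 4);
CREATE TABLE sleep (day TEXT PRIMARY KEY, hours REAL);
INSERT INTO sleep VALUES ('2021-01-01', 7.5), ('2021-01-02', 6.0);
"""
WORKOUT_CSV = "day,minutes\n2021-01-01,30\n2021-01-01,15\n"


def create_source(conn):
    """Populate a mocked source database from a SQL script."""
    conn.executescript(SOURCE_SQL)


def extract(conn, csv_file):
    """Read the two source tables and the CSV into DataFrames."""
    mood = pd.read_sql_query("SELECT day, mood FROM mood", conn)
    sleep = pd.read_sql_query("SELECT day, hours FROM sleep", conn)
    workouts = pd.read_csv(csv_file)
    return mood, sleep, workouts


def transform(mood, sleep, workouts):
    """Join everything on day; days without a workout get 0 minutes."""
    per_day = workouts.groupby("day", as_index=False)["minutes"].sum()
    combined = mood.merge(sleep, on="day", how="inner")
    combined = combined.merge(per_day, on="day", how="left")
    combined["minutes"] = combined["minutes"].fillna(0).astype(int)
    return combined


def load(df, conn):
    """Write the combined table to the target database."""
    df.to_sql("health", conn, if_exists="replace", index=False)


# SQLite stands in for both the source and the target Postgres database.
source = sqlite3.connect(":memory:")
target = sqlite3.connect(":memory:")
create_source(source)
health = transform(*extract(source, io.StringIO(WORKOUT_CSV)))
load(health, target)
rows = target.execute(
    "SELECT day, mood, hours, minutes FROM health ORDER BY day"
).fetchall()
```

Against a real Postgres instance the sqlite3 connections would be replaced by a Postgres connection (for example a SQLAlchemy engine), while the pandas calls stay the same.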

Create a venv, install the dependencies from requirements.txt and run python etl.py. Modify main if you need to create the target or source databases.

Inspired by/stolen from https://medium.com/datadriveninvestor/complete-data-analytics-solution-using-etl-pipeline-in-python-edd6580de24b

johannes-jansson/health-etl-playground
