
This repository contains projects for Data Analysis using various data sources, implementing data warehousing concepts, and performing ETL using various tools.


vinit2107/Data-Analysis


Data-Analysis

Image Source: Unsplash

This repository contains data analysis projects built on various data sources. The projects included are:

  1. Flight Data Analysis

This project involves scraping data from the Bureau of Transportation Statistics and uploading the downloaded files to Amazon S3, which acts as the staging layer. Next, ETL is performed on this data with Spark on an EMR cluster. The resulting data is stored in Redshift, which acts as the data warehouse and serves as the source for a Tableau dashboard.
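The staging and transform steps described above can be sketched roughly as follows. This is a minimal illustration in plain Python, standing in for the Spark job, and is not the project's actual code: the key layout, the function names `staging_key` and `average_delay_by_carrier`, and the column names `CARRIER` and `ARR_DELAY` are all assumptions made for the example.

```python
import csv
import io
from collections import defaultdict
from typing import Dict


def staging_key(year: int, month: int, filename: str) -> str:
    """Build an object key for the staging layer (hypothetical layout)."""
    return f"staging/flights/year={year}/month={month:02d}/{filename}"


def average_delay_by_carrier(csv_text: str) -> Dict[str, float]:
    """Transform step: average arrival delay per carrier, skipping blank delays."""
    totals = defaultdict(lambda: [0.0, 0])  # carrier -> [sum of delays, row count]
    for row in csv.DictReader(io.StringIO(csv_text)):
        delay = row["ARR_DELAY"].strip()
        if not delay:
            continue  # rows with no recorded delay are ignored
        totals[row["CARRIER"]][0] += float(delay)
        totals[row["CARRIER"]][1] += 1
    return {carrier: total / count for carrier, (total, count) in totals.items()}


# Made-up sample rows, only to exercise the functions above.
SAMPLE = """CARRIER,ARR_DELAY
AA,10
AA,20
DL,
DL,-4
"""

if __name__ == "__main__":
    print(staging_key(2020, 3, "ontime.csv"))
    print(average_delay_by_carrier(SAMPLE))
```

Partitioning staging keys by year and month is one common way to let a later Spark job read only the months it needs. An aggregate like the one above is the kind of result that might then be loaded into the warehouse for the dashboard.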

LICENSE

MIT License

Copyright (c) 2020 Vinit Deshbhratar

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
