Skip to content

sevincjakab/ds-eda-project-Buy-Sell_SJ

Repository files navigation

ds-project-template

Template for creating ds simple projects

Requirements

  • pyenv
  • python==3.11.3

Setup

One of the first steps when starting any data science project is to create a virtual environment. For this project you have to create this environment from scratch yourself. However, you should be already familiar with the commands you will need to do so. The general workflow consists of...

  • setting the python version locally to 3.11.3
  • creating a virtual environment using the venv module
  • activating your newly created environment
  • upgrading pip (This step is not absolutely necessary, but will save you trouble when installing some packages.)
  • installing the required packages via pip

At the end, you want to make sure that people who are interested in your project can create an identical environment on their own computer in order to be able to run your code without running into errors. Therefore you can create a requirements file and add it to your repository. You can create such a file by running the following command:

#don't run
pip freeze > requirements.txt

Note: In rare case such a requirements file created with pip freeze might not ensure that another (especially M1 chip) user can install and execute it properly. This can happen if libraries need to be compiled (e.g. SciPy). Then it also depends on environment variables and the actual system libraries.

Unit testing (Optional)

If you write python scripts for your data processing methods, you can also write unit tests. In order to run the tests execute in terminal:

pytest

This command will execute all the functions in your project that start with the word test.

Environment

This repo contains a requirements.txt file with a list of all the packages and dependencies you will need. Before you install the virtual environment, make sure to install postgresql if you haven't done it before.

brew update
#if warning: No available formula with the name "homebrew/core/postgresql".
##brew upgrade
##brew tap homebrew/core
brew install postgresql

In order to install the environment you can use the following commands:

pyenv local 3.11.3
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

Run the echo .... .env for teh database connection details

Navigate through the EDA.ipynb Notebook:

Notebooks walks you through the Exploratory data analysis for a client who needs to sell his houses in the Seattle area.

Follow the markdowns for more information on the Questions asked and functions/methods used to clean up and analyze the data. You will realize that not all the information from this dataset was used for my analysis, but only the ones that are relevant tot the needs of the client.

The PDF file in this repo is the summary of the analysis and the recommendations for the client.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published