Skip to content

Latest commit

 

History

History
49 lines (37 loc) · 1.75 KB

README.md

File metadata and controls

49 lines (37 loc) · 1.75 KB

ML Preprocessor Pipeline CLI

🤔 Why?

Data preprocessing is a required first step before any machine learning machinery can be applied, because the algorithms learn from the data and the learning outcome for problem solving heavily depends on the proper data needed to solve a particular problem – which are called features.

📄 What?

  • Given the database, the CLI provides various options to preprocess the data.
  • Options:
    • Data Description
    • Handling NULL Values
    • Encoding Categorical Data
    • Feature Scaling
    • Data Visualisation
  • You can also DOWNLOAD⬇️ the modified dataset.

🏁 Run

  1. Clone this Repo: git clone https://github.com/priyavratuniyal/ML_Preprocessor_Pipeline_CLI.git
  2. cd ML_Preprocessor_Pipeline_CLI/
  3. pip3 install -r requirements.txt
  4. Now run, python3 main.py [Dataset's Path]
    Example: python3 main.py dataset.csv

Then you will see the below home screen. 🎉


Note: You can run python3 main.py sample_data.csv, a sample data file is provided with this repo.

📷 Screenshots

Home screen

Tasks

CLI Visualisation

If you have any suggestions/questions, ping here.