ML Preprocessor Pipeline CLI

🤔 Why?

Data preprocessing is a required first step before any machine learning algorithm can be applied. The algorithms learn from the data, and the quality of what they learn depends heavily on having the right input variables, called features, for the problem at hand.

📄 What?

  • Given a dataset (for example, a CSV file), the CLI provides various options to preprocess the data.
  • Options:
    • Data Description
    • Handling NULL Values
    • Encoding Categorical Data
    • Feature Scaling
    • Data Visualisation
  • You can also DOWNLOAD⬇️ the modified dataset.
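
To make the options above concrete, here is a minimal plain-Python sketch of three of them: handling NULL values, encoding categorical data, and feature scaling. It is not the tool's implementation. The toy rows, the column names, and the helper functions `fill_nulls_with_mean`, `one_hot_encode`, and `min_max_scale` are all hypothetical, and mean imputation, one-hot encoding, and min-max scaling are just one common choice for each step.

```python
from statistics import mean

# Hypothetical toy dataset; None marks a missing (NULL) value.
rows = [
    {"age": 20.0, "city": "Pune"},
    {"age": None, "city": "Delhi"},
    {"age": 40.0, "city": "Pune"},
]


def fill_nulls_with_mean(rows, column):
    """Replace None in a numeric column with the mean of the known values."""
    known = [r[column] for r in rows if r[column] is not None]
    fill = mean(known)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def one_hot_encode(rows, column):
    """Replace a categorical column with one 0/1 column per category."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new_row[f"{column}_{c}"] = 1 if r[column] == c else 0
        encoded.append(new_row)
    return encoded


def min_max_scale(rows, column):
    """Rescale a numeric column linearly to the range [0, 1]."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid division by zero for a constant column
    return [{**r, column: (r[column] - lo) / span} for r in rows]


# Apply the three steps in order: impute, encode, then scale.
cleaned = min_max_scale(
    one_hot_encode(fill_nulls_with_mean(rows, "age"), "city"), "age"
)
for row in cleaned:
    print(row)
```

The order matters in this sketch: missing values are filled before scaling so that the minimum and maximum are computed over a complete column.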

🏁 Run

  1. Clone this repo: git clone https://github.com/priyavratuniyal/ML_Preprocessor_Pipeline_CLI.git
  2. Change into the project directory: cd ML_Preprocessor_Pipeline_CLI/
  3. Install the dependencies: pip3 install -r requirements.txt
  4. Run the CLI with the path to your dataset: python3 main.py [Dataset's Path]
    Example: python3 main.py dataset.csv

You will then see the home screen. 🎉

Note: A sample data file is provided with this repo, so you can also run python3 main.py sample_data.csv.
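
The run command passes the dataset path as a single positional argument. As a rough illustration of that pattern only, a script could read the path and load the rows as below. This is an assumption about how such a CLI might be wired, not a copy of main.py; the functions `parse_args`, `load_rows`, and `main` are hypothetical names.

```python
import argparse
import csv


def parse_args(argv=None):
    """Parse one positional argument: the path to the dataset file."""
    parser = argparse.ArgumentParser(description="Load a dataset for preprocessing.")
    parser.add_argument("dataset", help="path to the CSV file to preprocess")
    return parser.parse_args(argv)


def load_rows(path):
    """Read the CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle))


def main(argv=None):
    """Load the dataset named on the command line and report its size."""
    args = parse_args(argv)
    rows = load_rows(args.dataset)
    print(f"Loaded {len(rows)} rows from {args.dataset}")
    return rows


# Example call with an explicit argument list instead of sys.argv:
# main(["sample_data.csv"])
```

Note that `csv.DictReader` returns every value as a string, so numeric columns would still need converting before any scaling step.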

📷 Screenshots

Home screen

Tasks

CLI Visualisation

If you have any suggestions/questions, ping here.

About

An app to automate structured data preprocessing
