
The main goal of this project is to build a machine learning model that predicts whether a patient is at high risk.


ArpanSurin/Covid-19-Data-Analysis


Covid 19 Data Analysis

This repository contains a Python-based implementation of a data preprocessing pipeline. The pipeline performs the preprocessing tasks that machine learning workflows require: handling missing data, encoding categorical data, splitting the dataset into training and testing sets, and feature scaling.

Features

  1. Data Preprocessing

    • Handling missing data.
    • Encoding categorical data (both independent and dependent variables).
    • Splitting data into training and testing sets.
    • Feature scaling (standardization).
  2. User-Friendly Interface

    • Easily adaptable to different datasets.
    • Modular code structure for each preprocessing step.
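The four preprocessing steps listed above could be organized as one small function per step, roughly as sketched below. This is an illustration only, not the code in model.ipynb: the function names, the toy columns (`age`, `sex`, `high_risk`), mean imputation, and the use of scikit-learn are all assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler


def fill_missing(df, numeric_cols):
    """Replace missing numeric values with the column mean (assumed strategy)."""
    out = df.copy()
    imputer = SimpleImputer(strategy="mean")
    out[numeric_cols] = imputer.fit_transform(out[numeric_cols])
    return out


def encode_features(df, categorical_cols):
    """One-hot encode the categorical independent variables."""
    return pd.get_dummies(df, columns=categorical_cols, dtype=float)


def encode_target(y):
    """Map the class labels of the dependent variable to integers."""
    return LabelEncoder().fit_transform(y)


def split_and_scale(X, y, test_size=0.2, seed=0):
    """Split first, then fit the scaler on the training rows only."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=seed
    )
    scaler = StandardScaler()
    X_train = scaler.fit_transform(X_train)  # learn mean and std from train
    X_test = scaler.transform(X_test)        # reuse the train statistics
    return X_train, X_test, y_train, y_test


# Toy data with hypothetical columns, standing in for Covid_Data.csv.
toy = pd.DataFrame({
    "age": [34, np.nan, 71, 58, 45, 29, 80, 63, 52, 38],
    "sex": ["F", "M", "M", "F", "F", "M", "F", "M", "M", "F"],
    "high_risk": ["no", "no", "yes", "yes", "no", "no", "yes", "yes", "no", "no"],
})

features = fill_missing(toy.drop(columns="high_risk"), ["age"])
features = encode_features(features, ["sex"])
target = encode_target(toy["high_risk"])
X_train, X_test, y_train, y_test = split_and_scale(features, target)
```

Fitting the scaler after the split keeps test-set statistics out of the training data. Adapting the sketch to another dataset would mostly mean changing the column lists passed to each function.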

Dataset

The notebook reads the Covid_Data.csv file, which contains anonymized COVID-related data. Ensure the dataset is present in the working directory before running it.
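One way to make the working-directory requirement explicit is to check for the file before reading it. The helper below is a minimal sketch; `load_dataset` is a hypothetical name, and it assumes the file is a standard comma-separated CSV that pandas can read with default settings.

```python
from pathlib import Path

import pandas as pd


def load_dataset(path="Covid_Data.csv"):
    """Read the CSV, failing early with a clear message if it is absent."""
    csv_path = Path(path)
    if not csv_path.is_file():
        raise FileNotFoundError(
            f"{csv_path.name} not found in {csv_path.resolve().parent}"
        )
    return pd.read_csv(csv_path)
```

Calling `load_dataset()` from the directory that holds the dataset returns a DataFrame. From any other directory it raises `FileNotFoundError` naming the folder that was searched.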

File Structure

|-- Covid_Data.csv
|-- Cleaned_Covid_Data.csv
|-- model.ipynb
|-- README.md

License

This project is licensed under the MIT License.

Author

Created by Arpan Surin. Feel free to contact me for any questions or suggestions!
