This project focuses on predicting the likelihood of diabetes in patients based on various health factors like glucose, blood pressure, BMI, etc. The goal is to build a classification model to predict the presence of diabetes using machine learning.
This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.
The dataset includes various medical predictor variables and one target variable, "Outcome." The predictor variables encompass essential health metrics, such as the number of pregnancies, plasma glucose concentration, diastolic blood pressure, triceps skinfold thickness, insulin levels, body mass index (BMI), diabetes pedigree function, and age.
-
Pregnancies: Number of times pregnant
-
Glucose: Plasma glucose concentration at 2 hours in an oral glucose tolerance test
-
BloodPressure: Diastolic blood pressure (mm Hg)
-
SkinThickness: Triceps skinfold thickness (mm)
-
Insulin: 2-Hour serum insulin (mu U/ml)
-
BMI: Body mass index (weight in kg/(height in m)^2)
-
DiabetesPedigreeFunction: Diabetes pedigree function
-
Age: Age in years
Outcome: Class variable (0 or 1) 268 instances are labeled as 1 (indicating diabetes), while others are labeled as 0.
This dataset provides a valuable opportunity to develop a predictive model for diabetes based on demographic and health-related features
Health_Care_Prediction_for_Diabetic_Patients.ipynb
: The main Python script for data preprocessing, model training, and evaluation.data/health care diabetes.csv
: The dataset used for training the model.requirements.txt
: The list of Python libraries required to run the project.
- Clone this repository:
git clone https://github.com/22AIML021VishalMehta/Diabetes-Prediction.git
- Navigate to the project folder:
cd Diabetes-Prediction
- Install the required dependencies:
pip install -r requirements.txt
- Run the model script:
The model predicts whether a person is diabetic based on the given input features. The results are evaluated based on accuracy, precision, recall, and F1-score.
This project is licensed under the MIT License - see the LICENSE
file for details.
- Clarity: It helps users or collaborators understand the steps to quickly get started with the project.
- Reusability: It ensures that anyone who clones your project can replicate the environment and run the model without confusion.
If you have any specific commands or configurations related to running the model (e.g., setting up environment variables or using a particular Python version), those can be added as well.
Let me know if you'd like more help with that or if there's anything else!