This project focuses on building a machine learning model for detecting fraudulent activities in credit card transactions.
- Introduction
- Project Structure
- Data Collection
- Data Preprocessing
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Additional Visualizations
- Results
- Next Steps
This project aims to develop a machine learning model for detecting fraudulent activities in credit card transactions. The model is trained on the Credit Card Fraud Detection dataset from Kaggle.
The project is structured as follows:
notebooks/
: Jupyter notebooks for different stages of the project.src/
: Python scripts for reusable code.
- Dataset: Credit Card Fraud Detection
- Source: Kaggle
- Handled missing values.
- Normalized numerical features.
- No encoding for categorical variables in this dataset.
- Explored feature distributions, correlations, and outliers.
- Created a new feature, 'Hour,' representing the hour of the day for each transaction.
- Learning Curves.
- SHAP Values or Feature Importance Plot.
- Testing the Model - Sample Input and Predicted Output Comparison.
- Precision: 0.97
- Recall: 0.79
- F1-score: 0.87
- Accuracy: 0.80
- Fine-tune the model further.
- Implement continuous monitoring.
- Evaluate the model on new, unseen data.
- Enhance model explainability with additional visualizations.