Data analysis of customer default payments in Taiwan, using the Support Vector Machine (SVM) machine learning algorithm.

NicolaRizzitello/Customers-default-payments

In this project I used a Support Vector Machine to classify customers and count how many are in default and how many are not.

About Dataset

The dataset used for the analysis was downloaded from the UCI Machine Learning Repository. The original research behind it studied customer default payments in Taiwan and compared the predictive accuracy of the probability of default among six data mining methods. The dataset has 30000 rows and 24 columns:

  • Limit_bal: credit limit (numeric)
  • Sex: customer's sex (categorical)
  • Marriage: civil status (categorical)
  • Age: customer's age (numeric)
  • Pay_: past payment history from April to September 2005 (categorical)
  • Bill_AMT: bill statement amount from April to September 2005 (numeric)
  • Pay_amt: amount of past payments (numeric)
  • Default: default payment, response variable (categorical)
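
The column layout above can be illustrated with a small loading sketch. It is not the project's code: the two sample rows, the exact column names (`Pay_1`, `Bill_AMT1`, `Pay_amt1`) and the helper functions `load_rows` and `count_missing` are assumptions made for illustration. It converts numeric columns to floats, keeps categorical codes as strings, and counts missing values per column.

```python
import csv
import io

# Hypothetical two-row sample using the column names listed above.
# The monthly columns are shown for a single month only (assumption).
SAMPLE = """Limit_bal,Sex,Marriage,Age,Pay_1,Bill_AMT1,Pay_amt1,Default
20000,2,1,24,2,3913,0,1
120000,2,2,26,-1,2682,,0
"""

# Columns described as numeric in the list above.
NUMERIC = {"Limit_bal", "Age", "Bill_AMT1", "Pay_amt1"}


def load_rows(text):
    """Parse CSV text; numeric columns become floats, empty cells become None."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = {}
        for name, value in raw.items():
            if value == "":
                row[name] = None
            elif name in NUMERIC:
                row[name] = float(value)
            else:
                row[name] = value  # categorical codes stay as strings
        rows.append(row)
    return rows


def count_missing(rows):
    """Return the number of missing values per column."""
    counts = {name: 0 for name in rows[0]}
    for row in rows:
        for name, value in row.items():
            if value is None:
                counts[name] += 1
    return counts


rows = load_rows(SAMPLE)
missing = count_missing(rows)
print(missing)
```

On the real file, the same per-column count would show whether any cleaning is needed before modelling.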

About Analysis

The first step of the analysis was to check for null values and clean the dataset. Next, I balanced the observations between the training and test sets and standardized the attributes, then used nested cross-validation to estimate the generalization error and tune the hyper-parameters at the same time.
In the last step, I applied the Support Vector Machine to the dataset, split into training and test sets.
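
The steps above can be sketched with scikit-learn. This is a minimal illustration under stated assumptions, not the project's actual code: the data is synthetic (`make_classification`), "balancing" is read as a stratified train/test split, and the parameter grid and fold counts are arbitrary choices. The inner loop tunes the hyper-parameters, while the outer loop estimates the generalization error of the whole tuning procedure.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import (
    GridSearchCV,
    StratifiedKFold,
    cross_val_score,
    train_test_split,
)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the credit data (assumption): 300 rows, 8 features,
# with a minority positive class to mimic an imbalanced default label.
X, y = make_classification(
    n_samples=300, n_features=8, weights=[0.78, 0.22], random_state=0
)

# Stratified split keeps the class proportions equal in both sets
# (one possible reading of "balancing" the training and test sets).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Standardization lives inside the pipeline so it is refit on each fold.
model = make_pipeline(StandardScaler(), SVC())
param_grid = {"svc__C": [0.1, 1, 10], "svc__gamma": ["scale", 0.1]}

inner = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

# Inner loop: grid search. Outer loop: score the tuned model on held-out folds.
search = GridSearchCV(model, param_grid, cv=inner, scoring="accuracy")
nested_scores = cross_val_score(search, X_train, y_train, cv=outer)
print("nested CV accuracy:", nested_scores.mean())

# Final model: tune on the full training set, then evaluate on the test set.
search.fit(X_train, y_train)
test_accuracy = search.score(X_test, y_test)
y_pred = search.predict(X_test)

# Predicted customers not in default (class 0) and in default (class 1).
counts = np.bincount(y_pred, minlength=2)
print("test accuracy:", test_accuracy)
print("predicted not in default / in default:", counts[0], counts[1])
```

Keeping `StandardScaler` inside the pipeline avoids leaking statistics from the validation folds into training, which would make the nested estimate optimistic.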
