Apply the ML-Methods Random Forest, AdaBoost, and Gradient Boosting on real data, and compare its performance. More specifically, the challenge is to predict the survival of the Titanic passengers based on their individual attributes.
The Titanic data set contains information about the passengers of the Titanic, which sank on 1912, including whether they survived or not.
Variable | Definition | Key |
---|---|---|
survival | Survival | 0 = No, 1 = Yes |
pclass | Ticket class | 1 = 1st, 2 = 2nd, 3 = 3rd |
sex | Sex | |
Age | Age in years | |
sibsp | # of siblings / spouses aboard the Titanic | |
parch | # of parents / children aboard the Titanic | |
ticket | Ticket number | |
fare | Passenger fare | |
cabin | Cabin number | |
embarked | Port of Embarkation | C = Cherbourg, Q = Queenstown, S = Southampton |