
The widely considered “unsinkable” RMS Titanic sank after colliding with an iceberg.
Unfortunately, there weren't enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224
passengers and crew.
While there was some element of luck involved in surviving,
it seems some groups of people were more likely to survive than others.
...
Now we want to know: what sorts of people were more likely to survive?
Our goal is to predict the survival rate (classification: death or alive).
In order to achieve this, we will:
This dataset contains the information of all passengers
A quick way to visualize correlation between features is by using a Heatmap
We split our data into two sets: train set and test set (80% / 20%)
Let's take a look at the histograms of Survived and Pclass...
... and get the counts of male/female
We still have to normalize our data!
We apply the Random Forest Algorithm
For this we separated our datasets into training set and testing set earlier.
If we want to apply cross-validation, we also need to create a validation set

Let's review our process
Assessment: We are ready to move to deployment.
This website is our deployment. We hope you enjoyed it :)
seffs