To perform the data analysis, we'll be using the Titanic dataset from Kaggle.
This dataset is simple to understand, and no domain knowledge is needed to derive insights from it.
It contains the details of each passenger on the Titanic, including whether they survived.
The following are the field descriptions:
| Field | Description |
|---|---|
| | Survival( |
| | Passenger class( |
| | Name of the passenger |
| | Gender of the passenger |
| | Age of the passenger |
| | Number of siblings/spouses aboard |
| | Number of parents/children aboard |
| | Ticket number |
| | Passenger fare |
| | Cabin |
| | Port of embarkation ( |
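
Before analyzing the data, it can help to check which fields are present and how complete they are. The following is a minimal sketch using only Python's standard library. The helper name `describe_fields` is hypothetical, the column headers (`Survived`, `Pclass`, and so on) are assumed from the way Kaggle usually distributes this dataset, and the three sample rows are made up for illustration; they are not real passenger records.

```python
import csv
import io


def describe_fields(csv_file):
    """Return (column names, row count, missing-value count per column)."""
    reader = csv.DictReader(csv_file)
    columns = list(reader.fieldnames or [])
    missing = {name: 0 for name in columns}
    row_count = 0
    for row in reader:
        row_count += 1
        for name in columns:
            # An empty or absent cell is treated as a missing value.
            if not (row.get(name) or "").strip():
                missing[name] += 1
    return columns, row_count, missing


# Made-up toy rows (NOT real passenger records). The header names are an
# assumption based on Kaggle's usual train.csv layout.
SAMPLE = """Survived,Pclass,Name,Sex,Age,SibSp,Parch,Ticket,Fare,Cabin,Embarked
0,3,Passenger A,male,22,1,0,T-001,7.25,,S
1,1,Passenger B,female,38,1,0,T-002,71.28,C85,C
1,3,Passenger C,female,,0,0,T-003,7.92,,S
"""

columns, row_count, missing = describe_fields(io.StringIO(SAMPLE))
print(columns)
print(row_count, "rows; missing Age:", missing["Age"],
      "; missing Cabin:", missing["Cabin"])
```

To run the same check on the downloaded data, pass an open file object instead of the in-memory sample, for example one opened with `open(path, newline="")`, where `path` points to wherever you saved the Kaggle CSV file.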
Since the data is simple to understand, we'll use survival as the main theme of the analysis and attach questions to this theme.
These are the questions that we...