In this chapter, we will use a public dataset extracted from the World Bank website, http://data.worldbank.org/. We have pulled out the following details for all countries. Where a value is not available for a country, the field is left blank.
The extracted dataset is provided to you as a CSV file named worlddata. We will use it to learn the concepts of clustering:
Label | Description |
---|---|
electricity_access | The percentage of the population with access to electricity |
co2_emissions | Carbon dioxide emissions |
mortality_rateper1000 | The mortality rate per thousand |
export_percent_to_gdp | Exports as a percentage of GDP |
alternative_and_nuclearenergy_percent_total | Alternative and nuclear energy as a percentage of total energy use |
forest_area_percent | Forest area as a percentage of land area |
net_migration | Net migration |
male_unemployment | The male unemployment rate |
air_transport | Air traffic and registered carriers departure... |
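
Because blank fields mark missing values, it helps to check how many gaps each column has before clustering. The following is a minimal sketch in Python using only the standard library. It assumes the CSV header uses the labels from the table above plus a `country` column, which is a guess about the file layout. The sample rows are made-up placeholders, not real World Bank figures, and the helper names (`parse_value`, `load_rows`, `count_missing`) are hypothetical.

```python
import csv
import io

# Column labels taken from the table above.
COLUMNS = [
    "electricity_access",
    "co2_emissions",
    "mortality_rateper1000",
    "export_percent_to_gdp",
    "alternative_and_nuclearenergy_percent_total",
    "forest_area_percent",
    "net_migration",
    "male_unemployment",
    "air_transport",
]


def parse_value(text):
    """Convert a CSV cell to float; a blank cell becomes None (missing)."""
    text = text.strip()
    return float(text) if text else None


def load_rows(stream):
    """Read rows from a CSV stream, turning blank cells into None."""
    reader = csv.DictReader(stream)
    return [
        {name: parse_value(record.get(name) or "") for name in COLUMNS}
        for record in reader
    ]


def count_missing(rows):
    """Count the missing values in each column."""
    return {name: sum(row[name] is None for row in rows) for name in COLUMNS}


# Made-up placeholder data for illustration only (assumed layout).
header = ",".join(["country"] + COLUMNS)
SAMPLE = io.StringIO(
    header + "\n"
    "A,100,5.0,10,20,3,30,1000,4,500\n"
    "B,,1.2,40,,0,12,-200,,\n"
)

rows = load_rows(SAMPLE)
missing = count_missing(rows)
for name, n in missing.items():
    print(f"{name}: {n} missing")

# To read the real file instead, pass an open file handle, for example
# open("worlddata.csv", newline=""); the ".csv" extension is an assumption.
```

Keeping missing values as `None` rather than zero avoids treating an unreported figure as a real measurement; how to fill or drop those gaps is a separate decision to make before clustering.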