Book Image

Mastering Python for Data Science

By : Samir Madhavan
Book Image

Mastering Python for Data Science

By: Samir Madhavan

Overview of this book

Table of Contents (19 chapters)
Mastering Python for Data Science
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
7
Estimating the Likelihood of Events
Index

The census income dataset


The following table is a census dataset on income created by the University of California, Irvine:

Columns

Description

age

This refers to the age of a person

work class

This refers to the type of employment a person is involved in

education

This refers to the education level of a person

marital_status

This refers to whether a person is married or not

occupation

This refers to the type of jobs a person is involved in

relationship

This refers to the type of relationship of the person

race

This refers to the ethnicity of a person

gender

This refers to the gender of a person

hours_per_week

This refers to the average hours worked per week

native_country

This refers to the country of origin

greater_than_50k

This refers to the flag that indicates whether a person is earning more than $50K in a year

Let's load this data:

>>> data = pd.read_csv('./Data/census.csv')

Let's check the fill rate of the data:

>>> data...