In situations where the goal is to predict a variable such as price
, it helps to visualize the data and figure out how the dependent variable is being influenced by other variables. The exploratory analysis gives a lot of insight which is not readily available by looking at the data. This section of the chapter will describe how to visualize and draw insights from big data.
- The head of the
dataframe
can be printed using thedataframe.head()
function which produces an output, as shown in the following screenshot:
- Similarly, the tail of the
dataframe
can be printed using thedataframe.tail()
function, which produces an output, as shown in the following screenshot:
- The
dataframe.describe()
function is used to obtain some basic statistics such as the maximum, minimum, and mean values under each column. This is illustrated in the following screenshot:
dataframe.describe() function output
- As you can observe, the dataset has 21,613 records...