The DataFrame class
The DataFrame class is used to represent two-dimensional data. To illustrate its use, let's create a DataFrame object containing student data as follows:
grades = DataFrame(
    [['Alice', 80., 92., 84],
     ['Bob', 78., NaN, 86],
     ['Samaly', 75., 78., 88.]],
    index=[17005, 17035, 17028],
    columns=['Name', 'Test 1', 'Test 2', 'Final']
)
This code demonstrates one of the most straightforward ways to construct a DataFrame object. The data can be specified as any two-dimensional Python data structure, such as a list of lists (as shown in the example) or a NumPy array. The index option sets the row names, which here are integers representing student IDs. Likewise, the columns option sets the column names. Both the index and columns arguments can be given as any one-dimensional Python structure, such as a list, a NumPy array, or a Series object.
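As a minimal sketch of the alternatives just mentioned, the following builds a DataFrame from a two-dimensional NumPy array, with the row labels given as a NumPy array and the column labels given as a Series. The names scores, student_ids, column_names, and scores_df are hypothetical and chosen only for illustration, and the explicit pandas and numpy imports are an assumption about the environment rather than part of the example above.

```python
import numpy as np
import pandas as pd

# Hypothetical numeric scores: one row per student, one column per assessment.
scores = np.array([[80., 92., 84.],
                   [78., np.nan, 86.],
                   [75., 78., 88.]])

# Row labels supplied as a NumPy array, column labels as a Series.
student_ids = np.array([17005, 17035, 17028])
column_names = pd.Series(['Test 1', 'Test 2', 'Final'])

scores_df = pd.DataFrame(scores, index=student_ids, columns=column_names)

# Inspect the resulting dimensions and labels.
print(scores_df.shape)
print(list(scores_df.index))
print(list(scores_df.columns))
```

Only the numeric columns are used here, so the underlying array keeps a single floating-point type; mixing in the student names would force NumPy to store every entry as a generic object or string.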
To display the DataFrame object, run the following statement in a cell:
grades...