Boolean selection for Series and DataFrame objects is virtually identical. Both work by passing a Series of booleans indexed identically to the object being filtered to the indexing operator.
This recipe constructs two complex and independent boolean criteria for different sets of movies. The first set of movies comes from the previous recipe and consists of those with an imdb_score
greater than 8, a content_rating
of PG-13, and a title_year
either before 2000 or after 2009. The second set of movies consists of those with imdb_score
less than 5, a content_rating
of R, and a title_year
between 2000 and 2010.
- Read in the
movie
dataset, set the index to themovie_title
, and create the first set of criteria:
>>> movie = pd.read_csv('data/movie.csv', index_col='movie_title') >>> crit_a1 = movie.imdb_score > 8 >>> crit_a2 = movie.content_rating == 'PG-13' >>> crit_a3 = (movie.title_year < 2000) ...