Dancing with Python

By: Robert S. Sutor

Overview of this book

Dancing with Python helps you learn Python and quantum computing in a practical way. It will help you explore how to work with numbers, strings, collections, iterators, and files. The book goes beyond functions and classes and teaches you to use Python and Qiskit to create gates and circuits for classical and quantum computing. Learn how quantum extends traditional techniques using the Grover Search Algorithm and the code that implements it. Dive into some advanced and widely used applications of Python and revisit strings with more sophisticated tools, such as regular expressions and basic natural language processing (NLP). The final chapters introduce you to data analysis, visualizations, and supervised and unsupervised machine learning. By the end of the book, you will be proficient in programming the latest and most powerful quantum computers, the Pythonic way.

14.4 Data cleaning

Cleaning data is an important topic, and authors have written dozens of books, chapters, and papers on the subject. [CLD] What do you do when data is wrong or missing?

I was surprised when I first looked at the cats DataFrame and discovered that the Gender column had three codes: F, M, and U. Presumably, the last stands for “unknown.”

df["Gender"].value_counts()
F    1863
M    1616
U       6
Name: Gender, dtype: int64
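
A tally like the one above can be reproduced on any column. The following is a minimal sketch, assuming a small made-up DataFrame called sample that stands in for the cats data; its rows and the expected set of codes are assumptions for illustration, not values from the real data set. Passing dropna=False asks value_counts to report missing entries too, which it leaves out by default.

```python
import pandas as pd

# Hypothetical stand-in for the cats DataFrame. The values are invented
# for illustration and are not rows from the real data set.
sample = pd.DataFrame(
    {
        "Breed": ["DOMSH", "DOMSH", "DOM", "DOMSH", "DOMLH"],
        "Gender": ["F", "M", "U", "F", None],
    }
)

# Count each distinct code. dropna=False also counts missing values.
counts = sample["Gender"].value_counts(dropna=False)
print(counts)

# Assumption: only F and M are "expected" codes. Anything else that is
# not simply missing is a candidate for cleaning.
expected = {"F", "M"}
unexpected = sorted(
    code for code in counts.index
    if pd.notna(code) and code not in expected
)
print(unexpected)
```

Listing the unexpected codes separately from the missing ones keeps the two cleaning questions apart: what to do with a value that is present but suspicious, and what to do with a value that is absent.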

We index the DataFrame with a Boolean condition to see only the rows with U for gender:

df[df["Gender"] == "U"]
              Locality  Postcode  Breed  Colour Gender
259    DANDENONG NORTH      3175    DOM  UNKNOW      U
611         SPRINGVALE      3171  DOMSH   WHITE      U
690   NOBLE PARK NORTH      3174  DOMSH  SILTAB      U
1273        NOBLE PARK      3174  DOMSH     TAB      U
1697       KEYSBOROUGH      3173  DOMSH  ...
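
The filter works because the comparison produces a Boolean Series, and indexing with that Series keeps only the rows marked True. Here is a minimal sketch of the same idea, followed by two possible ways to treat the unknown codes. The DataFrame sample and its rows are invented for illustration, and neither option is presented as the treatment this book applies to the cats data.

```python
import pandas as pd

# Hypothetical stand-in for the cats DataFrame with invented rows.
sample = pd.DataFrame(
    {
        "Locality": ["NOBLE PARK", "SPRINGVALE", "KEYSBOROUGH", "DANDENONG"],
        "Gender": ["F", "U", "M", "U"],
    }
)

# The comparison yields one True/False value per row.
is_unknown = sample["Gender"] == "U"

# Indexing with the mask keeps only the rows where it is True.
unknown_rows = sample[is_unknown]
print(unknown_rows)

# Option 1: drop the rows with an unknown gender (~ inverts the mask).
known_only = sample[~is_unknown]

# Option 2: keep every row but mark the unknown codes as missing.
# Series.mask replaces values with NaN wherever the condition is True.
marked = sample.assign(Gender=sample["Gender"].mask(is_unknown))
print(marked)
```

Dropping rows is simpler but discards the other columns for those cats. Marking the value as missing keeps the rows available for analyses that do not involve gender.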