The materials for this chapter can be found at https://github.com/stefmolin/Hands-On-Data-Analysis-with-Pandas-2nd-edition/tree/master/ch_11. In this chapter, we will be revisiting attempted login data; however, the simulate.py script has been updated to accept additional command-line arguments. We won't be running the simulation this time, but be sure to take a look at the script, and check out the 0-simulating_the_data.ipynb notebook for the process that was followed to generate the data files and create the database for this chapter. The user_data/ directory contains the files used for this simulation, but we won't be using them directly in this chapter.
The simulated log data we will be using for this chapter can be found in the logs/ directory. The logs_2018.csv and hackers_2018.csv files are logs of login attempts and a record of hacker activity from all 2018 simulations, respectively. Files with the
hackers prefix are treated as the labeled data we...