Book Image

Python Machine Learning Workbook for Beginners

By : AI Sciences
Book Image

Python Machine Learning Workbook for Beginners

By: AI Sciences

Overview of this book

<p>Machine Learning (ML) is the lifeblood of businesses worldwide. ML tools empower organizations to identify profitable opportunities fast and help them to better understand potential risks. The ever-expanding data, cost-effective data storage, and competitively priced powerful processing continue to drive the growth of ML. </p><p> </p><p>This is the best time you could enter the exciting machine learning universe. Industries are reinventing themselves constantly by developing more advanced data analysis models. These models analyze larger and more complex data than ever while delivering instantaneous and more accurate results on enormous scales. </p><p>In this backdrop, it is evident that hands-on practice is everything in machine learning. Tons of theory will amount to nothing if you don’t have enough hands-on practice. Textbooks and online classes mislead you into a false sense of mastery. The easy availability of learning resources tricks you and you become overconfident. But when you try to apply the theoretical concepts you have learned, you realize it’s not that simple. </p><p> </p><p>This is where projects play a crucial role in your learning journey. Projects are doubtless the best investment of your time. You’ll not only enjoy learning but you’ll also make quick progress. And unlike studying boring theoretical concepts, you’ll find that working on projects is easier to stay motivated. </p><p> </p><p>The projects in this book cover ten different interesting topics. Each project will help you refine your ML skills and apply them in the real world. These projects also present you with an opportunity to enrich your portfolio, making it simpler to find a great job, explore interesting career paths, and even negotiate a higher pay package. Overall, this learning-by-doing book will help you accomplish your machine learning career goals faster. </p><p> </p><p>The code bundle for this course is available at https://www.aispublishing.net/ai-sciences-book</p>
Table of Contents (15 chapters)
1
About the Author

7.5. Data Visualization

Data visualization is an important step and can reveal important information about the data.

Let’s first group the dataset by title and see what information we can get regarding the ratings of movies. Execute the following script.

Script 8:

1. merged_movie_df.groupby(‘title’).describe()

Output:

image

The output above shows the userId, movieId, and rating columns grouped together with respect to the title column. The describe() method further shows the information as mean, min, max, and standard deviation values for userId, movieId, and rating columns. We are only interested in the ratings column. To extract the mean of ratings grouped by title, you can use the following script.

Script 9:

1. merged_movie_df.groupby(‘title’)[‘rating’].mean().head()

The output below shows that the first two movies got an average rating of 4.0 each, while the third and fourth movies have average ratings of 3 and 5, respectively.

Output...