Book Image

Machine Learning: Make Your Own Recommender System

By : Oliver Theobald
Book Image

Machine Learning: Make Your Own Recommender System

By: Oliver Theobald

Overview of this book

With an introductory overview, the course prepares you for a deep dive into the practical application of Scikit-Learn and the datasets that bring theories to life. From the basics of machine learning to the intricate details of setting up a sandbox environment, this course covers the essential groundwork for any aspiring data scientist. The course focuses on developing your skills in working with data, implementing data reduction techniques, and understanding the intricacies of item-based and user-based collaborative filtering, along with content-based filtering. These core methodologies are crucial for creating accurate and efficient recommender systems that cater to the unique preferences of users. Practical examples and evaluations further solidify your learning, making complex concepts accessible and manageable. The course wraps up by addressing the critical topics of privacy, ethics in machine learning, and the exciting future of recommender systems. This holistic approach ensures that you not only gain technical proficiency but also consider the broader implications of your work in this field. With a final look at further resources, your journey into machine learning and recommender systems is just beginning, armed with the knowledge and tools to explore new horizons.
Table of Contents (15 chapters)
Free Chapter
1
FOREWORD
2
DATASETS USED IN THIS BOOK
3
INTRODUCING SCIKIT-LEARN
4
INTRODUCTION
5
THE ANATOMY
6
SETTING UP A SANDBOX ENVIRONMENT
7
WORKING WITH DATA
8
DATA REDUCTION
9
ITEM-BASED COLLABORATIVE FILTERING
10
USER-BASED COLLABORATIVE FILTERING
11
CONTENT-BASED FILTERING
12
EVALUATION
13
PRIVACY & ETHICS
14
THE FUTURE OF RECOMMENDER SYSTEMS
15
FURTHER RESOURCES

DATASETS USED IN THIS BOOK

 

Goodbooks-10k Datasets (Chapter 6)

These two datasets contain information about books and user ratings collected from www.goodreads.com. The first dataset contains book ratings from individual users, while the second dataset contains information about individual books such as their average rating, number of five-star ratings, ISBN number, author, etc.

https://www.kaggle.com/sriharshavogeti/collaborative-recommender-system-on-goodreads/data

 

Advertising Dataset (Chapter 7)

This dataset contains fabricated information about the features of users responding to online advertisements, including their gender, age, location, daily time spent online, and whether they clicked on the advertisement. The dataset was created by Udemy course instructor Jose Portilla of Pierian Data and is used in his course Python for Data Science and Machine Learning Bootcamp.

https://scatterplotpress.com/p/datasets

 

Melbourne Housing Market (Chapter 8)

This...