Book Image

Mastering Unsupervised Learning with Python [Video]

By : Stefan Jansen
Book Image

Mastering Unsupervised Learning with Python [Video]

By: Stefan Jansen

Overview of this book

<p>In this video course you will understand the assumptions, advantages, and disadvantages of various popular clustering algorithms, and then learn how to apply them to different data sets for analysis. You will apply the Latent Dirichlet Allocation algorithm to model topics, which you can use as an input for a recommendation engine just like the New York Times did. You will be using cutting-edge, nonlinear dimensionality techniques (also called manifold learning)—such as T-SNE and UMAP—and autoencoders (unsupervised deep learning) to assess and visualize the information content in a higher dimension. You will be looking at K-Means, density-based clustering, and Gaussian mixture models. You will see hierarchical clustering through bottom-up and top-down strategies. You will go from preprocessing text to recommending interesting articles. Through this course, you will learn and apply concepts needed to ensure your mastery of unsupervised algorithms in Python.</p> <p>By the end of this course, you will have mastered the application of Unsupervised Learning techniques and will be able to utilize them in your Data Science workflow—for instance, to extract more informative features for Supervised Learning problems. You will be able not only to interpret results but also to enhance them.</p> <p>After having taken this course, you will have mastered the application of Unsupervised Learning with Python. All the code and supporting files for this course are available on Github at <a href="https://github.com/PacktPublishing/Mastering-Unsupervised-learning-with-Python" target="_blank">https://github.com/PacktPublishing/Mastering-Unsupervised-learning-with-Python</a></p> <h2>Style and Approach</h2> <p>An exhaustive course packed with step-by-step instructions, working examples, and helpful advice. This course is divided into clear chunks, so you can learn at your own pace and focus on your area of interest.</p>
Table of Contents (3 chapters)
Chapter 2
Topic Modeling: Semantic Content Recommendations
Content Locked
Section 6
Topic Modeling: Running the Models – Part 2
In this video, let’s understand the (probabilistic) Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) and explore how to evaluate topic quality. - Use sklearn’s truncated SVD, LSA and LDA classes - Evaluate topic quality using word clouds and topic coherence