Practical Machine Learning

By: Sunila Gollapudi

Overview of this book

This book explores an extensive range of machine learning techniques, uncovering hidden tricks and tips for several types of data using practical, real-world examples. While machine learning can be highly theoretical, this book offers a refreshing hands-on approach without losing sight of the underlying principles. Inside, a full exploration of the various algorithms gives you high-quality guidance so you can begin to see just how effective machine learning is at tackling the contemporary challenges of big data.

This is the only book you need to implement a whole suite of open source tools, frameworks, and languages in machine learning. We will cover the leading data science languages, Python and R, and the underrated but powerful Julia, as well as a range of big data platforms including Spark, Hadoop, and Mahout. Practical Machine Learning is an essential resource for modern data scientists who want to get to grips with the real-world application of machine learning. With this book, you will not only learn the fundamentals of machine learning but also dive deep into the complexities of real-world data before moving on to using Hadoop and its wider ecosystem of tools to process and manage your structured and unstructured data.

You will explore different machine learning techniques for both supervised and unsupervised learning; from decision trees to Naïve Bayes classifiers and linear and clustering methods, you will learn strategies for a truly advanced approach to the statistical analysis of data. The book also explores cutting-edge advancements in machine learning, with worked examples and guidance on deep learning and reinforcement learning, providing you with practical demonstrations and samples that help take the theory (and the mystery) out of even the most advanced machine learning methodologies.

Hadoop 2.x

Before Hadoop 2.x, all the distributions focused on addressing the limitations of Hadoop 1.x but did not deviate from its core architecture. Hadoop 2.x changed the underlying architectural assumptions and turned out to be a real breakthrough, most importantly through the introduction of YARN (Yet Another Resource Negotiator). YARN is a framework for managing a Hadoop cluster, and it introduced the ability to handle real-time processing needs in addition to batch processing. Some of the important issues that Hadoop 2.x addressed are as follows:

Single NameNode issues (the NameNode as a single point of failure)
Support for a dramatic increase in the number of nodes in the cluster
An extension of the range of tasks that can be successfully addressed with Hadoop
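
To make the last point more concrete, the following is a minimal toy sketch of the idea that a general-purpose resource manager can hand out containers to any kind of application, batch or otherwise. It is not the YARN API: the class names (`Node`, `Container`, `ToyResourceManager`), the memory-only resource model, and the first-fit placement policy are all assumptions made for illustration, and real YARN schedulers behave differently.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """A worker machine with a fixed amount of memory (hypothetical model)."""
    name: str
    memory_mb: int
    used_mb: int = 0

    def free_mb(self) -> int:
        return self.memory_mb - self.used_mb


@dataclass
class Container:
    """A slice of one node's memory granted to one application."""
    node: str
    memory_mb: int
    app: str


class ToyResourceManager:
    """Cluster-wide arbiter that knows nothing about the workload type.

    Assumption: first-fit placement on memory only. This is a teaching
    simplification, not how a real YARN scheduler decides.
    """

    def __init__(self, nodes: List[Node]) -> None:
        self.nodes = list(nodes)
        self.containers: List[Container] = []

    def allocate(self, app: str, memory_mb: int) -> Optional[Container]:
        # Grant a container on the first node with enough free memory.
        for node in self.nodes:
            if node.free_mb() >= memory_mb:
                node.used_mb += memory_mb
                container = Container(node.name, memory_mb, app)
                self.containers.append(container)
                return container
        return None  # no node can satisfy the request right now


rm = ToyResourceManager([Node("node1", 4096), Node("node2", 4096)])

# A batch (MapReduce-style) job and a streaming job share the same cluster.
batch = [rm.allocate("mapreduce-job", 2048) for _ in range(3)]
stream = rm.allocate("streaming-job", 1024)

for c in batch + [stream]:
    print(c.app, "->", c.node, c.memory_mb, "MB")
```

In this sketch the resource manager only tracks capacity. What runs inside a container is up to the application, which is the property that allows workloads other than batch MapReduce to share the same cluster.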

The following figure depicts the difference between the Hadoop 1.x and 2.x architectures and how YARN wires MapReduce and HDFS:

Hadoop ecosystem components

Hadoop has spawned a number of auxiliary and supporting frameworks. The following figure depicts the gamut of supporting frameworks contributed by the open source developer groups...
