Machine Learning in Java - Second Edition

By: AshishSingh Bhatia, Bostjan Kaluza

Overview of this book

As the amount of data in the world continues to grow at an almost incomprehensible rate, being able to understand and process data is becoming a key differentiator for competitive organizations. Machine learning applications are everywhere, from self-driving cars, spam detection, document search, and trading strategies to speech recognition. This makes machine learning well suited to the present-day era of big data and data science. The main challenge is how to transform data into actionable knowledge. Machine Learning in Java will provide you with the techniques and tools you need. You will start by learning how to apply machine learning methods to a variety of common tasks, including classification, prediction, forecasting, market basket analysis, and clustering. The code in this book works with JDK 8 and above and has been tested on JDK 11. Moving on, you will discover how to detect anomalies and fraud, and how to perform activity recognition, image recognition, and text analysis. By the end of the book, you will have explored related web resources and technologies that will help you take your learning to the next level. By applying the most effective machine learning methods to real-world problems, you will gain hands-on experience that will transform the way you think about data.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Applied Machine Learning Quick Start

Machine learning and data science

Data and problem definition

Data collection

Data preprocessing

Unsupervised learning

Supervised learning

Generalization and evaluation

Summary

Java Libraries and Platforms for Machine Learning

The need for Java

Machine learning libraries

Building a machine learning application

Summary

Basic Algorithms - Classification, Regression, and Clustering

Summary

Customer Relationship Prediction with Ensembles

The customer relationship database

Basic Naive Bayes classifier baseline

Basic modeling

Advanced modeling with ensembles

Summary

Affinity Analysis

Market basket analysis

Association rule learning

The supermarket dataset

Discover patterns

Other applications in various areas

Summary

Recommendation Engines with Apache Mahout

Basic concepts

Getting Apache Mahout

Building a recommendation engine

Content-based filtering

Summary

Fraud and Anomaly Detection

Suspicious and anomalous behavior detection

Suspicious pattern detection

Anomalous pattern detection

Outlier detection using ELKI

Fraud detection in insurance claims

Anomaly detection in website traffic

Summary

Image Recognition with Deeplearning4j

Introducing image recognition

Image classification

Summary

Activity Recognition with Mobile Phone Sensors

Introducing activity recognition

Collecting data from a mobile phone

Building a classifier

Summary

Text Mining with Mallet - Topic Modeling and Spam Detection

Introducing text mining

Installing Mallet

Working with text data

Topic modeling for BBC News

Detecting email spam

Summary

What Is Next?

Machine learning in real life

Standards and markup languages

Machine learning in the cloud

Web resources and competitions

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Topic modeling for BBC News

As discussed earlier, the goal of topic modeling is to identify patterns in a text corpus that correspond to document topics. In this example, we will use a dataset originating from BBC News. This dataset is one of the standard benchmarks in machine-learning research, and is available for non-commercial and research purposes.

The goal is to build a classifier that is able to assign a topic to an uncategorized document.
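
To make this goal concrete, the following is a minimal sketch in plain Java of the final step only: given a set of topics, pick the one that best matches a new document. It does not use Mallet and does not learn topics from the BBC dataset; the class name TopicAssigner, the three topic labels, and their keyword lists are all hypothetical and chosen purely for illustration. A real topic model learns weighted word distributions from the corpus instead of relying on hand-written keywords.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Toy illustration only: the topics and keywords below are invented,
// not learned from the BBC dataset and not produced by Mallet.
class TopicAssigner {

    // Hypothetical topics, each described by a handful of keywords.
    static Map<String, List<String>> exampleTopics() {
        Map<String, List<String>> topics = new LinkedHashMap<>();
        topics.put("sport", Arrays.asList("match", "team", "goal"));
        topics.put("business", Arrays.asList("market", "shares", "profit"));
        topics.put("tech", Arrays.asList("software", "mobile", "internet"));
        return topics;
    }

    // Count how many tokens of the text appear in the topic's keyword list.
    static int score(String text, List<String> keywords) {
        int hits = 0;
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (keywords.contains(token)) {
                hits++;
            }
        }
        return hits;
    }

    // Return the best-scoring topic, or "unknown" if no keyword matched.
    static String assign(String text, Map<String, List<String>> topics) {
        String best = "unknown";
        int bestScore = 0;
        for (Map.Entry<String, List<String>> entry : topics.entrySet()) {
            int s = score(text, entry.getValue());
            if (s > bestScore) {
                bestScore = s;
                best = entry.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        String doc = "The team scored a late goal to win the match";
        System.out.println(assign(doc, exampleTopics())); // prints "sport"
    }
}
```

The sketch returns "unknown" when no keyword matches, which mirrors the practical need to handle documents that fit none of the known topics.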

BBC dataset

In 2006, Greene and Cunningham collected the BBC dataset to study a particular document-clustering challenge using support vector machines. The dataset consists of 2,225 documents from the BBC News website from 2004 to 2005, corresponding to the stories collected...
