Introducing machine learning applications
Machine learning, predictive analytics, and related science topics are becoming increasingly popular for solving real-world problems across varied business domains.
Today, machine learning applications are driving mission-critical business decision-making in many organizations. These applications include recommendation engines, targeted advertising, speech recognition, fraud detection, image recognition and categorization, and so on.
In the next section, we will introduce the key components of the Spark ML pipeline API.
Understanding Spark ML pipelines and their components
The machine learning pipeline API was introduced in Apache Spark 1.2. Spark MLlib provides an API for developers to create and execute complex ML workflows. The Pipeline API lets developers quickly assemble distributed machine learning pipelines as the API been standardized applying different learning algorithms. Additionally, we can also combine multiple machine learning algorithms...