IBM Cloud Pak for Data

By : Hemanth Manda, Sriram Srinivasan, Deepak Rangarao

3 (1)

Buy this Book

IBM Cloud Pak for Data

3 (1)

By: Hemanth Manda, Sriram Srinivasan, Deepak Rangarao

Buy this Book

Overview of this book

Cloud Pak for Data is IBM's modern data and AI platform that includes strategic offerings from its data and AI portfolio delivered in a cloud-native fashion with the flexibility of deployment on any cloud. The platform offers a unique approach to addressing modern challenges with an integrated mix of proprietary, open-source, and third-party services. You'll begin by getting to grips with key concepts in modern data management and artificial intelligence (AI), reviewing real-life use cases, and developing an appreciation of the AI Ladder principle. Once you've gotten to grips with the basics, you will explore how Cloud Pak for Data helps in the elegant implementation of the AI Ladder practice to collect, organize, analyze, and infuse data and trustworthy AI across your business. As you advance, you'll discover the capabilities of the platform and extension services, including how they are packaged and priced. With the help of examples present throughout the book, you will gain a deep understanding of the platform, from its rich capabilities and technical architecture to its ecosystem and key go-to-market aspects. By the end of this IBM book, you'll be able to apply IBM Cloud Pak for Data's prescriptive practices and leverage its capabilities to build a trusted data foundation and accelerate AI adoption in your enterprise.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the color images

Conventions used

Get in touch

Share Your Thoughts

Section 1: The Basics

Free Chapter

Chapter 1: The AI Ladder – IBM's Prescriptive Approach

Market dynamics and IBM's Data and AI portfolio

Introduction to the AI ladder

Collect – making data simple and accessible

Organize – creating a trusted analytics foundation

Analyze – building and scaling models with trust and transparency

Infuse – operationalizing AI throughout the business

Summary

Chapter 2: Cloud Pak for Data: A Brief Introduction

The case of a data and AI platform – recap

Overview of Cloud Pak for Data

Exploring unique differentiators, key use cases, and customer adoption

Cloud Pak for Data: additional details

Red Hat OpenShift

Summary

Section 2: Product Capabilities

Chapter 3: Collect – Making Data Simple and Accessible

Data – the world's most valuable asset

Challenges with data-centric delivery

Enterprise data architecture

Data virtualization – accessing data anywhere

Data virtualization versus ETL – when to use what?

Platform connections – streamlining data connectivity

Data estate modernization using Cloud Pak for Data

Summary

Chapter 4: Organize – Creating a Trusted Analytics Foundation

Introducing Data Operations (DataOps)

Organizing enterprise information assets

Establishing metadata and stewardship

Profiling to get a better understanding of your data

Classifying data for completeness

Enabling trust with data quality

Data privacy and activity monitoring

Data integration at scale

Master data management

Extending MDM toward a Digital Twin

Summary

Chapter 5: Analyzing: Building, Deploying, and Scaling Models with Trust and Transparency

Self-service analytics of governed data

BI and reporting

Predictive versus prescriptive analytics

Understanding AI

AI life cycle – Transforming insights into action

AI governance: Trust and transparency

Automating the AI life cycle using Cloud Pak for Data

Summary

Chapter 6: Multi-Cloud Strategy and Cloud Satellite

IBM's multi-cloud strategy

Supported deployment options

Cloud Pak for Data as a Service

IBM Cloud Satellite

A data fabric for a multi-cloud future

Summary

Chapter 7: IBM and Partner Extension Services

IBM and third-party extension services

Collect extension services

Organize extension services

Infuse cartridges

Modernization upgrades to Cloud Pak for Data cartridges

Summary

Chapter 8: Customer Use Cases

Improving health advocacy program efficiency

Voice-enabled chatbots

Risk and control automation

Enhanced border security

Unified Data Fabric

Financial planning and analytics

Summary

Section 3: Technical Details

Chapter 9: Technical Overview, Management, and Administration

Technical requirements

Architecture overview

Infrastructure requirements, storage, and networking

Foundational services and the control plane

Multi-tenancy, resource management, and security

Day 2 operations

Summary

References

Chapter 10: Security and Compliance

Technical requirements

Security and Privacy by Design

Secure operations in a shared environment

User access and authorizations

Meeting compliance requirements

Summary

References

Chapter 11: Storage

Understanding the concept of persistent volumes

Off-cluster storage

Operational considerations

Summary

Analyze – building and scaling models with trust and transparency

Enterprises are either building AI or buying AI solutions to address specific requirements. In the case of a build scenario, companies would benefit significantly from commercially available data science tools such as Watson Studio. IBM's Watson Studio not only allows you to make significant productivity gains but also ensures collaboration among the different data scientists and user personas.

Investing in building AI and retraining employees can have a significant payoff. Pioneers across multiple industries are building AI and separating themselves from laggards:

In construction, they're using AI to optimize infrastructure design and customization.
In healthcare, companies are using AI to predict health problems and disease symptoms.
In life science, organizations are advancing image analysis to research drug effects.
In financial services, companies are using AI to assist in fraud analysis and investigation.
Finally, autonomous vehicles are using AI to adapt to changing conditions in vehicles, while call centers are using AI for automating customer service.

However, several hurdles remain, and enterprises face significant challenges in operationalizing AI value.

There are three areas that we need to tackle:

Data: 80% of time is spent preparing data versus building AI models.
Talent: 65% find it difficult to fund or acquire AI skills.
Trust: 44% say it's very challenging to build trust in AI outcomes.

Source: 2019 Forrester, Challenges That Hold Firms Back From Achieving AI Aspirations.

Also, it's worth pointing out that building AI models is the easy part. The real challenge lies in deploying those AI models into production, monitoring them for accuracy and drift detection, and ensuring that this becomes the norm.

IBM's AI tools and runtimes on Cloud Pak for Data present a differentiated and extremely strong set of capabilities. Supported by the Red Hat OpenShift and Cloud Pak for Data strategy, IBM is in a position to set and lead the market for AI tools. There are plenty of point AI solutions from niche vendors in the market, as evidenced from the numerous analyst reports; however, none of them are solving the problem of putting AI into production in a satisfactory manner. The differentiation that IBM brings to the market is the full end-to-end AI life cycle:

Figure 1.5 – AI life cycle

Customers are looking for an integrated platform for a few reasons. Before we get to these reasons, the following teams care about the integrated platform:

Data science teams are looking for integrated systems to manage assets across the AI life cycle and across project team members.
Chief Data Officer are looking to govern AI models and the data associated with them. Chief Risk Officer (CRO) are looking to control the risks that these models expose by being integrated with business processes.
Extended AI application teams need integration so that they can build, deploy, and run seamlessly. In some situations, Chief information officer (CIOs)/business technology teams who want to de-risk and reduce the costs of taking an AI application to production are responsible for delivering a platform.
Customer Use Case
A Fortune 500 US bank is looking for a solution in order to rapidly deploy machine learning projects to production. The first step in this effort is to put in place a mechanism that allows project teams to deliver pilots without having to go through full risk management processes (from corporate risk/MRM teams). They call this a soft launch, which will work with some production data. The timeline to roll out projects is 6-9 months from conceptualization to pilot completion. This requirement is being championed (and will need to be delivered across the bank) by the business technology team (who are responsible for the AI operations portal). The idea is that this will take the load away from MRM folks who have too much on their plate but still have a clear view of how and what risk was evaluated. LOB will be using the solution every week to retrain models. However, before that, they will upload a CSV file, check any real-time responses, and pump data to verify that the model is meeting strategy goals. All this must be auto-documented.

One of the key differentiators for IBM's AI life cycle is AutoAI, which allows data scientists to create multiple AI models and score them for accuracy. Some of these tests are not supposed to be black and white.

Several customers are beginning to automate AI development. Due to this, the following question arises: why automate model development? Because if you can automate the AI life cycle, you can enhance your success rate.

An automated AI life cycle allows you to do the following:

Expand your talent pool: This lowers the skills required to build and operationalize AI models
Speed up time to delivery: This is done by minimizing mundane tasks.
Increase the readiness of AI-powered apps: This is done by optimizing model accuracy and KPIs.
Deliver real-time governance: This improves trust and transparency by ensuring model management, governance, explainability, and versioning.

Next, we will explore how AI is operationalized in enterprises to address specific use cases and drive business value.

IBM Cloud Pak for Data

By : Hemanth Manda, Sriram Srinivasan, Deepak Rangarao

IBM Cloud Pak for Data

By: Hemanth Manda, Sriram Srinivasan, Deepak Rangarao

Overview of this book

Related Content you might be interested in

Current Title:

IBM Cloud Pak for Data

Hybrid Cloud Infrastructure and Operations Explained

Principles of Data Fabric

Hybrid Cloud Security Patterns

Analyze – building and scaling models with trust and transparency