SAP HANA Cookbook

SAP HANA Cookbook

Overview of this book

SAP HANA is a real-time applications platform that provides a multi-purpose, in-memory appliance. Decision makers in the organization can gain instant insight into business operations. Thus all the data available can be analysed and you can react to the changing business conditions rapidly to make decisions. The real-time platform not only empowers business users and top management to make decisions but also provides the capability to make decisions in real-time.A practical and comprehensive guide that helps you understand the power of SAP HANA’s real-time and in-memory capabilities. It also provides step-by-step instructions to exploit all the possible features of the SAP HANA database, enabling users to harness the full potential of this technology and its features.You will gain an understanding of real-time replications, effective data loading from various sources, how to load data, and how to create re-usable objects such as models and reports.Use this practical guide to enable or transform your business landscape by implementing SAP HANA to meet your business requirements. The book shows you how to load data from different types of systems, create models in SAP HANA, and consume data for decision-making. The book covers various tools at different stages creating models using SAP HANA Studio, and consuming data using reporting tools such as SAP BusinessObjects, SAP Lumira, and so on . This book also explains the in-depth architecture of SAP HANA to help you understand SAP HANA as an appliance, that is, a combination of hardware and software.The book covers the best practices to leverage SAP HANA’s in-memory technology to transform data into insightful information. It also covers technology landscaping, solution architecture, connectivity, data loading, and setting up the environment for modeling purpose (including setup of SAP HANA Studio).If you have an intention to start your career as SAP HANA Modeler, this book is the perfect start.

SAP HANA Cookbook

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

SAP HANA Studio – Look and Feel

Introduction

Understanding SAP HANA Studio

Switching between different views – perspectives

Navigating SAP HANA Studio – the Navigator Pane

Administering SAP HANA – the Administration Console perspective

Modeling SAP HANA Studio – the Modeler perspective

Data Provisioning

Introduction

Loading data into SAP HANA – data provisioning methods

Uploading data from flat files

Using SLT to load data into SAP HANA

Using SAP Data Services as an ETL tool to load data into SAP HANA

Loading data into SAP HANA using DXC

Loading data using SAP Sybase Replication Server

Modeling

Introduction

Approaching SAP HANA modeling

Creating attribute views

Creating analytic views

Creating calculation views

Preparing documents – Auto Documentation

Modeling with Information Composer

Reporting

Introduction

The reporting layer on top of SAP HANA

Connecting reporting tools to SAP HANA

Creating reports using SAP BusinessObjects Web Intelligence

Creating reports using SAP BusinessObjects Explorer

Creating reports using SAP BusinessObjects Dashboards/Xcelsius

Creating reports using SAP BusinessObjects Analysis for OLAP

Creating reports using Microsoft Excel

Creating reports in SAP Lumira

Advanced Features in SAP HANA

Introduction

Converting different currencies

Creating hierarchies

Creating variables

Creating input parameters

Creating filters

Creating procedures using SQLScript

Creating decision tables

User Management

Introduction

Creating users

Creating roles

Assigning roles to users

Restricting access to data – creating analytic privileges

Securing logging in to SAP HANA – authentication methods

Securing logging in to SAP HANA – privileges

Introduction to SAP HANA

Introduction

Explaining traditional databases and bottlenecks

Introducing technology and hardware innovations

Looking into versions and technical requirements

Describing why you should use SAP HANA

Looking into SAP HANA features

Comparing BWA and SAP HANA

Architecture

Understanding the SAP HANA architecture

Explaining IMCE and its components

Storing data – row storage

Storing data – column storage

Understanding the persistence layer

Understanding backup and recovery

Applications Powered by SAP HANA

Introduction

Introducing flavors on top of SAP HANA

Introducing SAP NetWeaver BW powered by SAP HANA

Introducing SAP Business Suite on SAP HANA

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Storing data – column storage

Having learned about the row store engine of SAP HANA, now let us learn about the column store engine. Data will be stored in RAM, similar to the row store engine. The concept of column storage has emerged from Text Retrieval and Extraction (TREX). This technology was further developed into a full relational column-based datastore. Compression works well with columns and can speed up operations on columns up to a factor of 10. Column storage is optimized for high performance of a read operation. There are two types of indices for column store table for each column: a main storage and a delta storage. For write operations, the delta storage is optimized. The main storage is optimized in terms of the read performance and memory consumption. Performance issues when loading directly to compressed columns can be addressed by the delta tables.

The architecture of a column store is shown in the following diagram:

The components of the column engine are explained as follows:

Optimizer and Executor: Optimizer gets the logical execution plan from SQL Parser or Calc engine as input, and generates the optimized physical execution plan based on the database statistics. The best plan for accessing row or column stores will be determined by the database optimizer. Executor basically executes the physical execution plan to access the row and column stores, and also processes all the intermediate results.
Main Storage: Data is highly compressed and stored in the main storage. Being compressed and stored in column storage, data is read very fast.
Delta Storage: Delta storage is designed for fast writing operation. When there is an update operation to be performed, a new entry is added into the delta storage.
Delta Merge: Write operations are only performed on the delta storage. The database is transferred to the main storage in order to transform the data into a format that is optimized in terms of memory consumption and read performance. This is accomplished by a process called delta merge. The following section is intended to give a better understanding of how this happens and when.

The delta merge process

The following diagram describes the different states of a merge process, which objects are involved, and how they are accessed.

The following operations are performed for the merge process:

Before the merge operation: All the write operations go to the storage Delta1, and the read operations read from the storages Main1 and Delta1.
During the merge operation: When the merge operation is in progress, all the changes go into the second delta storage Delta2. The read operations continue from the original main storage (Main1) and from both the delta storage (Delta1 and Delta2). The uncommitted changes from Delta1 are copied to Delta2. The committed entries in Delta1 and content of Main1 are merged into the new main storage, that is, Main2.
After the merge operation: Main1 and Delta1 storages are deleted after the merge operation is complete.

Consistent view manager and transaction manager

The consistent view manager creates a consistent view throughout data for the moment in time when the query hits the system. Isolation of concurrent transactions is enforced by a central transaction manager, maintaining information about all write transactions and the consistent view manager deciding on visibility of records per table. A so-called transaction token is generated by the transaction manager for each transaction, encoding which transactions are open, and is committed at the point in time when the transaction has started. The transaction token holds all the information needed to construct the consistent view for a transaction or a statement. It is passed as additional context information to all the operations and engines that are involved in the execution of a statement.

It is better to go with column storage under the following situations:

Recommended when the tables contain huge volumes of data
Used when lot of aggregations need to be done on the tables
Used when the tables have huge number of columns
Used when the table has to be searched based on the values of few columns

The main advantages with column storage are

Number of cache cycles will be reduced and this will help to retrieve the data at a faster rate
Supports parallel processing

For more information, refer the following links:

SAP HANA Cookbook

SAP HANA Cookbook

Overview of this book

Related Content you might be interested in

Current Title:

SAP HANA Cookbook

Storing data – column storage

The delta merge process

Consistent view manager and transaction manager