Pentaho Data Integration (PDI) is a great tool to prepare data thanks to its rich data connectors. We will not discuss PDI further here as we already discussed it in the latter part of Chapter 3, Churning Big Data with Pentaho.
Before you proceed to the following examples, complete the steps listed in Appendix B, Hadoop Setup. Note that all the remaining examples work with the 192.168.1.122
IP address configuration at Hortonworks Sandbox VM.
The following steps will help you prepare BI Server to work with Hive:
Copy the
pentaho-hadoop-hive-jdbc-shim-1.3-SNAPSHOT.jar
andpentaho-hadoop-shims-api-1.3-SNAPSHOT.jar
files into the[BISERVER]/administration-console/jdbc
and[BISERVER]/biserver-ce/tomcat/lib
folders respectively. See Chapter 3, Churning Big Data with Pentaho, for information on how to obtain theJAR
files.Launch Pentaho User Console (PUC).
Copy the
Chapter 4
folder from the book's code bundle folder into[BISERVER]/pentaho-solutions...