Before we begin the walkthrough, see Appendix A, Big Data Sets, to complete the Hive nyse_stocks
data preparation and follow these steps:
Launch Spoon if you have closed it.
On the File menu, click on New and select Transformation.
On the left-hand side panel, click on the View tab.
Right-click on the Database connections node to show up a contextual menu and choose New.
The following screenshot shows you how to create a new database connection:
When the Database Connection dialog appears, fill in the following configuration:
Connection Name:
HIVE2
Connection Type:
Hadoop Hive 2
Host Name: [your working IP address]
Database Name:
default
Now follow these steps:
Click on the Test button to verify the connection. If successful, click on the OK button to close it. The display window will look like the following screenshot:
On the left-hand side panel, click on the Design tab.
In the Input group, click on the Table input step and drag it into the working space. The following screenshot...