As a top-level abstraction language, Hive provides a handy tool for manipulating data storage on HDFS with SQL-like language. In this section, we will talk about installing Hive on our Hadoop cluster.
Before we install Hive, we need to make sure Hadoop has been properly installed. Please refer to the previous sections about the configuration of a Hadoop cluster.
Download Hive from a mirror site with a command similar to the following on the administrator machine:
wget http://apache.osuosl.org/hive/stable/hive-0.9.0.tar.gz -P ~/repo
Use the following steps to install Hive:
Log in to the master node from the Hadoop administrator machine as
hduser
with the following command:ssh hduser@master
Copy the archive to
/usr/local
with the following command:sudo wget ftp://hadoop.admin/repo/hive-0.9.0.tar.gz /usr/local
Decompress the Hive archive with the following command:
cd /usr/local tar xvf hive-0.9.0.tar.gz
Create a symbolic link with the following command...