We created a solution to a very simple counting problem and aptly called it Hello World in Hadoop. We went through all the components of a Hadoop MapReduce implementation in Java.
We are now ready to launch a cluster on EMR and test this simple solution. In the next chapter, we will create an S3 bucket and upload the solution .jar file along with the sample input file, and then launch an EMR cluster that will execute our solution. Once it completes, we will download the output and examine it. We will also learn about the various Hadoop job models available on EMR.