Figure 4 shows the structure of the source code and the data directories that are being used in this chapter. A description of each of them is not provided here as the reader should be familiar with them, and they have been covered in Chapter 6, Spark Stream Processing. There are external library file dependency requirements for running the programs using Kafka. For that, the instructions to download the JAR file are in the TODO.txt
file in the lib
folders. The submitPy.sh
and submit.sh
files use some of the Kafka
libraries in the Kafta installation as well. All these external JAR file dependencies have already been covered in Chapter 6, Spark Stream Processing.