Book Image

Java Data Analysis

By : John R. Hubbard
Book Image

Java Data Analysis

By: John R. Hubbard

Overview of this book

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the aim of discovering useful information. Java is one of the most popular languages to perform your data analysis tasks. This book will help you learn the tools and techniques in Java to conduct data analysis without any hassle. After getting a quick overview of what data science is and the steps involved in the process, you’ll learn the statistical data analysis techniques and implement them using the popular Java APIs and libraries. Through practical examples, you will also learn the machine learning concepts such as classification and regression. In the process, you’ll familiarize yourself with tools such as Rapidminer and WEKA and see how these Java-based tools can be used effectively for analysis. You will also learn how to analyze text and other types of multimedia. Learn to work with relational, NoSQL, and time-series data. This book will also show you how you can utilize different Java-based libraries to create insightful and easy to understand plots and graphs. By the end of this book, you will have a solid understanding of the various data analysis techniques, and how to implement them using Java.
Table of Contents (20 chapters)
Java Data Analysis
Credits
About the Author
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface
Index

Java


Java has been, for at least a decade now, the most popular programming language. Nearly all the software described in this appendix is itself written in Java.

Java was developed in the early 1990s by a team of software researchers, led by James Gosling. It was released by Sun Microsystems in 1995 and was maintained by that company until it was acquired by Oracle in 2010.

Java comes pre-installed on the Mac. To update it, open System Preferences and click on the Java icon at the bottom of the panel. This brings up the Java Control Panel. The About… button under the General tab will display the current version of Java that is installed. As of August, 2017, that should be Version 8. The Update tab allows for updates.

Java can be run from the command line or from within an Integrated Development Environment (IDE). To do the former, you will need a text editor to write your code. On the Mac, you can use the TextEdit app. The four-line Hello World program is shown in a TextEdit window in Figure A-5.

Figure A-5. The Hello World program in Java

Note that, for this to compile, the quotation marks must be flat (like this " "), not smart (like this " "). To get that right, open the Preferences panel in TextEdit and de-select Smart Quotes. Also note that the TextEdit's default file types, RTF and TXT, must be overridden—Java source code must be saved in a file whose name is the same as the public class and whose type is .java.

We saved the source code in our hub/demos/ folder. The shell session in Figure A-6 appears as follows:

Figure A-6. Running the Hello World program

From this, we can learn:

  • How to display the source code with the cat command

  • How to compile the source code with the javac command

  • How to run the program with the java command

The javac command creates the class file, Hello.class, which we then execute with the java command, thus generating the expected output.

The Echo program in Figure A-7 illustrates the use of command line arguments.

Figure A-7. Reading command line arguments

The program prints each element of its args[] array. In the run shown here, there are four of them, entered on the command line immediately after the name of the program.