Book Image

Mastering Hadoop

By : Sandeep Karanth
Book Image

Mastering Hadoop

By: Sandeep Karanth

Overview of this book

Table of Contents (21 chapters)
Mastering Hadoop
Credits
About the Author
Acknowledgments
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


With cloud computing becoming a focus for its elasticity and cost-effectiveness, Microsoft has moved into the arena to compete with existing players. To maintain parity with competition, Microsoft Azure not only offers Linux-based Virtual Machines, but has also embraced open source big data systems such as Hadoop. HDInsight offers HaaS on Microsoft Azure.

The key takeaways from this chapter are as follows:

  • Hadoop is now natively available on Windows. Installing Unix emulators or Linux VMs on Windows OS is no longer necessary.

  • Hadoop support on Windows natively has two missing features: Security features and short-circuit HDFS reads are not yet integrated with this system.

  • Hadoop on Windows requires building the Hadoop distribution from scratch. Direct download of Hadoop binaries for Windows is not yet available.

  • HDInsight, Hadoop as a service offering on Microsoft Azure, provides seamless Excel integration and integration with platforms such as the Hortonworks Data Platform.