Pentaho for Big Data Analytics

By: Manoj R Patil, Feris Thia

Overview of this book

Pentaho accelerates the realization of value from big data with a complete solution for big data analytics and data integration. The real power of big data analytics lies in the abstraction between data and analytics. Data can be distributed across the cluster in various formats, and the analytics platform should be able to talk to heterogeneous data stores and fetch filtered data to enrich its value.

Pentaho for Big Data Analytics is a practical, hands-on guide that provides clear, step-by-step exercises for using Pentaho to take advantage of big data systems, where data beats algorithms, and gives you a good grounding in the capabilities of Pentaho Business Analytics.

This book looks at the key ingredients of the Pentaho Business Analytics platform. We will see how to prepare the Pentaho BI environment and get to grips with the big data ecosystem. The book provides a clear guide to the essential tools of Pentaho Business Analytics, building familiarity with both the design tools for setting up reports and the visualization tools necessary for complete data analysis.

Edge over competitors


What sets Pentaho apart from other existing BI solutions is the vast data connectivity provided by the Pentaho abstraction layer. This makes it a very complete solution for data integration across many heterogeneous source systems and data stores.

Pentaho's OLAP solution also works flexibly across various relational database engines, regardless of whether the database is proprietary or open source.

The big benefit of Pentaho is its clear vision in adopting Big Data sources and NoSQL solutions, which are increasingly accepted in enterprises across the world.

Apache Hadoop has become increasingly popular, and Pentaho's growing feature set has kept pace with it. Once you have the Hadoop platform, you can use Pentaho to write data to or read data from HDFS (Hadoop Distributed File System) and also orchestrate MapReduce processes in Hadoop clusters with an easy-to-use GUI designer.
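To make the idea of a MapReduce process a little more concrete, here is a minimal sketch of the map, shuffle, and reduce phases in plain Python, using the classic word count example. It is only an illustration of the programming model: it runs in a single process, does not touch Hadoop, HDFS, or any Pentaho API, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical names chosen for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over in-memory lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input; on a real cluster the lines would come from files in HDFS.
    sample = ["big data beats algorithms", "big data needs big clusters"]
    print(word_count(sample))
```

In a real Hadoop cluster, the map and reduce steps run in parallel on many nodes and the shuffle is handled by the framework; a GUI designer such as the one mentioned above lets you define those steps visually rather than in code.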

Pentaho has also emphasized visualization, a key ingredient of any analytics platform. Its recent acquisition of Webdetails, a Portugal-based business analytics solution company, clearly shows this. Webdetails brought on board a fantastic set of UI-based community tools (known as CTools), such as Community Dashboard Framework (CDF) and Community Data Access (CDA).