Book Image

Practical Big Data Analytics

By : Nataraj Dasgupta
Book Image

Practical Big Data Analytics

By: Nataraj Dasgupta

Overview of this book

Big Data analytics relates to the strategies used by organizations to collect, organize, and analyze large amounts of data to uncover valuable business insights that cannot be analyzed through traditional systems. Crafting an enterprise-scale cost-efficient Big Data and machine learning solution to uncover insights and value from your organization’s data is a challenge. Today, with hundreds of new Big Data systems, machine learning packages, and BI tools, selecting the right combination of technologies is an even greater challenge. This book will help you do that. With the help of this guide, you will be able to bridge the gap between the theoretical world of technology and the practical reality of building corporate Big Data and data science platforms. You will get hands-on exposure to Hadoop and Spark, build machine learning dashboards using R and R Shiny, create web-based apps using NoSQL databases such as MongoDB, and even learn how to write R code for neural networks. By the end of the book, you will have a very clear and concrete understanding of what Big Data analytics means, how it drives revenues for organizations, and how you can develop your own Big Data analytics solution using the different tools and methods articulated in this book.
Table of Contents (16 chapters)
Title Page
Packt Upsell
Contributors
Preface

Machine learning and deep learning links


Scikit-Learn: The most comprehensive Machine Learning package in Python:http://scikit-learn.org/stable/

Tensorflow: A well-known solution for Deep Learning from Google:https://www.tensorflow.org

MLPACK: Machine Learning using C++ and Unix Command Line:http://www.mlpack.org

Word2Vec: One of the well-known packages for Natural Language Processing:https://deeplearning4j.org/word2vec

Vowpal Wabbit: Excellent Machine Learning software used in many Kaggle competitions:https://github.com/JohnLangford/vowpal_wabbit/wiki/Tutorial

LIBSVM & LIBLINEAR: Highly regarded command line machine learning tools:https://www.csie.ntu.edu.tw/~cjlin/libsvm/https://www.csie.ntu.edu.tw/~cjlin/liblinear/

LIBFM: Matrix Factorization:http://www.libfm.org

PaddlePaddle: Deep Learning from Baidu:https://github.com/PaddlePaddle/Paddle

CuDNN: Deep Learning/Neural Network solution from NVIDIA:https://developer.nvidia.com/cudnn

Caffe: Deep Learning framework from Berkeley:http://caffe.berkeleyvision...