Book Image

R Data Science Essentials

Book Image

R Data Science Essentials

Overview of this book

With organizations increasingly embedding data science across their enterprise and with management becoming more data-driven it is an urgent requirement for analysts and managers to understand the key concept of data science. The data science concepts discussed in this book will help you make key decisions and solve the complex problems you will inevitably face in this new world. R Data Science Essentials will introduce you to various important concepts in the field of data science using R. We start by reading data from multiple sources, then move on to processing the data, extracting hidden patterns, building predictive and forecasting models, building a recommendation engine, and communicating to the user through stunning visualizations and dashboards. By the end of this book, you will have an understanding of some very important techniques in data science, be able to implement them using R, understand and interpret the outcomes, and know how they helps businesses make a decision.
Table of Contents (15 chapters)
R Data Science Essentials
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


In this chapter, we covered the prerequisites on the data format for the implementation of clustering algorithms and a few major clustering techniques, such as the centroid-based clustering algorithm, hierarchical clustering, and model-based and density-based clustering algorithms. We discussed about a few methodologies to evaluate the outcome of a clustering algorithm and various use cases across multiple fields that can be solved with the implementation of a clustering algorithm.

In the next chapter, we will demonstrate why regression models are used, the difference between a logistic regression and linear regression, and how to implement regression models using R. We will also explore the various methods used to check fit accuracy, the different methodologies that can be used to improve the accuracy of the model, and understand the output of regression models.