By: Raghav Bali

Overview of this book

Data science and machine learning are some of the top buzzwords in the technical world today. From retail stores to Fortune 500 companies, everyone is working hard to make machine learning deliver data-driven insights that grow their business. With powerful data manipulation features, machine learning packages, and an active developer community, R empowers users to build sophisticated machine learning systems that solve real-world data problems. This book takes you on a data-driven journey that starts with the very basics of R and machine learning and gradually builds on those concepts to work on projects that tackle real-world problems. You’ll begin by getting an understanding of the core concepts and definitions required to appreciate machine learning algorithms. Building upon the basics, you will then work on three different projects that apply the concepts of machine learning, follow current trends, and cover major algorithms as well as popular R packages in detail. These projects have been neatly divided into six chapters covering the worlds of e-commerce, finance, and social media, which are at the very core of this data-driven revolution. Each project will help you to understand, explore, visualize, and derive insights depending upon the domain and algorithms. Through this book, you will learn to apply the concepts of machine learning to data-related problems and solve them using the powerful yet simple language, R.

Chapter 2. Let's Help Machines Learn

Machine learning, when you first hear of it, sounds more like a fancy term from a sci-fi movie than the latest trend in the tech industry. Mention it to people in general and their responses range from general curiosity about the concept to caution and fear about intelligent machines taking over our world in some sort of Terminator-Skynet way.

We live in a digital age and are constantly presented with all sorts of information. As we will see in this and the coming chapters, machine learning is something that loves data. In fact, the recent hype and interest in this field have been fueled not just by improvements in computing technology but also by exponential growth in the amount of data being generated every second. The latest numbers stand at around 2.5 quintillion bytes of data every day (that's 25 followed by 17 zeroes, or 2.5 × 10^18)!


Fun Fact: More than 300 hours of video data is uploaded to YouTube every minute...