Book Image

Building Data-Driven Applications with Danfo.js

By : Rising Odegua, Stephen Oni
Book Image

Building Data-Driven Applications with Danfo.js

By: Rising Odegua, Stephen Oni

Overview of this book

Most data analysts use Python and pandas for data processing for the convenience and performance these libraries provide. However, JavaScript developers have always wanted to use machine learning in the browser as well. This book focuses on how Danfo.js brings data processing, analysis, and ML tools to JavaScript developers and how to make the most of this library to build data-driven applications. Starting with an overview of modern JavaScript, you’ll cover data analysis and transformation with Danfo.js and Dnotebook. The book then shows you how to load different datasets, combine and analyze them by performing operations such as handling missing values and string manipulations. You’ll also get to grips with data plotting, visualization, aggregation, and group operations by combining Danfo.js with Plotly. As you advance, you’ll create a no-code data analysis and handling system and create-react-app, react-table, react-chart, Draggable.js, and tailwindcss, and understand how to use TensorFlow.js and Danfo.js to build a recommendation system. Finally, you’ll build a Twitter analytics dashboard powered by Danfo.js, Next.js, node-nlp, and Twit.js. By the end of this app development book, you’ll be able to build and embed data analytics, visualization, and ML capabilities into any JavaScript app in server-side Node.js or the browser.
Table of Contents (18 chapters)
1
Section 1: The Basics
3
Section 2: Data Analysis and Manipulation with Danfo.js and Dnotebook
10
Section 3: Building Data-Driven Applications

Chapter 7: Data Aggregation and Group Operations

Data aggregation and group operations are very important methods in data analysis. These methods provide the ability to split data into a set of groups based on the specified key, and then apply some set of groupby operations (aggregations or transformations) to the grouped data to produce a new set of values. The resulting values are then combined into a single data group.

This approach is popularly known as split-apply-combine. The term was actually coined by Hadley Wickham, the author of many popular R packages, to describe group operations. Figure 7.1 describes the idea of split-apply-combine graphically:

Figure 7.1 – groupby illustration

In this chapter, we look into ways of performing group operations: how to group data by column keys and perform data aggregation on grouped data jointly or independently.

This chapter will also show how to access grouped data by keys. It also gives insight into...