40 Algorithms Every Programmer Should Know

By : Imran Ahmad

5 (2)

Buy this Book

40 Algorithms Every Programmer Should Know

5 (2)

By: Imran Ahmad

Buy this Book

Overview of this book

Algorithms have always played an important role in both the science and practice of computing. Beyond traditional computing, the ability to use algorithms to solve real-world problems is an important skill that any developer or programmer must have. This book will help you not only to develop the skills to select and use an algorithm to solve real-world problems but also to understand how it works. You’ll start with an introduction to algorithms and discover various algorithm design techniques, before exploring how to implement different types of algorithms, such as searching and sorting, with the help of practical examples. As you advance to a more complex set of algorithms, you'll learn about linear programming, page ranking, and graphs, and even work with machine learning algorithms, understanding the math and logic behind them. Further on, case studies such as weather prediction, tweet clustering, and movie recommendation engines will show you how to apply these algorithms optimally. Finally, you’ll become well versed in techniques that enable parallel processing, giving you the ability to use these algorithms for compute-intensive tasks. By the end of this book, you'll have become adept at solving real-world computational problems by using a wide range of algorithms.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Section 1: Fundamentals and Core Algorithms

Free Chapter

Overview of Algorithms

What is an algorithm?

Specifying the logic of an algorithm

Introducing Python packages

Algorithm design techniques

Performance analysis

Validating an algorithm

Summary

Data Structures Used in Algorithms

Exploring data structures in Python

Exploring abstract data types

Summary

Sorting and Searching Algorithms

Introducing Sorting Algorithms

Introduction to Searching Algorithms

Practical Applications

Summary

Designing Algorithms

Introducing the basic concepts of designing an algorithm

Understanding algorithmic strategies

Practical application – solving the TSP

Presenting the PageRank algorithm

Understanding linear programming

Practical application – capacity planning with linear programming

Summary

Graph Algorithms

Representations of graphs

Introducing network analysis theory

Understanding graph traversals

Case study – fraud analytics

Summary

Section 2: Machine Learning Algorithms

Unsupervised Machine Learning Algorithms

Introducing unsupervised learning

Understanding clustering algorithms

Dimensionality reduction

Association rules mining

Practical application– clustering similar tweets together

Anomaly-detection algorithms

Summary

Traditional Supervised Learning Algorithms

Understanding supervised machine learning

Understanding classification algorithms

Understanding regression algorithms

Practical example – how to predict the weather

Summary

Neural Network Algorithms

Understanding ANNs

The Evolution of ANNs

Training a Neural Network

Tools and Frameworks

Transfer Learning

Case study – using deep learning for fraud detection

Summary

Algorithms for Natural Language Processing

Introducing NLP

BoW-based NLP

Introduction to word embedding

Using RNNs for NLP

Using NLP for sentiment analysis

Case study: movie review sentiment analysis

Summary

Recommendation Engines

Introducing recommendation systems

Types of recommendation engines

Understanding the limitations of recommender systems

Areas of practical applications

Practical example – creating a recommendation engine

Summary

Section 3: Advanced Topics

Data Algorithms

Introduction to data algorithms

Presenting data storage algorithms

Presenting streaming data algorithms

Presenting data compression algorithms

A practical example – Twitter real-time sentiment analysis

Summary

Cryptography

Introduction to Cryptography

Understanding the Types of Cryptographic Techniques

Example – Security Concerns When Deploying a Machine Learning Model

Summary

Large-Scale Algorithms

Introduction to large-scale algorithms

The design of parallel algorithms

Strategizing multi-resource processing

Summary

Practical Considerations

Introducing practical considerations

The explainability of an algorithm

Understanding ethics and algorithms

Reducing bias in models

Tackling NP-hard problems

When to use algorithms

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 (2)

5 star

100%

4 star

3 star

2 star

1 star

Specifying the logic of an algorithm

When designing an algorithm, it is important to find different ways to specify its details. The ability to capture both its logic and architecture is required. Generally, just like building a home, it is important to specify the structure of an algorithm before actually implementing it. For more complex distributed algorithms, pre-planning the way their logic will be distributed across the cluster at running time is important for the iterative efficient design process. Through pseudocode and execution plans, both these needs are fulfilled and are discussed in the next section.

Understanding pseudocode

The simplest way to specify the logic for an algorithm is to write the higher-level description of an algorithm in a semi-structured way, called pseudocode. Before writing the logic in pseudocode, it is helpful to first describe its main flow by writing the main steps in plain English. Then, this English description is converted into pseudocode, which is a structured way of writing this English description that closely represents the logic and flow for the algorithm. Well-written algorithm pseudocode should describe the high-level steps of the algorithm in reasonable detail, even if the detailed code is not relevant to the main flow and structure of the algorithm. The following figure shows the flow of steps:

Note that once the pseudocode is written (as we will see in the next section), we are ready to code the algorithm using the programming language of our choice.

A practical example of pseudocode

Figure 1.3 shows the pseudocode of a resource allocation algorithm called SRPMP.In cluster computing, there are many situations where there are parallel tasks that need to be run on a set of available resources, collectively called a resource pool. This algorithm assigns tasks to a resource and creates a mapping set, called Ω. Note that the presented pseudocode captures the logic and flow of the algorithm, which is further explained in the following section:

1: BEGIN Mapping_Phase
2: Ω = { }
3: k = 1
4: FOREACH T_i∈T
5:     ω_i = RA(Δ_k,T_i)
6:     add {ω_i,T_i} to Ω
7:     state_change_Ti [STATE 0: Idle/Unmapped] → [STATE 1: Idle/Mapped]
8:     k=k+1
9:     IF (k>q)
10:       k=1
11:    ENDIF
12: END FOREACH
13: END Mapping_Phase

Let's parse this algorithm line by line:

We start the mapping by executing the algorithm. The Ω mapping set is empty.
The first partition is selected as the resource pool for the T₁ task (see line 3 of the preceding code). Television Rating Point (TRPS) iteratively calls the Rheumatoid Arthritis (RA) algorithm for each T_i task with one of the partitions chosen as the resource pool.
The RA algorithm returns the set of resources chosen for the T_i task, represented by ω_i (see line 5 of the preceding code).
T_i and ω_i are added to the mapping set (see line 6 of the preceding code).
The state of T_i is changed from STATE 0:Idle/Mapping to STATE 1:Idle/Mapped (see line 7 of the preceding code).
Note that for the first iteration, k=1 and the first partition is selected. For each subsequent iteration, the value of k is increased until k>q.
If k becomes greater than q, it is reset to 1 again (see lines 9 and 10 of the preceding code).
This process is repeated until a mapping between all tasks and the set of resources they will use is determined and stored in a mapping set called Ω.
Once each of the tasks is mapped to a set of the resources in the mapping phase, it is executed.

Using snippets

With the popularity of simple but powerful coding language such as Python, an alternative approach is becoming popular, which is to represent the logic of the algorithm directly in the programming language in a somewhat simplified version. Like pseudocode, this selected code captures the important logic and structure of the proposed algorithm, avoiding detailed code. This selected code is sometimes called a snippet. In this book, snippets are used instead of pseudocode wherever possible as they save one additional step. For example, let's look at a simple snippet that is about a Python function that can be used to swap two variables:

define swap(x, y)
    buffer = x
    x = y
    y = buffer

Note that snippets cannot always replace pseudocode. In pseudocode, sometimes we abstract many lines of code as one line of pseudocode, expressing the logic of the algorithm without becoming distracted by unnecessary coding details.

Creating an execution plan

Pseudocode and snippets are not always enough to specify all the logic related to more complex distributed algorithms. For example, distributed algorithms usually need to be divided into different coding phases at runtime that have a precedence order. The right strategy to divide the larger problem into an optimal number of phases with the right precedence constraints is crucial for the efficient execution of an algorithm.

We need to find a way to represent this strategy as well to completely represent the logic and structure of an algorithm. An execution plan is one of the ways of detailing how the algorithm will be subdivided into a bunch of tasks. A task can be mappers or reducers that can be grouped together in blocks called stages. The following diagram shows an execution plan that is generated by an Apache Spark runtime before executing an algorithm. It details the runtime tasks that the job created for executing our algorithm will be divided into:

Note that the preceding diagram has five tasks that have been divided into two different stages: Stage 11 and Stage 12.

40 Algorithms Every Programmer Should Know

By : Imran Ahmad

40 Algorithms Every Programmer Should Know

By: Imran Ahmad

Overview of this book

Related Content you might be interested in

Current Title:

40 Algorithms Every Programmer Should Know