.NET 4.0 Generics Beginner's Guide

.NET 4.0 Generics Beginner's Guide

By : Sudipta Mukherjee

Buy this Book

.NET 4.0 Generics Beginner's Guide

By: Sudipta Mukherjee

Buy this Book

Overview of this book

Generics were added as part of .NET Framework 2.0 in November 2005. Although similar to generics in Java, .NET generics do not apply type erasure but every object has unique representation at run-time. There is no performance hit from runtime casts and boxing conversions, which are normally expensive..NET offers type-safe versions of every classical data structure and some hybrid ones. This book will show you everything you need to start writing type-safe applications using generic data structures available in Generics API. You will also see how you can use several collections for each task you perform. This book is full of practical examples, interesting applications, and comparisons between Generics and more traditional approaches. Finally, each container is bench marked on the basis of performance for a given task, so you know which one to use and when. This book first covers the fundamental concepts such as type safety, Generic Methods, and Generic Containers. As the book progresses, you will learn how to join several generic containers to achieve your goals and query them efficiently using Linq. There are short exercises in every chapter to boost your knowledge. The book also teaches you some best practices, and several patterns that are commonly available in generic code. Some important generic algorithm definitions are present in Power Collection (an API created by Wintellect Inc.) that are missing from .NET framework. This book shows you how to use such algorithms seamlessly with other generic containers. The book also discusses C5 collections. Java Programmers will find themselves at home with this API. This is the closest to JCF. Some very interesting problems are solved using generic containers from .NET framework, C5, and PowerCollection Algorithms ñ a clone of Google Set and Gender Genie for example! The author has also created a website (http://www.consulttoday.com/genguide) for the book where you can find many useful tools, code snippets, and, applications, which are not the part of code-download section

.NET 4.0 Generics

Credits

Foreword

About the Author

Acknowledgement

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Why Generics?

An analogy

Setting up the environment

Summary

Lists

Why bother learning about generic lists?

Types of generic lists

Checking whether a sequence is a palindrome or not

Time for action — creating the generic stack as the buffer

Time for action — completing the rest of the method

Designing a generic anagram finder

Time for action — creating the method

Life is full of priorities, let's bring some order there

Time for action — creating the data structure for the prioritized shopping list

Time for action — let's add some gadgets to the list and see them

Time for action — let's strike off the gadgets with top-most priority after we have bought them

Time for action — let's create an appointment list

Live sorting and statistics for online bidding

Time for action — let's create a custom class for live sorting

Why did we have three LinkedList<T> as part of the data structure?

An attempt to answer questions asked by your boss

Time for action — associating products with live sorted bid amounts

Time for action — finding common values across different bidding amount lists

You will win every scrabble game from now on

Time for action — creating the method to find the character histogram of a word

Time for action — checking whether a word can be formed

Time for action — let's see whether it works

Trying to fix an appointment with a doctor?

Time for action — creating a set of dates of the doctors' availability

Time for action — finding out when both doctors shall be present

Revisiting the anagram problem

Time for action — re-creating the anagram finder

Lists under the hood

Summary

Dictionaries

Types of generic associative structures

Creating a tag cloud generator using dictionary

Time for action — creating the word histogram

Creating a bubble wrap popper game

Time for action — creating the game console

Look how easy it was!

How did we decide we need a dictionary and not a list?

Let's build a generic autocomplete service

Time for action — creating a custom dictionary for autocomplete

Time for action — creating a class for autocomplete

The most common pitfall. Don't fall there!

Let's play some piano

Time for action — creating the keys of the piano

How are we recording the key strokes?

Time for action — switching on recording and playing recorded keystrokes

C# Dictionaries can help detect cancer. Let's see how!

Time for action — creating the KNN API

Time for action — getting the patient records

Time for action — creating the helper class to read a delimited file

Time for action — let's see how to use the predictor

Tuples are great for many occasions including games

Time for action — putting it all together

Why have we used Tuples?

How did we figure out whether the game is over or not?

Summary

LINQ to Objects

What makes LINQ?

Time for action — creating an Extension method

Time for action — consuming our new Extension method

Putting it all together, LINQ Standard Query Operators

Time for action — getting the LINQPad

Time for action — finding all names with *am*

Time for action — finding all vowels

Time for action — finding all running processes matching a Regex

Time for action — playing with the indexed version of Where()

Time for action — learn how to go about creating a Where() clause

Time for action — let's say "Hello" to your buddies

Time for action — radio "Lucky Caller" announcement

Time for action — flattening a dictionary

Time for action — leaving the first few elements

Time for action — picking conditionally

Time for action — skipping save looping

Time for action — reversing word-by-word

Time for action — checking whether a given string is a palindrome or not

Time for action — sorting names alphabetically

Time for action — sorting 2D points by their co-ordinates

Time for action — sorting a list of fruits

Time for action — indexing an array of strings

Time for action — grouping by length

Time for action — finding common names from two names' lists

Time for action — finding all names from the list, removing duplicates

Time for action — pulling it all together including duplicates

Time for action — finding all names that appear mutually exclusively

Time for action — removing duplicate song IDs from the list

Time for action — making sure it works!

Time for action — making a list out of IEnumerable<T>

Time for action — tagging names

Time for action — one-to-many mapping

Time for action — finding the first element that satisfies a condition

Time for action — getting acquainted with FirstOrDefault()

Time for action — checking whether a sequence is palindromic

Time for action — understanding ElementAt()

Time for action — check out DefaultIfEmpty()

Time for action — generating arithmetic progression ranges

Time for action — running a filter on a range

Time for action — let's go round and round with Repeat()

Time for action — checking whether there is only one item matching this pattern

Time for action — set to default if there is more than one matching elements

Time for action — checking Any()

Time for action — how to check whether all items match a condition

Summary

Observable Collections

Active change/Statistical change

Passive change/Non-statistical change

Data sensitive change

Time for action — creating a simple math question monitor

Time for action — creating the collections to hold questions

Time for action — attaching the event to monitor the collections

Time for action — dealing with the change as it happens

Time for action — putting it all together

Time for action — creating a Twitter browser

Time for action — creating the interface

Time for action — creating the TweetViewer user control design

Time for action — gluing the TweetViewer control

Time for action — putting everything together

Time for action — dealing with the change in the list of names in the first tab

Time for action — a few things to beware of at the form load

Time for action — things to do when names get added or deleted

Time for action — sharing the load and creating a task for each BackgroundWorker

Time for action — a sample run of the application

Summary

Concurrent Collections

Creating and running asynchronous tasks

Simulating a survey (which is, of course, simultaneous by nature)

Time for action — creating the blocks

Devising a data structure for finding the most in-demand item

Time for action — creating the concurrent move-to-front list

Time for action — simulating a bank queue with multiple tellers

Time for action — making our bank queue simulator more useful

Be a smart consumer, don't wait till you have it all

Exploring data structure mapping

Summary

Power Collections

Setting up the environment

BinarySearch()

Time for action — finding a name from a list of names

CartesianProduct()

Time for action — generating names of all the 52 playing cards

RandomShuffle()

Time for action — randomly shuffling the deck

NCopiesOf()

Time for action — creating random numbers of any given length

Time for action — creating a custom random number generator

ForEach()

Time for action — creating a few random numbers of given any length

Rotate() and RotateInPlace()

Time for action — rotating a word

Time for action — creating a word guessing game

RandomSubset()

Time for action — picking a set of random elements

Reverse()

Time for action — reversing any collection

EqualCollections()

Time for action — revisiting the palindrome problem

DisjointSets()

Time for action — checking for common stuff

Time for action — finding anagrams the easiest way

Creating an efficient arbitrary floating point representation

Time for action — creating a huge number API

Creating an API for customizable default values

Time for action — creating a default value API

Mapping data structure

Algorithm conversion strategy

Summary

C5 Collections

Setting up the environment

Time for action — cloning Gender Genie!

Time for action — revisiting the anagram problem

Time for action — Google Sets idea prototype

Time for action — finding the most sought-after item

Sorting algorithms

Summary

Patterns, Practices, and Performance

Generic container patterns

A special Tuple<> pattern

Time for action — refactoring deeply nested if-else blocks

Best practices when using Generics

Selecting a generic collection

Best practices when creating custom generic collections

Performance analysis

How would we do this investigation?

Benchmarking experiment 1

Benchmarking experiment 2

Benchmarking experiment 3

Benchmarking experiment 4

Benchmarking experiment 5

Benchmarking experiment 6

Benchmarking experiment 7

Benchmarking experiment 8

Benchmarking experiment 9

Summary

Performance Cheat Sheet

Parameters to consider

Migration Cheat Sheet

Pop Quiz Answers

Chapter 2

Chapter 3

Chapter 4

Customer Reviews

5 star

4 star

3 star

2 star

1 star

An analogy

Here is an interesting analogy. Assume that there is a model hand pattern:

If we fill the pattern with clay, we get a clay-modeled hand. If we fill it with bronze, we get a hand model replica made of bronze. Although the material in these two hand models are very different, they share the same pattern (or they were created using the same algorithm, if you would agree to that term, in a broader sense).

Reason 1: Generics can save you a lot of typing

Extrapolating the algorithm part, let's say we have to implement some sorting algorithm; however, data types can vary for the input. To solve this, you can use overloading, as follows:

//Overloaded sort methods
private int[] Sort(int[] inputArray)
{
//Sort input array in-place
//and return the sorted array
return inputArray;
}
private float[] Sort(float[] inputArray)
{
//Sort input array in-place
//and return the sorted array
return inputArray;
}

However, you have to write the same code for all numeric data types supported by .NET. That's bad. Wouldn't it be cool if the compiler could somehow be instructed at compile time to yield the right version for the given data type at runtime? That's what Generics is about. Instead of writing the same method for all data types, you can create one single method with a symbolic data type. This will instruct the compiler to yield a specific code for the specific data type at runtime, as follows:

private T[] Sort<T>(T[] inputArray)
{
//Sort input array in-place
//and return the sorted array
return inputArray;
}

T is short for Type. If you replace T with anything, it will still compile; because it's the symbolic name for the generic type that will get replaced with a real type in the .NET type system at runtime.

So once we have this method, we can call it as follows:

int[] inputArray = { 1, 2, 0, 3 };
inputArray = Sort<int>(inputArray);

However, if you hover your mouse pointer right after the first brace ((), you can see in the tooltip, the expected type is already int[], as shown in the following screenshot:

That's the beauty of Generics. As we had mentioned int inside< and>, the compiler now knows for sure that it should expect only an int[] as the argument to the Sort<T> () method.

However, if you change int to float, you will see that the expectation of the compiler also changes. It then expects a float[] as the argument, as shown:

Now if you think you can fool the compiler by passing an integer array while it is asking for a float, you are wrong. That's blocked by compiler-time type checking. If you try something similar to the following:

You will get the following compiler error:

Argument 1: cannot convert from 'int[]' to 'float[]'

This means that Generics ensures strong type safety and is an integral part of the .NET framework, which is type safe.

Reason 2: Generics can save you type safety woes, big time

The previous example was about a sorting algorithm that doesn't change with data type. There are other things that become easier while dealing with Generics.

There are broadly two types of operations that can be performed on a list of elements:

1. Location centric operations
2. Data centric operations

Adding some elements at the front and deleting elements at an index are a couple of examples of location-centric operations on a list of data. In such operations, the user doesn't need to know about the data. It's just some memory manipulation at best.

However, if the request is to delete every odd number from a list of integers, then that's a data-centric operation. To be able to successfully process this request, the method has to know how to determine whether an integer is odd or not. This might sound trivial for an integer; however, the point is the logic of determining whether an element is a candidate for deletion or not, is not readily known to the compiler. It has to be delegated.

Before Generics appeared in .NET 2.0, people were using (and unfortunately these are still in heavy use) non-generic collections that are capable of storing a list of objects.

As an object sits at the top of the hierarchy in the .NET object model, this opens floodgates. If such a list exists and is exposed, people can put in just about anything in that list and the compiler won't complain a bit, because to the compiler everything is fine as they are all objects.

So, if a loosely typed collection such as ArrayList is used to store objects of type T, then for any data-centric operation, these must be down-casted to T again. Now, if somehow an entry that is not T, is put into the list, then this down-casting will result in an exception at runtime.

Suppose, I want to maintain a list of my students, then we can do that by using ArrayList to store a list of such Student objects:

class Student
{
public char Grade
{
get; set;
}
public int Roll
{
get; set;
}
public string Name
{
get; set;
}
}
//List of students
ArrayList studentList = new ArrayList();
Student newStudent = new Student();
newStudent.Name = "Dorothy";
newStudent.Roll = 1;
newStudent.Grade = 'A';
studentList.Add(newStudent);
newStudent = new Student();
newStudent.Name = "Sam";
newStudent.Roll = 2;
newStudent.Grade ='B';
studentList.Add(newStudent);
foreach (Object s in studentList)
{
//Type-casting. If s is anything other than a student
//or a derived class, this line will throw an exception.
//This is a data centric operation.
Student currentStudent = (Student)s;
Console.WriteLine("Roll # " + currentStudent.Roll + " " + currentStudent.Name + " Scored a " + curr entStudent.Grade);
}

What's the problem with this approach?

All this might look kind of okay, because we have been taking great care not to put anything else in the list other than Student objects. So, while we de-reference them after boxing, we don't see any problem. However, as the ArrayList can take any object as the argument, we could, by mistake, write something similar to the following:

studentList.Add("Generics"); //Fooling the compiler

As ArrayList is a loosely typed collection, it doesn't ensure compile-time type checking. So, this code won't generate any compile-time warning, and eventually it will throw the following exception at runtime when we try to de-reference this, to put in a Student object.

Then, it will throw an InvalidCastException:

What the exception in the preceding screenshot actually tells us is that Generics is a string and it can't cast that to Student, for the obvious reason that the compiler has no clue how to convert a string to a Student object.

Unfortunately, this only gets noticed by the compiler during runtime. With Generics, we can catch this sort of error early on at compile time.

Following is the generic code to maintain that list:

//Creating a generic list of type "Student".
//This is a strongly-typed-collection of type "Student".
//So nothing, except Student or derived class objects from Student
//can be put in this list myStudents
List<Student> myStudents = new List<Student>();
//Adding a couple of students to the list
Student newStudent = new Student();
newStudent.Name = "Dorothy";
newStudent.Roll = 1;
newStudent.Grade = 'A';
myStudents.Add(newStudent);
newStudent = new Student();
newStudent.Name = "Sam";
newStudent.Roll = 2;
newStudent.Grade = 'B';
myStudents.Add(newStudent);
//Looping through the list of students
foreach (Student currentStudent in myStudents)
{
//There is no need to type cast. Because compiler
//already knows that everything inside this list
//is a Student.
Console.WriteLine("Roll # " + currentStudent.Roll + " " + currentStudent.Name + " Scored a " + currentStudent.Grade);
}

The reasons mentioned earlier are the basic benefits of Generics. Also with Generics, language features such as LINQ and completely new languages such as F# came into existence. So, this is important. I hope you are convinced that Generics is a great programming tool and you are ready to learn it.

Reason 3: Generics leads to faster code

In the .NET Framework, everything is an object so it's okay to throw in anything to the non-generic loosely typed collection such as ArrayList, as shown in the previous example. This means we have to box (up-cast to object for storing things in the Arraylist; this process is implicit) and unbox (down-cast the object to the desired object type). This leads to slower code.

Here is the result of an experiment. I created two lists, one ArrayList and one List<int> to store integers:

And following is the data that drove the preceding graph:

ArrayList	List<T>
1323	185
1303	169
1327	172
1340	169
1302	172

The previous table mentions the total time taken in milliseconds to add 10,000,000 elements to the list. Clearly, generic collection is about seven times faster.

Reason 4: Generics is now ubiquitous in the .NET ecosystem

Look around. If you care to develop any non-trivial application, you are better off using some of the APIs built for the specific job at hand. Most of the APIs available rely heavily on strong typing and they achieve this through Generics. We shall discuss some of these APIs (LINQ, PowerCollections, C5) that are being predominantly used by the .NET community in this book.

So far, I have been giving you reasons to learn Generics. At this point, I am sure, you are ready to experiment with .NET Generics. Please check out the instructions in the next section to install the necessary software if you don't have it already.

.NET 4.0 Generics Beginner's Guide

By : Sudipta Mukherjee

.NET 4.0 Generics Beginner's Guide

By: Sudipta Mukherjee

Overview of this book

Related Content you might be interested in

Current Title:

.NET 4.0 Generics Beginner's Guide

An analogy

Reason 1: Generics can save you a lot of typing

Reason 2: Generics can save you type safety woes, big time

What's the problem with this approach?

Reason 3: Generics leads to faster code

Reason 4: Generics is now ubiquitous in the .NET ecosystem