.NET 4.0 Generics Beginner's Guide

.NET 4.0 Generics Beginner's Guide

By : Sudipta Mukherjee

Buy this Book

.NET 4.0 Generics Beginner's Guide

By: Sudipta Mukherjee

Buy this Book

Overview of this book

Generics were added as part of .NET Framework 2.0 in November 2005. Although similar to generics in Java, .NET generics do not apply type erasure but every object has unique representation at run-time. There is no performance hit from runtime casts and boxing conversions, which are normally expensive..NET offers type-safe versions of every classical data structure and some hybrid ones. This book will show you everything you need to start writing type-safe applications using generic data structures available in Generics API. You will also see how you can use several collections for each task you perform. This book is full of practical examples, interesting applications, and comparisons between Generics and more traditional approaches. Finally, each container is bench marked on the basis of performance for a given task, so you know which one to use and when. This book first covers the fundamental concepts such as type safety, Generic Methods, and Generic Containers. As the book progresses, you will learn how to join several generic containers to achieve your goals and query them efficiently using Linq. There are short exercises in every chapter to boost your knowledge. The book also teaches you some best practices, and several patterns that are commonly available in generic code. Some important generic algorithm definitions are present in Power Collection (an API created by Wintellect Inc.) that are missing from .NET framework. This book shows you how to use such algorithms seamlessly with other generic containers. The book also discusses C5 collections. Java Programmers will find themselves at home with this API. This is the closest to JCF. Some very interesting problems are solved using generic containers from .NET framework, C5, and PowerCollection Algorithms ñ a clone of Google Set and Gender Genie for example! The author has also created a website (http://www.consulttoday.com/genguide) for the book where you can find many useful tools, code snippets, and, applications, which are not the part of code-download section

.NET 4.0 Generics

Credits

Foreword

About the Author

Acknowledgement

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Why Generics?

An analogy

Setting up the environment

Summary

Lists

Why bother learning about generic lists?

Types of generic lists

Checking whether a sequence is a palindrome or not

Time for action — creating the generic stack as the buffer

Time for action — completing the rest of the method

Designing a generic anagram finder

Time for action — creating the method

Life is full of priorities, let's bring some order there

Time for action — creating the data structure for the prioritized shopping list

Time for action — let's add some gadgets to the list and see them

Time for action — let's strike off the gadgets with top-most priority after we have bought them

Time for action — let's create an appointment list

Live sorting and statistics for online bidding

Time for action — let's create a custom class for live sorting

Why did we have three LinkedList<T> as part of the data structure?

An attempt to answer questions asked by your boss

Time for action — associating products with live sorted bid amounts

Time for action — finding common values across different bidding amount lists

You will win every scrabble game from now on

Time for action — creating the method to find the character histogram of a word

Time for action — checking whether a word can be formed

Time for action — let's see whether it works

Trying to fix an appointment with a doctor?

Time for action — creating a set of dates of the doctors' availability

Time for action — finding out when both doctors shall be present

Revisiting the anagram problem

Time for action — re-creating the anagram finder

Lists under the hood

Summary

Dictionaries

Types of generic associative structures

Creating a tag cloud generator using dictionary

Time for action — creating the word histogram

Creating a bubble wrap popper game

Time for action — creating the game console

Look how easy it was!

How did we decide we need a dictionary and not a list?

Let's build a generic autocomplete service

Time for action — creating a custom dictionary for autocomplete

Time for action — creating a class for autocomplete

The most common pitfall. Don't fall there!

Let's play some piano

Time for action — creating the keys of the piano

How are we recording the key strokes?

Time for action — switching on recording and playing recorded keystrokes

C# Dictionaries can help detect cancer. Let's see how!

Time for action — creating the KNN API

Time for action — getting the patient records

Time for action — creating the helper class to read a delimited file

Time for action — let's see how to use the predictor

Tuples are great for many occasions including games

Time for action — putting it all together

Why have we used Tuples?

How did we figure out whether the game is over or not?

Summary

LINQ to Objects

What makes LINQ?

Time for action — creating an Extension method

Time for action — consuming our new Extension method

Putting it all together, LINQ Standard Query Operators

Time for action — getting the LINQPad

Time for action — finding all names with *am*

Time for action — finding all vowels

Time for action — finding all running processes matching a Regex

Time for action — playing with the indexed version of Where()

Time for action — learn how to go about creating a Where() clause

Time for action — let's say "Hello" to your buddies

Time for action — radio "Lucky Caller" announcement

Time for action — flattening a dictionary

Time for action — leaving the first few elements

Time for action — picking conditionally

Time for action — skipping save looping

Time for action — reversing word-by-word

Time for action — checking whether a given string is a palindrome or not

Time for action — sorting names alphabetically

Time for action — sorting 2D points by their co-ordinates

Time for action — sorting a list of fruits

Time for action — indexing an array of strings

Time for action — grouping by length

Time for action — finding common names from two names' lists

Time for action — finding all names from the list, removing duplicates

Time for action — pulling it all together including duplicates

Time for action — finding all names that appear mutually exclusively

Time for action — removing duplicate song IDs from the list

Time for action — making sure it works!

Time for action — making a list out of IEnumerable<T>

Time for action — tagging names

Time for action — one-to-many mapping

Time for action — finding the first element that satisfies a condition

Time for action — getting acquainted with FirstOrDefault()

Time for action — checking whether a sequence is palindromic

Time for action — understanding ElementAt()

Time for action — check out DefaultIfEmpty()

Time for action — generating arithmetic progression ranges

Time for action — running a filter on a range

Time for action — let's go round and round with Repeat()

Time for action — checking whether there is only one item matching this pattern

Time for action — set to default if there is more than one matching elements

Time for action — checking Any()

Time for action — how to check whether all items match a condition

Summary

Observable Collections

Active change/Statistical change

Passive change/Non-statistical change

Data sensitive change

Time for action — creating a simple math question monitor

Time for action — creating the collections to hold questions

Time for action — attaching the event to monitor the collections

Time for action — dealing with the change as it happens

Time for action — putting it all together

Time for action — creating a Twitter browser

Time for action — creating the interface

Time for action — creating the TweetViewer user control design

Time for action — gluing the TweetViewer control

Time for action — putting everything together

Time for action — dealing with the change in the list of names in the first tab

Time for action — a few things to beware of at the form load

Time for action — things to do when names get added or deleted

Time for action — sharing the load and creating a task for each BackgroundWorker

Time for action — a sample run of the application

Summary

Concurrent Collections

Creating and running asynchronous tasks

Simulating a survey (which is, of course, simultaneous by nature)

Time for action — creating the blocks

Devising a data structure for finding the most in-demand item

Time for action — creating the concurrent move-to-front list

Time for action — simulating a bank queue with multiple tellers

Time for action — making our bank queue simulator more useful

Be a smart consumer, don't wait till you have it all

Exploring data structure mapping

Summary

Power Collections

Setting up the environment

BinarySearch()

Time for action — finding a name from a list of names

CartesianProduct()

Time for action — generating names of all the 52 playing cards

RandomShuffle()

Time for action — randomly shuffling the deck

NCopiesOf()

Time for action — creating random numbers of any given length

Time for action — creating a custom random number generator

ForEach()

Time for action — creating a few random numbers of given any length

Rotate() and RotateInPlace()

Time for action — rotating a word

Time for action — creating a word guessing game

RandomSubset()

Time for action — picking a set of random elements

Reverse()

Time for action — reversing any collection

EqualCollections()

Time for action — revisiting the palindrome problem

DisjointSets()

Time for action — checking for common stuff

Time for action — finding anagrams the easiest way

Creating an efficient arbitrary floating point representation

Time for action — creating a huge number API

Creating an API for customizable default values

Time for action — creating a default value API

Mapping data structure

Algorithm conversion strategy

Summary

C5 Collections

Setting up the environment

Time for action — cloning Gender Genie!

Time for action — revisiting the anagram problem

Time for action — Google Sets idea prototype

Time for action — finding the most sought-after item

Sorting algorithms

Summary

Patterns, Practices, and Performance

Generic container patterns

A special Tuple<> pattern

Time for action — refactoring deeply nested if-else blocks

Best practices when using Generics

Selecting a generic collection

Best practices when creating custom generic collections

Performance analysis

How would we do this investigation?

Benchmarking experiment 1

Benchmarking experiment 2

Benchmarking experiment 3

Benchmarking experiment 4

Benchmarking experiment 5

Benchmarking experiment 6

Benchmarking experiment 7

Benchmarking experiment 8

Benchmarking experiment 9

Summary

Performance Cheat Sheet

Parameters to consider

Migration Cheat Sheet

Pop Quiz Answers

Chapter 2

Chapter 3

Chapter 4

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Appendix A. Performance Cheat Sheet

List<T>

Methods	Complexity (for n elements and count m)
`Add`	`O(1)`
`AddRange`	`O(n + m)`
`AsReadOnly`	`O(1)`
`BinarySearch`	`O(log n)`
`Clear`	`O(n)`
`Contains`	`O(n)`
`ConvertAll`	`O(n)`
`CopyTo`	`O(n)`
`Exists`	`O(n)`
`Find`	`O(n)`
`FindAll`	`O(n)`
`FindIndex`	`O(n)`
`ForEach`	`O(n)`
`GetRange`	`O(n)`
`IndexOf`	`O(n)`
`Insert`	`O(n)`
`InsertRange`	`O(n + m)`
`LastIndexOf`	`O(n)`
`RemoveAll`	`O(n)`
`RemoveAt`	`O(n)`
`RemoveRange`	`O(n)`
`Reverse`	`O(n)`
`Sort`	`O(n log n)`
`ToArray`	`O(n)`
`TrimExcess`	`O(n)`
`TrueForAll`	`O(n)`

Stack<T>

Methods	Complexity (for n elements)
`Clear`	`O(n)`
`Contains`	`O(n)`
`CopyTo`	`O(n)`
`Peek`	`O(1)`
`Pop`	`O(1)`
`Push`	`O(1)`
`CopyTo`	`O(n)`
`ToArray`	`O(n)`
`TrimExcess`	`O(n)`

Queue<T>

Methods	Complexity (for n elements)
`Clear`	`O(n)`
`Contains`	`O(n)`
`CopyTo`	`O(n)`
`Dequeue`	`O(1)`
`Enqueue`	`O(1)`
`Peek`	`O(1)`
`ToArray`	`O(n)`

HashSet<T>

Methods	Complexity (n and m are number of elements)
`Add`	`O(1)`
`Clear`	`O(n)`
`Contains`	`O(1)`
`CopyTo`	`O(n)`
`ExceptWith`	`O(n)`
`IntersectWith`	`O(n + m)`
`IsProperSubsetOf`	`O(n)`
`IsProperSupersetOf`	`O(n)`
`IsSubsetOf`	`O(n + m)`
`IsSupersetOf`	`O(n + m)`
`Overlaps`	`O(n)`
`Remove`	`O(1)`
`RemoveWhere`	`O(n)`
`SetEquals`	`O(n)`
`SymmetricExceptWith`	`O(n + m)`
`TrimExcess`	`O(n)`
`UnionWith`	`O(n)`

SortedSet<T>

Methods	Complexity (for n elements and count m. l is lower bound and u is upper bound of the view)
`Add`	`O(log n)`
`Clear`	`O(n)`
`Contains`	`O(log n)`
`CopyTo`	`O(n)`
`ExceptWith`	`O(n)`
`GetViewBetween`	`O(u l)`
`IntersectWith`	`O(n)`
`IsProperSubsetOf`	`O(n)`
`IsProperSupersetOf`	`O(n + m)`
`IsSubsetOf`	`O(n + m)`
`IsSupersetOf`	`O(n + m)`
`Overlaps`	`O(n)`
`Remove`	`O(n)`
`RemoveWhere`	`O(n)`
`Reverse`	`O(1)`
`SetEquals`	`O(n + m)`
`SymmetricExceptWith`	`O(n + m)`
`UnionWith`	`O(n)`

Dictionary<TKey,TValue>

Methods	Complexity (for n elements)
`Add`	`O(1)`
`Clear`	`O(n)`
`ContainsKey`	`O(1)`
`ContainsValue`	`O(n)`
`Remove`	`O(1)`
`TryGetValue`	`O(1)`

SortedDictionary<TKey,TValue>

Methods	Complexity (for n elements)
`Add`	`O(log n)`
`Clear`	`O(1)`
`ContainsKey`	`O(log n)`
`ContainsValue`	`O(n)`
`Remove`	`O(log n)`
`TryGetValue`	`O(log n)`

Parameters to consider

The following are the top 20 parameters to consider when selecting a generic collection:

1. Simple or associative
2. Random access capability
3. Lookup speed
4. Random insertion speed
5. Edge insertion speed
6. Random deletion speed
7. Edge deletion speed
8. Speed to empty
9. Time to count
10. Zero-based indexing
11. Native sorting capability
12. Thread safety
13. Interoperability
14. Platform portability
15. Memory requirement
16. Construction versatility
17. Native bsearch support
18. Speed of set operations
19. Code readability
20. Least effort to migrate

None of these facilities come in a single collection. So you need to deal with calculated tread-off and strike a balance between computational cost and optimum performance.

The parameters are explained as follows:

1. Simple or associative:
If you want to store a few elements in a random order then you don’t need an associative collection. Simple lists are IList<T> based where as associative collections are IDictionary<TKey,TValue> based.
2. Random access capability:
If you regularly need to access elements, you would need a collection that supports this functionality to boost performance. Mostly, random access capability is offered by zero-based indexing. Containers that implement IList offer this functionality.
3. Lookup speed:
Lookup speed can be crucial in the selection of associative containers. More the speed, the better. Normally, hash-based implementations outperform tree-based or list-based implementations. Thus, accessing an element in SortedDictionary<TKey,TValue> is faster than SortedList<TKey,TValue>.
4. Random insertion speed:
If there are a lot of insertions at random locations in the collection, then you should consider how fast you can insert a few elements at arbitrary locations inside the collection. LinkedList<T> offers faster random insertion than List<T>.
5. Edge insertion speed:
If you know that there will be many insertions at the edges (start or end) of the collection, choose one that is programmed to offer a faster speed. For example, Stack<T>, Queue<T>, or LinkedList<T>. Inserting in an array-based container, such as List<T>, offers the worst performance as all the elements beyond the point of insertion have to be shifted.
6. Random deletion speed:
If there are a lot of deletions at random locations in the collection, then you should consider how fast you can delete a few elements at arbitrary locations inside the collection. LinkedList<T> offers faster random deletion than List<T>.
7. Edge deletion speed:
If deletions occur only at the extreme ends of the collection, then you should consider collections optimized for that, such as Stack<T>, Queue<T>, or LinkedList<T> over List<T>.
8. Speed to empty:
In many situations, you need to clear all the elements of the collection, perhaps inside a loop. In such situations, you should consider collections that take minimum time to clear all the elements.
9. Time to count:
It is crucial to count how many elements there are in the collection. If it takes O(n) that’s not good. In such situations, resort to collections that offer constant timecount operations. Luckily most of them do.
10. Zero-based indexing:
Zero-based indexing has become the habit of programmers of our time, thanks to C arrays and C++ vectors. If you need random access on a simple sequential collection, resort to the one that offers this, such as List<T>, rather than using the ElementAt() method.
11. Native sorting capability:
If you need to sort the collection every now and then, you can consider those that offer native sorting capabilities such as List<T> or resort to a sorted collection such as SortedSet<T>. It is better than using OrderBy() because a Lambda expression evaluation is generally slower.
12. Thread safety:
If your collection will be used in a multi-threaded environment, it is better to use new concurrent collections than a primitive locking mechanism.
13. Interoperability:
IUse collections that are more flexible, or in other words, that implement more interfaces. For example, if you need associative storage, then you can use SortedList<TKey,TValue>. However, SortedDictionary<TKey,TValue> is also good because it supports serialization. So, in future, it would be easy to serialize.
14. Platform portability:
If you are programming for a platform limited by memory, then some part of the framework will be absent. Be sure to check for the availability of the collection in the framework targeted to the hardware you are using.
15. Memory requirement:
If you are in a memory-crunched system, you should also check the storage requirement of the collection. If it takes up too much memory, you might have to resort to something else. For example, if all you need is to add elements at the end and process them from the front, a Queue<T> would do just fine and you don’t need a List<T> which is more heavyweight.
16. Construction versatility:
The easier it is to create the generic container, the better. It’s better if a collection offers more variations in constructor, because you always remain prepared for unprecedented situations. You would never know how much information you would need to create a collection.
17. Native bsearch support:
Binary search is crucial to finding an item in a long list. You can always use the BinarySearch() method of an Array class; however, if native support is available, that’s better. You would save a couple of boxing and unboxing calls.
18. Speed of set operations:
Although you can perform all set operations via LINQ, resort to one proper set implementation if you need a set.
19. Code readability:
Make sure to choose a collection that signals your intent more obviously than others, resulting in more readable code.
20. Least effort to migrate:
Remember that the element of least surprise always works. Select collections that are available conceptually in several other languages.

.NET 4.0 Generics Beginner's Guide

By : Sudipta Mukherjee

.NET 4.0 Generics Beginner's Guide

By: Sudipta Mukherjee

Overview of this book

Related Content you might be interested in

Current Title:

.NET 4.0 Generics Beginner's Guide

Appendix A. Performance Cheat Sheet

Parameters to consider