Book Image

Learning AWK Programming

By : Shiwang Kalkhanda
5 (1)
Book Image

Learning AWK Programming

5 (1)
By: Shiwang Kalkhanda

Overview of this book

AWK is one of the most primitive and powerful utilities which exists in all Unix and Unix-like distributions. It is used as a command-line utility when performing a basic text-processing operation, and as programming language when dealing with complex text-processing and mining tasks. With this book, you will have the required expertise to practice advanced AWK programming in real-life examples. The book starts off with an introduction to AWK essentials. You will then be introduced to regular expressions, AWK variables and constants, arrays and AWK functions and more. The book then delves deeper into more complex tasks, such as printing formatted output in AWK, control flow statements, GNU's implementation of AWK covering the advanced features of GNU AWK, such as network communication, debugging, and inter-process communication in the GAWK programming language which is not easily possible with AWK. By the end of this book, the reader will have worked on the practical implementation of text processing and pattern matching using AWK to perform routine tasks.
Table of Contents (11 chapters)

Case-sensitive matching

In bracket expressions, we can use both lowercase and uppercase characters to match the characters for case-insensitive results, but the same is not true for other regular expressions. However, this becomes difficult if you have to match to many characters. Here comes the two built-in string functions of AWK, tolower() and toupper(). Using these two functions, we can perform case-insensitive match operations.

For example, to match the first name 'John' of an employee in the emp.dat file, we can perform a case-insensitive match using the tolower() function, as follows:

$ awk 'tolower($1) ~ /john/' emp.dat

The output on execution of this code is as follows:

John    Kapur   9911556789  [email protected]      M   hr      2200

Another way to perform a case-insensitive match operation is by setting the IGNORECASE bash variable value to TRUE.

...