Linux Shell Scripting Cookbook - Third Edition

By : Clif Flynt, Sarath Lakshman, Shantanu Tushar

Linux Shell Scripting Cookbook - Third Edition

By: Clif Flynt, Sarath Lakshman, Shantanu Tushar

Overview of this book

The shell is the most powerful tool your computer provides. Despite having it at their fingertips, many users are unaware of how much the shell can accomplish. Using the shell, you can generate databases and web pages from sets of files, automate monotonous admin tasks such as system backups, monitor your system's health and activity, identify network bottlenecks and system resource hogs, and more. This book will show you how to do all this and much more. This book, now in its third edition, describes the exciting new features in the newest Linux distributions to help you accomplish more than you imagine. It shows how to use simple commands to automate complex tasks, automate web interactions, download videos, set up containers and cloud servers, and even get free SSL certificates. Starting with the basics of the shell, you will learn simple commands and how to apply them to real-world issues. From there, you'll learn text processing, web interactions, network and system monitoring, and system tuning. Software engineers will learn how to examine system applications, how to use modern software management tools such as git and fossil for their own work, and how to submit patches to open-source projects. Finally, you'll learn how to set up Linux Containers and Virtual machines and even run your own Cloud server with a free SSL Certificate from letsencrypt.org.

Preface

What this book covers

What you need for this book

Free Chapter

Shell Something Out

Introduction

Displaying output in a terminal

Using variables and environment variables

Function to prepend to environment variables

Math with the shell

Playing with file descriptors and redirection

Arrays and associative arrays

Visiting aliases

Grabbing information about the terminal

Getting and setting dates and delays

Debugging the script

Functions and arguments

Sending output from one command to another

Reading n characters without pressing the return key

Running a command until it succeeds

Field separators and iterators

Comparisons and tests

Customizing bash with configuration files

Have a Good Command

Introduction

Concatenating with cat

Recording and playing back terminal sessions

Finding files and file listing

Playing with xargs

Translating with tr

Checksum and verification

Cryptographic tools and hashes

Sorting unique and duplicate lines

Temporary file naming and random numbers

Splitting files and data

Slicing filenames based on extensions

Renaming and moving files in bulk

Spell–checking and dictionary manipulation

Automating interactive input

Making commands quicker by running parallel processes

Examining a directory, files and subdirectories in it

File In, File Out

Introduction

Generating files of any size

The intersection and set difference (A-B) on text files

Finding and deleting duplicate files

Working with file permissions, ownership, and the sticky bit

Making files immutable

Generating blank files in bulk

Finding symbolic links and their targets

Enumerating file type statistics

Using loopback files

Creating ISO files and hybrid ISO

Finding the difference between files, and patching

Using head and tail for printing the last or first 10 lines

Listing only directories - alternative methods

Fast command-line navigation using pushd and popd

Counting the number of lines, words, and characters in a file

Printing the directory tree

Manipulating video and image files

Texting and Driving

Introduction

Using regular expressions

Searching and mining text inside a file with grep

Cutting a file column-wise with cut

Using sed to perform text replacement

Using awk for advanced text processing

Finding the frequency of words used in a given file

Compressing or decompressing JavaScript

Merging multiple files as columns

Printing the nth word or column in a file or line

Printing text between line numbers or patterns

Printing lines in the reverse order

Parsing e-mail address and URLs from text

Removing a sentence in a file containing a word

Replacing a pattern with text in all the files in a directory

Text slicing and parameter operations

Tangled Web? Not At All!

Introduction

Downloading from a web page

Downloading a web page as plain text

A primer on cURL

Accessing unread Gmail e-mails from the command line

Parsing data from a website

Image crawler and downloader

Web photo album generator

Twitter command-line client

Accessing word definitions via a web server

Finding broken links in a website

Tracking changes to a website

Posting to a web page and reading the response

Downloading a video from the Internet

Summarizing text with OTS

Translating text from the command line

Repository Management

Introduction

Creating a new git repository

Cloning a remote git repository

Adding and committing changes with git

Creating and merging branches with git

Sharing your work

Pushing a branch to a server

Checking the status of a git repository

Viewing git history

Finding bugs

Tagging snapshots

Committing message ethics

Using fossil

Creating a new fossil repository

Cloning a remote fossil repository

Opening a fossil project

Adding and committing changes with fossil

Using branches and forks with fossil

Sharing your work with fossil

Updating your local fossil repository

Checking the status of a fossil repository

Viewing fossil history

The Backup Plan

Introduction

Archiving with tar

Archiving with cpio

Compressing data with gzip

Archiving and compressing with zip

Faster archiving with pbzip2

Creating filesystems with compression

Backing up snapshots with rsync

Differential archives

Creating entire disk images using fsarchiver

The Old-Boy Network

Introduction

Setting up the network

Let us ping!

Tracing IP routes

Listing all available machines on a network

Running commands on a remote host with SSH

Running graphical commands on a remote machine

Transferring files through the network

Connecting to a wireless network

Password-less auto-login with SSH

Port forwarding using SSH

Mounting a remote drive at a local mount point

Network traffic and port analysis

Measuring network bandwidth

Creating arbitrary sockets

Building a bridge

Sharing an Internet connection

Basic firewall using iptables

Creating a Virtual Private Network

Put On the Monitors Cap

Introduction

Monitoring disk usage

Calculating the execution time for a command

Collecting information about logged in users, boot logs, and boot failures

Listing the top ten CPU– consuming processes in an hour

Monitoring command outputs with watch

Logging access to files and directories

Logging with syslog

Managing log files with logrotate

Monitoring user logins to find intruders

Monitoring remote disk usage health

Determining active user hours on a system

Measuring and optimizing power usage

Monitoring disk activity

Checking disks and filesystems for errors

Examining disk health

Getting disk statistics

Administration Calls

Introduction

Gathering information about processes

What's what – which, whereis, whatis, and file

Killing processes, and sending and responding to signals

Sending messages to user terminals

The /proc filesystem

Gathering system information

Scheduling with a cron

Database styles and uses

Writing and reading SQLite databases

Writing and reading a MySQL database from Bash

User administration scripts

Bulk image resizing and format conversion

Taking screenshots from the terminal

Managing multiple terminals from one

Tracing the Clues

Introduction

Tracing packets with tcpdump

Finding packets with ngrep

Tracing network routes with ip

Tracing system calls with strace

Tracing dynamic library functions with ltrace

Tuning a Linux System

Introduction

Identifying services

Gathering socket data with ss

Gathering system I/O usage with dstat

Identifying a resource hog with pidstat

Tuning the Linux kernel with sysctl

Tuning a Linux system with config files

Changing scheduler priority using the nice command

Containers, Virtual Machines, and the Cloud

Introduction

Using Linux containers

Using Docker

Using Virtual Machines in Linux

Linux in the cloud

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Playing with file descriptors and redirection

File descriptors are integers associated with the input and output streams. The best-known file descriptors are stdin, stdout, and stderr. The contents of one stream can be redirected to another. This recipe shows examples on how to manipulate and redirect with file descriptors.

Getting ready

Shell scripts frequently use standard input (stdin), standard output (stdout), and standard error (stderr). A script can redirect output to a file with the greater-than symbol. Text generated by a command may be normal output or an error message. By default, both normal output (stdout) and error messages (stderr) are sent to the display. The two streams can be separated by specifying a specific descriptor for each stream.

File descriptors are integers associated with an opened file or data stream. File descriptors 0, 1, and 2 are reserved, as given here:

0: stdin
1: stdout
2: stderr

How to do it...

Use the greater-than symbol to append text to a file:

        $ echo "This is a sample text 1" > temp.txt

This stores the echoed text in temp.txt. If temp.txt already exists, the single greater-than sign will delete any previous contents.

Use double-greater-than to append text to a file:

        $ echo "This is sample text 2" >> temp.txt

Use cat to view the contents of the file:

        $ cat temp.txt
        This is sample text 1
        This is sample text 2

The next recipes demonstrate redirecting stderr. A message is printed to the stderr stream when a command generates an error message. Consider the following example:

$ ls +
ls: cannot access +: No such file or directory

Here + is an invalid argument and hence an error is returned.

Successful and unsuccessful commands
When a command exits because of an error, it returns a nonzero exit status. The command returns zero when it terminates after successful completion. The return status is available in the special variable $? (run echo $? immediately after the command execution statement to print the exit status).

The following command prints the stderr text to the screen rather than to a file (and because there is no stdout output, out.txt will be empty):

$ ls + > out.txt
ls: cannot access +: No such file or directory

In the following command, we redirect stderr to out.txt with 2> (two greater-than):

$ ls + 2> out.txt # works

You can redirect stderr to one file and stdout to another file.

$ cmd 2>stderr.txt 1>stdout.txt

It is also possible to redirect stderr and stdout to a single file by converting stderr to stdout using this preferred method:

$ cmd 2>&1 allOutput.txt

This can be done even using an alternate approach:

$ cmd &> output.txt

If you don't want to see or save any error messages, you can redirect the stderr output to /dev/null, which removes it completely. For example, consider that we have three files a1, a2, and a3. However, a1 does not have the read-write-execute permission for the user. To print the contents of all files starting with the letter a, we use the cat command. Set up the test files as follows:

$ echo A1 > a1
$ echo A2 > a2
$ echo A3 > a3
$ chmod 000 a1  #Deny all permissions

Displaying the contents of the files using wildcards (a*), will generate an error message for the a1 file because that file does not have the proper read permission:

$ cat a*
cat: a1: Permission denied
A2
A3

Here, cat: a1: Permission denied belongs to the stderr data. We can redirect the stderr data into a file, while sending stdout to the terminal.

$ cat a* 2> err.txt #stderr is redirected to err.txt
A2
A3

$ cat err.txt
cat: a1: Permission denied

Some commands generate output that we want to process and also save for future reference or other processing. The stdout stream is a single stream that we can redirect to a file or pipe to another program. You might think there is no way for us to have our cake and eat it too.

However, there is a way to redirect data to a file, while providing a copy of redirected data as stdin to the next command in a pipe. The tee command reads from stdin and redirects the input data to stdout and one or more files.

command | tee FILE1 FILE2 | otherCommand

In the following code, the stdin data is received by the tee command. It writes a copy of stdout to the out.txt file and sends another copy as stdin for the next command. The cat -n command puts a line number for each line received from stdin and writes it into stdout:

$ cat a* | tee out.txt | cat -n
cat: a1: Permission denied
     1 A2
     2 A3

Use cat to examine the contents of out.txt:

$ cat out.txt
A2
A3

Observe that cat: a1: Permission denied does not appear, because it was sent to stderr. The tee command reads only from stdin.

By default, the tee command overwrites the file. Including the -a option will force it to append the new data.

$ cat a* | tee -a out.txt | cat -n

Commands with arguments follow the format: command FILE1 FILE2 ... or simply command FILE.

To send two copies of the input to stdout, use - for the filename argument:

$ cmd1 | cmd2 | cmd -

Consider this example:

$ echo who is this | tee -
who is this
who is this

Alternately, we can use /dev/stdin as the output filename to use stdin.
Similarly, use /dev/stderr for standard error and /dev/stdout for standard output. These are special device files that correspond to stdin, stderr, and stdout.

How it works...

The redirection operators (> and >>) send output to a file instead of the terminal. The > and >> operators behave slightly differently. Both redirect output to a file, but the single greater-than symbol (>) empties the file and then writes to it, whereas the double greater-than symbol (>>) adds the output to the end of the existing file.

By default, the redirection operates on standard output. To explicitly take a specific file descriptor, you must prefix the descriptor number to the operator.

The > operator is equivalent to 1> and similarly it applies for >> (equivalent to 1>>).

When working with errors, the stderr output is dumped to the /dev/null file. The ./dev/null file is a special device file where any data received by the file is discarded. The null device is often known as a black hole, as all the data that goes into it is lost forever.

There's more...

Commands that read input from stdin can receive data in multiple ways. It is possible to specify file descriptors of our own, using cat and pipes. Consider this example:

$ cat file | cmd
$ cmd1 | cmd2

Redirection from a file to a command

We can read data from a file as stdin with the less-than symbol (<):

$ cmd < file

Redirecting from a text block enclosed within a script

Text can be redirected from a script into a file. To add a warning to the top of an automatically generated file, use the following code:

#!/bin/bash 
cat<<EOF>log.txt 
This is a generated file. Do not edit. Changes will be overwritten. 
EOF

The lines that appear between cat <<EOF >log.txt and the next EOF line will appear as the stdin data. The contents of log.txt are shown here:

$ cat log.txt 
This is a generated file. Do not edit. Changes will be overwritten.

Custom file descriptors

A file descriptor is an abstract indicator for accessing a file. Each file access is associated with a special number called a file descriptor. 0, 1, and 2 are reserved descriptor numbers for stdin, stdout, and stderr.

The exec command can create new file descriptors. If you are familiar with file access in other programming languages, you may be familiar with the modes for opening files. These three modes are commonly used:

Read mode
Write with append mode
Write with truncate mode

The < operator reads from the file to stdin. The > operator writes to a file with truncation (data is written to the target file after truncating the contents). The >> operator writes to a file by appending (data is appended to the existing file contents and the contents of the target file will not be lost). File descriptors are created with one of the three modes.

Create a file descriptor for reading a file:

$ exec 3<input.txt # open for reading with descriptor number 3

We can use it in the following way:

$ echo this is a test line > input.txt
$ exec 3<input.txt

Now you can use file descriptor 3 with commands. For example, we will use cat<&3:

$ cat<&3
this is a test line

If a second read is required, we cannot reuse the file descriptor 3. We must create a new file descriptor (perhaps 4) with exec to read from another file or re-read from the first file.

Create a file descriptor for writing (truncate mode):

$ exec 4>output.txt # open for writing

Consider this example:

$ exec 4>output.txt
$ echo newline >&4
$ cat output.txt
newline

Now create a file descriptor for writing (append mode):

$ exec 5>>input.txt

Consider the following example:

$ exec 5>>input.txt
$ echo appended line >&5
$ cat input.txt
newline
appended line

Linux Shell Scripting Cookbook - Third Edition

By : Clif Flynt, Sarath Lakshman, Shantanu Tushar

Linux Shell Scripting Cookbook - Third Edition

By: Clif Flynt, Sarath Lakshman, Shantanu Tushar

Overview of this book

Related Content you might be interested in

Current Title:

Linux Shell Scripting Cookbook - Third Edition

Bash Cookbook

CentOS Quick Start Guide

Learning Linux Shell Scripting