The Software Developer's Guide to Linux

By : David Cohen, Christian Sturm

5 (2)

Buy this Book

The Software Developer's Guide to Linux

5 (2)

By: David Cohen, Christian Sturm

Buy this Book

Overview of this book

Developers are always looking to raise their game to the next level, yet most are completely lost when it comes to the Linux command line. This book is the bridge that will take you to the next level in your software development career. Most of the skills in the book can be immediately put to work to make you a more efficient developer. It’s written specifically for software engineers, not Linux system administrators, so each chapter will equip you with just enough theory to understand what you’re doing before diving into practical commands that you can use in your day-to-day work as a software developer. As you work through the book, you’ll quickly absorb the basics of how Linux works while you get comfortable moving around the command line. Once you’ve got the core skills, you’ll see how to apply them in different contexts that you’ll come across as a software developer: building and working with Docker images, automating boring build tasks with shell scripts, and troubleshooting issues in production environments. By the end of the book, you’ll be able to use Linux and the command line comfortably and apply your newfound skills in your day-to-day work to save time, troubleshoot issues, and be the command-line wizard that your team turns to.

Preface

Who this book is for

What this book is not

What this book covers

To get the most out of this book

Get in touch

How the Command Line Works

In the beginning…was the REPL

Command-line syntax (read)

Command line vs. shell

Basic command-line skills

Getting help

Shell autocompletion

Conclusion

Free Chapter

Working with Processes

Process basics

Practical commands for working with Linux processes

Advanced process concepts and tools

Review – example troubleshooting session

Conclusion

Service Management with systemd

The basics

Processes and services

Using Shell History

Executing previous commands with !

Jumping to the beginning or end of the current line

Conclusion

Introducing Files

Files on Linux: the absolute basics

The filesystem tree

Basic filesystem operations

Editing files

File types

Advanced file operations

Advanced filesystem knowledge for the real world

Conclusion

Editing Files on the Command Line

Nano

Vi(m)

Editing a file you don’t have permissions for

Setting your preferred editor

Conclusion

Users and Groups

What is a user?

Root versus everybody else

sudo

What is a group?

Mini project: user and group management

Advanced: what is a user, really?

A note on scriptability

Conclusion

Ownership and Permissions

Deciphering a long listing

File attributes

Ownership

Permissions

Changing ownership (chown) and permissions (chmod)

Managing Installed Software

Working with software packages

Caution required – curl | bash

Compiling third-party software from source

Conclusion

Configuring Software

Configuration hierarchy

Command-line arguments

Environment variables

Configuration files

systemd units

Quick note: configuration in Docker

Conclusion

Pipes and Redirection

File descriptors

Input and output redirection (or, playing with file descriptors for fun and profit)

Connecting commands together with pipes (|)

The CLI tools you need to know

Practical pipe patterns

Advanced: inspecting file descriptors

Conclusion

Automating Tasks with Shell Scripts

Why you need Bash scripting basics

Basics

Bash versus other shells

Shebangs and executable text files, aka scripts

Testing

Conditionals: if/then/else

Loops

Functions

Input and output redirection

Variable interpolation syntax – ${}

Limitations of shell scripts

Conclusion

Citations

Secure Remote Access with SSH

Public key cryptography primer

SSH keys

Practical project: Set up a key-based login to a remote server

Converting SSH2 keys to the OpenSSH format

SSH-agent

Common SSH errors and the -v (verbose) argument

File transfer

Tunnels

The configuration file

Conclusion

Version Control with Git

Some background on Git

What is a distributed version control system?

Git basics

Terms you might come across

Best practices for commit messages

GUIs

Useful shell aliases

Poor man’s GitHub

Conclusion

Containerizing Applications with Docker

How containers work as packages

Prerequisite: Docker install

Docker crash course

Creating images with a Dockerfile

Container commands

Docker project: Python/Flask application container

Containers vs. virtual machines

A quick note on Docker image repositories

Painfully learned container lessons

Container theory: namespacing

How do we do Ops with containers?

Conclusion

Monitoring Application Logs

Introduction to logging

The systemd journal

Example journalctl commands

Logging in Docker containers

Load Balancing and HTTP

Basic terminology

Common misunderstandings about HTTP

Conclusion

Other Books You May Enjoy

Index

Customer Reviews

5 (2)

5 star

100%

4 star

3 star

2 star

1 star

Advanced process concepts and tools

This marks the beginning of the “advanced” section of this chapter. While you don’t need to master all the concepts in this section to work effectively with Linux processes, they can be extremely helpful. If you have a few extra minutes, we recommend at least familiarizing yourself with each one.

Signals

How does systemctl tell your web server to re-read its configuration files? How can you politely ask a process to shut down cleanly? And how can you kill a malfunctioning process immediately, because it’s bringing your production application to its knees?

In Unix and Linux, all of this is done with signals. Signals are numerical messages that can be sent between programs. They’re a way for processes to communicate with each other and with the operating system, allowing processes to send and receive specific messages.

These messages can be used to communicate a variety of things to a process, for example, indicating that a particular event has happened or that a specific action or response is required.

Practical uses of signals

Let’s look at a few examples of the practical value that the signal mechanism enables. Signals can be used to implement inter-process communication; for example, one process can send a signal to another process indicating that it’s finished with a particular task and that the other process can now start working. This allows processes to coordinate their actions and work together in a smooth and efficient manner, much like execution threads in programming languages (but without the associated memory sharing).

Another common application of process signals is to handle program errors. For example, a process can be designed to catch the SIGSEGV signal, which indicates a segmentation fault. When a process receives this signal, it can trap that signal and then take action to log the error, dump core for debugging purposes, or clean up any resources that were being used before shutting down gracefully.

Process signals can also be used to implement graceful shutdowns. For example, when a system is shutting down, a signal can be sent to all processes to give them a chance to save their state and clean up any resources they were using, via “trapping” signals.

Trapping

Many of the signals can be “trapped” by the processes that receive them: this is essentially the same idea as catching and handling an error in a programming language.

If the receiving process has a handler function for the signal that’s being sent, then that handler function is run. That’s how programs re-read their configuration without restarting, and finish their database writes and close their file handles after receiving the shutdown signal.

The kill command

However, it’s not just processes that communicate via signals: the frighteningly named (and, technically speaking, incorrectly named) kill is a program that allows users to send signals to processes, too.

One of the most common uses of user-sent processes via the kill command is to interrupt a process that is no longer responding. For example, if a process is stuck in an infinite loop, a “kill” signal can be sent to force it to stop.

The kill command allows you to send a signal to a process by specifying its PID. If the process you’d like to terminate has PID 2600, you’d run:

kill 2600

This command would send signal 15 (SIGTERM, or “terminate”) to the process, which would then have a chance to trap the signal and shut down cleanly.

Note

As you can see from the included table of standard signal numbers, the default signal that kill sends is “terminate” (signal 15), not “kill” (SIGKILL is 9). The kill program is not just for killing processes but also for sending any kind of signal. It’s really confusingly named and I’m sorry about that – it’s just one of those idiosyncrasies of Unix and Linux that you’ll get used to.

If you don’t want to send the default signal 15, you can specify the signal you’d like to send with a dash; to send a SIGHUP to the same process, you’d run:

kill –1 2600

Running man signal will give you a list of signals that you can send:

Figure 2.6: Example of output of the man signal command

It pays – sometimes quite literally, in engineering interviews – to be familiar with a few of these:

SIGHUP (1) – “hangup”: interpreted by many applications – for example, nginx – as “re-read your configuration because I’ve made changes to it.”
SIGINT (2) – “interrupt”: often interpreted the same as SIGTERM - “please shut down cleanly.”
SIGTERM (15) – “terminate”: nicely asks a process to shut down.
SIGUSR1 (30) and SIGUSR2 (31) are sometimes used for application-defined messaging For example, SIGUSR1 asks nginx to re-open the log files it’s writing to, which is useful if you’ve just rotated them.
SIGKILL (9) – SIGKILL cannot be trapped and handled by processes. If this signal is sent to a program, the operating system will kill that program immediately. Any cleanup code, like flushing writes or safe shutdown, is not performed, so this is generally a last resort, since it could lead to data corruption.

If you want to explore Linux a bit deeper, feel free to poke around the /proc directory. That’s definitely beyond the basics, but it’s a directory that contains a filesystem subtree for every process, where live information about the processes is looked up as you read those files.

/proc

In practice, this knowledge can come in handy during troubleshooting when you’ve identified a misbehaving (or mysterious) process and want to know exactly what it’s doing in real time.

You can learn a lot about a process by poking around in its /proc subdirectory and casually googling.

Many of the tools we show you in this chapter actually use /proc to gather process information, and only show you a subset of what’s there. If you want to see everything and do the filtering yourself, /proc is the place to look.

lsof – show file handles that a process has open

The lsof command shows all files that a process has opened for reading and writing. This is useful because it only takes one small bug for a program to leak file handles (internal references to files that it has requested access to). This can lead to resource usage issues, file corruption, and a long list of strange behavior.

Thankfully, getting a list of files that a process has open is easy. Just run lsof and pass the –p flag with a PID (you’ll usually have to run this as root). This will return the list of files that the process (in this case, with PID 1589) has open:

  ~ lsof -p 1589

Figure 2.7: Example of list of files opened by the 1589 process using the lsof -p 1589 command

The above is the output for an nginx web server process. The first line shows you the current working directory for the process: in this case, the root directory (/). You can also see that it has file handles open on its own binary (/usr/sbin/nginx) and various libraries in /usr/lib/.

Further down, you might notice a few more interesting filepaths:

Figure 2.8: Further opened files of the 1589 process

This listing includes the log files nginx is writing to, and socket files (Unix, IPv4, and IPv6) that it’s reading and writing to. In Unix and Linux, network sockets are just a special kind of file, which makes it easy to use the same core toolset across a wide variety of use cases – tools that work with files are extremely powerful in an environment where almost everything is represented as a file.

Inheritance

Except for the very first process, init (PID 1), all processes are created by a parent process, which essentially makes a copy of itself and then “forks” (splits) that copy off. When a process is forked, it typically inherits its parent’s permissions, environment variables, and other attributes.

Although this default behavior can be prevented and changed, it’s a bit of a security risk: software that you run manually receives the permissions of your current user (or even root privileges, if you use sudo). All child processes that might be created by that process – for example, during installation, compilation, and so on – inherit those permissions.

Imagine a web server process that was started with root privileges (so it could bind to a network port) and environment variables containing cloud authentication keys (so it could grab data from the cloud). When this main process forks off a child process that needs neither root privileges nor sensitive environment variables, it’s an unnecessary security risk to pass those along to the child. As a result, dropping privileges and clearing environment variables is a common pattern in services spawning child processes.

From a security perspective, it is important to keep this in mind to prevent situations where information such as passwords or access to sensitive files could be leaked. While it is outside the scope of this book to go into details of how to avoid this, it’s important to be aware of this if you’re writing software that’s going to run on Linux systems.

The Software Developer's Guide to Linux

By : David Cohen, Christian Sturm

The Software Developer's Guide to Linux

By: David Cohen, Christian Sturm

Overview of this book

Related Content you might be interested in

Current Title:

The Software Developer's Guide to Linux

Fundamentals of Linux

CentOS Quick Start Guide

Working with Linux - Quick Hacks for the Command Line

Advanced process concepts and tools

Signals

Practical uses of signals

Trapping

The kill command

lsof – show file handles that a process has open

Inheritance