Book Image

Red Hat Enterprise Linux Troubleshooting Guide

By : Benjamin Cane
Book Image

Red Hat Enterprise Linux Troubleshooting Guide

By: Benjamin Cane

Overview of this book

Red Hat Enterprise Linux is an operating system that allows you to modernize your infrastructure, boost efficiency through virtualization, and finally prepare your data center for an open, hybrid cloud IT architecture. It provides the stability to take on today's challenges and the flexibility to adapt to tomorrow's demands. In this book, you begin with simple troubleshooting best practices and get an overview of the Linux commands used for troubleshooting. The book will cover the troubleshooting methods for web applications and services such as Apache and MySQL. Then, you will learn to identify system performance bottlenecks and troubleshoot network issues; all while learning about vital troubleshooting steps such as understanding the problem statement, establishing a hypothesis, and understanding trial, error, and documentation. Next, the book will show you how to capture and analyze network traffic, use advanced system troubleshooting tools such as strace, tcpdump & dmesg, and discover common issues with system defaults. Finally, the book will take you through a detailed root cause analysis of an unexpected reboot where you will learn to recover a downed system.
Table of Contents (19 chapters)
Red Hat Enterprise Linux Troubleshooting Guide
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

A look back


Now that we have resolved the issue, let's take a second to look at what we did to resolve the issue.

Too many open files

In order to troubleshoot our issue, we executed a scheduled cron job manually. If we circle back to previous chapters, this is a prime example of duplicating an issue and seeing it for ourselves.

In this case, the job was not performing the tasks it was supposed to. In order to identify the reason, we ran it manually.

During that manual execution, we were able to identify the following error:

IOError: [Errno 24] Too many open files: '/opt/myapp/queue/1433955823.29_0.txt'

This error is very common and is caused by the job running into user limits that prevent a single user from opening too many files. To resolve this we added custom settings to the /etc/security/limits.conf file.

These changes set the soft limitation of open files to 100000 for our user by default. We also allowed the user to increase the open files limit to 500000 on an ad-hoc basis via the hard...