Learn LLVM 17 - Second Edition

By: Kai Nacke, Amy Kwan

Overview of this book

LLVM was built to bridge the gap between the theoretical knowledge found in compiler textbooks and the practical demands of compiler development. With a modular codebase and advanced tools, LLVM empowers developers to build compilers with ease. This book serves as a practical introduction to LLVM, guiding you progressively through complex scenarios and ensuring that you navigate the challenges of building and working with compilers like a pro. The book starts by showing you how to configure, build, and install LLVM libraries, tools, and external projects. You’ll then be introduced to LLVM's design, unraveling its applications in each compiler stage: frontend, optimizer, and backend. Using a real programming language subset, you'll build a frontend, generate LLVM IR, optimize it through the pipeline, and generate machine code. Advanced chapters extend your expertise, covering topics such as extending LLVM with a new pass, using LLVM tools for debugging, and enhancing the quality of your code. You'll also focus on just-in-time compilation issues and the current state of JIT-compilation support with LLVM. Finally, you’ll develop a new backend for LLVM, gaining insights into target description and how instruction selection works. By the end of this book, you'll have hands-on experience with the LLVM compiler development framework through real-world examples and source code snippets.

Preface

What’s new in this edition

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Part 1: The Basics of Compiler Construction with LLVM

Chapter 1: Installing LLVM

Compiling LLVM versus installing binaries

Getting the prerequisites ready

Cloning the repository and building from source

Customizing the build process

Summary

Chapter 2: The Structure of a Compiler

Building blocks of a compiler

An arithmetic expression language

Lexical analysis

Syntactical analysis

Semantic analysis

Generating code with the LLVM backend

Summary

Part 2: From Source to Machine Code Generation

Chapter 3: Turning the Source File into an Abstract Syntax Tree

Defining a real programming language

Creating the project layout

Managing the input files for the compiler

Handling messages for the user

Structuring the lexer

Constructing a recursive descent parser

Performing semantic analysis

Summary

Chapter 4: Basics of IR Code Generation

Generating IR from the AST

Using AST numbering to generate IR code in SSA form

Setting up the module and the driver

Summary

Chapter 5: IR Generation for High-Level Language Constructs

Technical requirements

Working with arrays, structs, and pointers

Getting the application binary interface right

Creating IR code for classes and virtual functions

Summary

Chapter 6: Advanced IR Generation

Throwing and catching exceptions

Generating metadata for type-based alias analysis

Adding debug metadata

Summary

Chapter 7: Optimizing IR

Technical requirements

The LLVM pass manager

Implementing a new pass

Using the ppprofiler pass with LLVM tools

Adding an optimization pipeline to your compiler

Summary

Part 3: Taking LLVM to the Next Level

Chapter 8: The TableGen Language

Technical requirements

Understanding the TableGen language

Experimenting with the TableGen language

Generating C++ code from a TableGen file

Drawbacks of TableGen

Summary

Chapter 9: JIT Compilation

Technical requirements

LLVM’s overall JIT implementation and use cases

Using JIT compilation for direct execution

Implementing our own JIT compiler with LLJIT

Building a JIT compiler class from scratch

Summary

Chapter 10: Debugging Using LLVM Tools

Technical requirements

Instrumenting an application with sanitizers

Finding bugs with libFuzzer

Performance profiling with XRay

Checking the source with the clang static analyzer

Creating your own clang-based tool

Summary

Part 4: Roll Your Own Backend

Chapter 11: The Target Description

Setting the stage for a new backend

Adding the new architecture to the Triple class

Extending the ELF file format definition in LLVM

Creating the target description

Adding the M88k backend to LLVM

Implementing the assembler parser

Creating the disassembler

Summary

Chapter 12: Instruction Selection

Defining the rules of the calling convention

Instruction selection via the selection DAG

Adding register and instruction information

Putting an empty frame lowering in place

Emitting machine instructions

Creating the target machine and the sub-target

Global instruction selection

How to further evolve the backend

Summary

Chapter 13: Beyond Instruction Selection

Adding a new machine function pass to LLVM

Integrating a new target into the clang frontend

Targeting a different CPU architecture

Summary

Index

Working with arrays, structs, and pointers

For almost all applications, basic types such as INTEGER are not sufficient. For example, to represent mathematical objects such as a matrix or a complex number, you must construct new data types based on existing ones. These new data types are generally known as aggregate or composite types.

An array is a sequence of elements of the same type. In LLVM, arrays are always static, which means that the number of elements is constant. The tinylang type ARRAY [10] OF INTEGER, or the C type long[10] on a platform where long is 64 bits wide, is expressed in IR as follows:

[10 x i64]
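As a rough source-level analogy, the following C++ sketch shows what it means for the element count to be part of the type. It is not code from the book: the names TenInts, elementCount, and sumElements are hypothetical, and std::int64_t stands in for i64 so that the example does not depend on the width of long.

```cpp
#include <cstddef>
#include <cstdint>
#include <type_traits>

// Analogue of the IR type [10 x i64]: the element count belongs to the type.
// TenInts is a hypothetical name chosen for this illustration.
using TenInts = std::int64_t[10];

// The length is fixed at compile time and is not stored with the data.
static_assert(std::extent<TenInts>::value == 10, "exactly ten elements");
static_assert(sizeof(TenInts) == 10 * sizeof(std::int64_t),
              "the array holds only its elements");

// Returns the element count, which is read from the type, not from a value.
constexpr std::size_t elementCount() { return std::extent<TenInts>::value; }

// Sums all elements; the parameter accepts only arrays of exactly ten items.
std::int64_t sumElements(const TenInts &Values) {
  std::int64_t Sum = 0;
  for (std::int64_t V : Values)
    Sum += V;
  return Sum;
}
```

Passing an array with a different number of elements to sumElements is rejected at compile time, which mirrors the way [10 x i64] and, say, [11 x i64] are distinct types in IR.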

A structure is a composite of members that can have different types. In programming languages, structures are often expressed with named members. For example, in tinylang, a structure is written as RECORD x: REAL; color: INTEGER; y: REAL; END; and in C, the same structure is written as struct { float x; long color; float y; }. In LLVM IR, the member names are dropped and only the member types are listed:

{ float, i64, float }
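One way to picture a structure without member names is a C++ std::tuple, whose elements are identified only by position. The sketch below is an analogy under that assumption, not how LLVM itself represents the type; Point, PointLayout, lower, and colorOf are hypothetical names, and std::int64_t stands in for i64.

```cpp
#include <cstdint>
#include <tuple>
#include <type_traits>

// Source-level view: every member has a name.
struct Point {
  float x;
  std::int64_t color;
  float y;
};

// View comparable to { float, i64, float }: only the member types remain,
// and each member is identified by its position (0, 1, 2).
using PointLayout = std::tuple<float, std::int64_t, float>;

// Position 1 is the 64-bit integer, matching the second entry of the IR type.
static_assert(
    std::is_same<std::tuple_element<1, PointLayout>::type,
                 std::int64_t>::value,
    "member 1 is the integer");

// Drops the names and keeps the members in declaration order.
PointLayout lower(const Point &P) { return PointLayout{P.x, P.color, P.y}; }

// Reads the member formerly called 'color' through its index.
std::int64_t colorOf(const PointLayout &Layout) { return std::get<1>(Layout); }
```

The mapping from a name such as color to the index 1 exists only in the frontend; once the names are gone, the position is the only way to refer to a member.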

To access a member, a numerical index is used. Like...
