Learn LLVM 12

By: Kai Nacke

Overview of this book

LLVM was built to bridge the gap between compiler textbooks and actual compiler development. It provides a modular codebase and advanced tools which help developers to build compilers easily. This book provides a practical introduction to LLVM, gradually helping you navigate through complex scenarios with ease when it comes to building and working with compilers. You’ll start by configuring, building, and installing LLVM libraries, tools, and external projects. Next, the book will introduce you to LLVM design and how it works in practice during each LLVM compiler stage: frontend, optimizer, and backend. Using a subset of a real programming language as an example, you will then learn how to develop a frontend and generate LLVM IR, hand it over to the optimization pipeline, and generate machine code from it. Later chapters will show you how to extend LLVM with a new pass and how instruction selection in LLVM works. You’ll also focus on Just-in-Time compilation issues and the current state of JIT-compilation support that LLVM provides, before finally going on to understand how to develop a new backend for LLVM. By the end of this LLVM book, you will have gained real-world experience in working with the LLVM compiler development framework with the help of hands-on examples and source code snippets.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Code in Action

Download the color images

Conventions used

Get in touch

Reviews

Section 1 – The Basics of Compiler Construction with LLVM

Chapter 1: Installing LLVM

Getting the prerequisites ready

Building with CMake

Customizing the build process

Summary

Chapter 2: Touring the LLVM Source

Technical requirements

Contents of the LLVM mono repository

Layout of an LLVM project

Creating your own project using LLVM libraries

Targeting a different CPU architecture

Summary

Chapter 3: The Structure of a Compiler

Technical requirements

Building blocks of a compiler

An arithmetic expression language

Lexical analysis

Syntactical analysis

Semantic analysis

Generating code with the LLVM backend

Summary

Section 2 – From Source to Machine Code Generation

Chapter 4: Turning the Source File into an Abstract Syntax Tree

Technical requirements

Defining a real programming language

Creating the project layout

Managing source files and user messages

Structuring the lexer

Constructing a recursive descent parser

Generating a parser and lexer with bison and flex

Performing semantic analysis

Summary

Chapter 5: Basics of IR Code Generation

Technical requirements

Generating IR from the AST

Using AST numbering to generate IR code in SSA form

Setting up the module and the driver

Summary

Chapter 6: IR Generation for High-Level Language Constructs

Technical requirements

Working with arrays, structs, and pointers

Getting the application binary interface right

Creating IR code for classes and virtual functions

Summary

Chapter 7: Advanced IR Generation

Technical requirements

Throwing and catching exceptions

Generating metadata for type-based alias analysis

Adding debug metadata

Summary

Chapter 8: Optimizing IR

Technical requirements

Introducing the LLVM Pass manager

Implementing a Pass using the new Pass manager

Adapting a Pass for use with the old Pass manager

Adding an optimization pipeline to your compiler

Summary

Section 3 – Taking LLVM to the Next Level

Chapter 9: Instruction Selection

Technical requirements

Understanding the LLVM target backend structure

Using MIR to test and debug the backend

How instruction selection works

Supporting new machine instructions

Summary

Chapter 10: JIT Compilation

Technical requirements

Getting an overview of LLVM's JIT implementation and use cases

Using JIT compilation for direct execution

Utilizing a JIT compiler for code evaluation

Summary

Chapter 11: Debugging Using LLVM Tools

Technical requirements

Instrumenting an application with sanitizers

Finding bugs with libFuzzer

Performance profiling with XRay

Checking the source with the Clang Static Analyzer

Creating your own Clang-based tool

Summary

Chapter 12: Create Your Own Backend

Technical requirements

Setting the stage for a new backend

Adding the new architecture to the Triple class

Extending the ELF file format definition in LLVM

Creating the target description

Implementing the DAG instruction selection classes

Generating assembler instructions

Emitting machine code

Adding support for disassembling

Piecing it all together

Summary

Managing source files and user messages

A real compiler must deal with many files. Usually, the developer calls the compiler with the name of the main compilation unit. This compilation unit can refer to other files, for example, via #include directives in C or import statements in Python or Modula-2. An imported module can import other modules, and so on. All these files must be loaded into memory and run through the analysis stages of the compiler. During development, a developer may make syntactic or semantic errors. When the compiler detects one, it should print an error message that includes the source line and a marker pointing at the error. Clearly, this essential component is not trivial.

Luckily, LLVM comes with a solution: the llvm::SourceMgr class. A new source file is added to SourceMgr with a call to the AddNewSourceBuffer() method. Alternatively, a file can be loaded with a call to the AddIncludeFile() method. Both methods return an ID to identify the buffer. You use...
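To make the idea concrete, here is a minimal, standard-library-only sketch of what such a component does: it keeps every loaded buffer, hands back an ID for each one, and formats an error message with the source line and a marker. The class MiniSourceMgr and its methods addBuffer() and formatError() are hypothetical names invented for this illustration. They are not part of LLVM, and llvm::SourceMgr itself is considerably more capable.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in that illustrates the idea behind a source manager.
// This is NOT LLVM's API; all names here are assumptions for the example.
class MiniSourceMgr {
  struct Buffer {
    std::string Name; // File name shown in messages.
    std::string Text; // Complete file content held in memory.
  };
  std::vector<Buffer> Buffers;

public:
  // Registers a buffer and returns its ID. IDs start at 1 in this sketch.
  unsigned addBuffer(std::string Name, std::string Text) {
    Buffers.push_back({std::move(Name), std::move(Text)});
    return static_cast<unsigned>(Buffers.size());
  }

  // Builds "name:line:column: error: message", followed by the offending
  // source line and a caret marker. Offset is a byte offset into the buffer.
  std::string formatError(unsigned ID, std::size_t Offset,
                          const std::string &Msg) const {
    assert(ID >= 1 && ID <= Buffers.size() && "unknown buffer ID");
    const Buffer &B = Buffers[ID - 1];
    assert(Offset <= B.Text.size() && "offset outside buffer");

    // Walk back to the first character of the line containing Offset.
    std::size_t LineStart = Offset;
    while (LineStart > 0 && B.Text[LineStart - 1] != '\n')
      --LineStart;

    // The line ends at the next newline or at the end of the buffer.
    std::size_t LineEnd = B.Text.find('\n', Offset);
    if (LineEnd == std::string::npos)
      LineEnd = B.Text.size();

    // The line number is one more than the number of newlines before it.
    std::size_t Line = 1;
    for (std::size_t I = 0; I < LineStart; ++I)
      if (B.Text[I] == '\n')
        ++Line;
    std::size_t Col = Offset - LineStart + 1;

    return B.Name + ":" + std::to_string(Line) + ":" + std::to_string(Col) +
           ": error: " + Msg + "\n" +
           B.Text.substr(LineStart, LineEnd - LineStart) + "\n" +
           std::string(Col - 1, ' ') + "^\n";
  }
};
```

A caller would register each file once with addBuffer(), keep the returned ID, and later pass that ID together with a byte offset to formatError() whenever the lexer, parser, or semantic analysis finds a problem. Storing whole buffers in memory is what makes it cheap to recover the source line for the message.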
