LangImpl08.rst 5.76 KB

Kaleidoscope: Compiling to Object Code

Chapter 8 Introduction

Welcome to Chapter 8 of the "Implementing a language with LLVM" tutorial. This chapter describes how to compile our language down to object files.

Choosing a target

LLVM has native support for cross-compilation. You can compile to the architecture of your current machine, or just as easily compile for other architectures. In this tutorial, we'll target the current machine.

To specify the architecture that you want to target, we use a string called a "target triple". This takes the form <arch><sub>-<vendor>-<sys>-<abi> (see the cross compilation docs).

As an example, we can see what clang thinks is our current target triple:

$ clang --version | grep Target
Target: x86_64-unknown-linux-gnu

Running this command may show something different on your machine as you might be using a different architecture or operating system to me.

Fortunately, we don't need to hard-code a target triple to target the current machine. LLVM provides sys::getDefaultTargetTriple, which returns the target triple of the current machine.

auto TargetTriple = sys::getDefaultTargetTriple();

LLVM doesn't require us to link in all the target functionality. For example, if we're just using the JIT, we don't need the assembly printers. Similarly, if we're only targeting certain architectures, we can only link in the functionality for those architectures.

For this example, we'll initialize all the targets for emitting object code.

InitializeAllTargetInfos();
InitializeAllTargets();
InitializeAllTargetMCs();
InitializeAllAsmParsers();
InitializeAllAsmPrinters();

We can now use our target triple to get a Target:

std::string Error;
auto Target = TargetRegistry::lookupTarget(TargetTriple, Error);

// Print an error and exit if we couldn't find the requested target.
// This generally occurs if we've forgotten to initialise the
// TargetRegistry or we have a bogus target triple.
if (!Target) {
  errs() << Error;
  return 1;
}

Target Machine

We will also need a TargetMachine. This class provides a complete machine description of the machine we're targeting. If we want to target a specific feature (such as SSE) or a specific CPU (such as Intel's Sandylake), we do so now.

To see which features and CPUs that LLVM knows about, we can use llc. For example, let's look at x86:

$ llvm-as < /dev/null | llc -march=x86 -mattr=help
Available CPUs for this target:

  amdfam10      - Select the amdfam10 processor.
  athlon        - Select the athlon processor.
  athlon-4      - Select the athlon-4 processor.
  ...

Available features for this target:

  16bit-mode            - 16-bit mode (i8086).
  32bit-mode            - 32-bit mode (80386).
  3dnow                 - Enable 3DNow! instructions.
  3dnowa                - Enable 3DNow! Athlon instructions.
  ...

For our example, we'll use the generic CPU without any additional features, options or relocation model.

auto CPU = "generic";
auto Features = "";

TargetOptions opt;
auto RM = Optional<Reloc::Model>();
auto TargetMachine = Target->createTargetMachine(TargetTriple, CPU, Features, opt, RM);

Configuring the Module

We're now ready to configure our module, to specify the target and data layout. This isn't strictly necessary, but the frontend performance guide recommends this. Optimizations benefit from knowing about the target and data layout.

TheModule->setDataLayout(TargetMachine->createDataLayout());
TheModule->setTargetTriple(TargetTriple);

Emit Object Code

We're ready to emit object code! Let's define where we want to write our file to:

auto Filename = "output.o";
std::error_code EC;
raw_fd_ostream dest(Filename, EC, sys::fs::OF_None);

if (EC) {
  errs() << "Could not open file: " << EC.message();
  return 1;
}

Finally, we define a pass that emits object code, then we run that pass:

legacy::PassManager pass;
auto FileType = CGFT_ObjectFile;

if (TargetMachine->addPassesToEmitFile(pass, dest, nullptr, FileType)) {
  errs() << "TargetMachine can't emit a file of this type";
  return 1;
}

pass.run(*TheModule);
dest.flush();

Putting It All Together

Does it work? Let's give it a try. We need to compile our code, but note that the arguments to llvm-config are different to the previous chapters.

$ clang++ -g -O3 toy.cpp `llvm-config --cxxflags --ldflags --system-libs --libs all` -o toy

Let's run it, and define a simple average function. Press Ctrl-D when you're done.

$ ./toy
ready> def average(x y) (x + y) * 0.5;
^D
Wrote output.o

We have an object file! To test it, let's write a simple program and link it with our output. Here's the source code:

#include <iostream>

extern "C" {
    double average(double, double);
}

int main() {
    std::cout << "average of 3.0 and 4.0: " << average(3.0, 4.0) << std::endl;
}

We link our program to output.o and check the result is what we expected:

$ clang++ main.cpp output.o -o main
$ ./main
average of 3.0 and 4.0: 3.5

Full Code Listing

Next: Adding Debug Information