ReleaseNotes.rst 16 KB

Raw Blame History Permalink


LLVM 10.0.0 Release Notes


Introduction

Non-comprehensive list of changes in this release
Changes to the LLVM IR
Changes to the AArch64 Backend
Changes to the ARM Backend
Changes to the MIPS Target
Changes to the PowerPC Target
Changes to the SystemZ Target
Changes to the X86 Target
Changes to the WebAssembly Target
Changes to the Windows Target
Changes to the RISC-V Target
Changes to the C API
Changes to the Go bindings


Changes to LLDB
External Open Source Projects Using LLVM 10
Additional Information


Introduction
This document contains the release notes for the LLVM Compiler Infrastructure,
release 10.0.0.  Here we describe the status of LLVM, including major improvements
from the previous release, improvements in various subprojects of LLVM, and
some of the current users of the code.  All LLVM releases may be downloaded
from the LLVM releases web site.
For more information about LLVM, including information about the latest
release, please check out the main LLVM web site.  If you
have questions or comments, the LLVM Developer's Mailing List is a good place to send
them.

Non-comprehensive list of changes in this release

The ISD::FP_ROUND_INREG opcode and related code was removed from SelectionDAG.
Enabled MemorySSA as a loop dependency. Since
r370957
(D58311 [MemorySSA & LoopPassManager]
Enable MemorySSA as loop dependency. Update tests.), the MemorySSA analysis
is being preserved and used by a series of loop passes. The most significant
use is in LICM, where the instruction hoisting and sinking relies on aliasing
information provided by MemorySSA vs previously creating an AliasSetTracker.
The LICM step of promoting variables to scalars still relies on the creation
of an AliasSetTracker, but its use is reduced to only be enabled for loops
with a small number of overall memory instructions. This choice was motivated
by experimental results showing compile and run time benefits or replacing the
AliasSetTracker usage with MemorySSA without any performance penalties.
The fact that MemorySSA is now preserved by and available in a series of loop
passes, also opens up opportunities for its use in those respective passes.
The BasicBlockPass, BBPassManager and all their uses were deleted in
this revision.
The LLVM_BUILD_LLVM_DYLIB and LLVM_LINK_LLVM_DYLIB CMake options are no longer
available on Windows.
As per :ref:`LLVM Language Reference Manual <i_getelementptr>`,
getelementptr inbounds can not change the null status of a pointer,
meaning it can not produce non-null pointer given null base pointer, and
likewise given non-null base pointer it can not produce null pointer; if it
does, the result is a :ref:`poison value <poisonvalues>`.
Since r369789
(D66608 [InstCombine] icmp eq/ne (gep
inbounds P, Idx..), null -> icmp eq/ne P, null) LLVM uses that for
transformations. If the original source violates these requirements this
may result in code being miscompiled. If you are using Clang front-end,
Undefined Behaviour Sanitizer -fsanitize=pointer-overflow check
will now catch such cases.
Windows Control Flow Guard: the -cfguard option now emits CFG checks on
indirect function calls. The previous behavior is still available with the
-cfguard-nochecks option. Note that this feature should always be used
with optimizations enabled.

Callbacks have been added to CommandLine Options.  These can
be used to validate or selectively enable other options.
The function attributes no-frame-pointer-elim and
no-frame-pointer-elim-non-leaf have been replaced by frame-pointer,
which has 3 values: none, non-leaf, and all. The values mean what
functions should retain frame pointers.
The inter-procedural analysis and optimization capabilities in the Attributor
framework and pass have been substantially advanced (initial commit
D59918, LLVM-Dev talk).
In this release, 19 different attributes are inferred, including 12 LLVM IR
attributes and 7 "abstract" attributes, such as liveness. The Attributor is
still under heavy development and disabled by default; to enable an early run
pass -mllvm -attributor-disable=false to an invocation of clang.
New matrix math intrinsics have been added to LLVM
(see :ref:`LLVM Language Reference Manual <i_matrixintrinsics>`), together
with the LowerMatrixIntrinsics pass. The pass lowers matrix intrinsics
to a set of efficient vector instructions. The lowering pass is off
by default and can be enabled by passing -mllvm -enable-matrix to an
invocation of clang.


Changes to the LLVM IR

Unnamed function arguments now get printed with their automatically
generated name (e.g. "i32 %0") in definitions. This may require front-ends
to update their tests; if so there is a script utils/add_argument_names.py
that correctly converted 80-90% of Clang tests. Some manual work will almost
certainly still be needed.
A new freeze instruction is added. The freeze instruction is used to stop
IR-level propagation of undef and poison values. Currently its support is
preliminary; a freeze-equivalent operation for SelDag/MIR needs to be added.


Changes to the AArch64 Backend

Added support for Cortex-A65, Cortex-A65AE, Neoverse E1 and Neoverse N1 cores.
With a few more bugs fixed in the LLVM 10 release, clang-cl can now target
Windows-on-ARM well, demonstrated by building complex pieces of software such
as Chromium and the Electron framework.
Support for -fpatchable-function-entry was added.


Changes to the ARM Backend

Optimized ARMv8.1-M code generation, including generating Low Overhead Loops.
Added auto-vectorization for the ARMv8.1-M MVE vector extension.
Support was added for inline asm constraints s,j,x,N,O.


Changes to the MIPS Target

Improved support for octeon and added support for octeon+
MIPS-family CPU.

min, max, umin, umax atomics now supported on MIPS targets.
Now PC-relative relocations are generated for .eh_frame sections when
possible. That allows to link MIPS binaries without having to pass the
-Wl,-z,notext option.
Fix evaluating J-format branch (j, jal, ...) targets when the
instruction is not in the first 256 MB region.
Fixed jal, sc, scs, ll, lld, la, lw, sw
instructions expanding. Now they accept more types of expression as arguments,
correctly handle load/store for XGOT model, expand using less instructions
or registers.
Initial MIPS support has been added to llvm-exegesis.
Generates _mcount calls using proper MIPS ABI.
Improved support of GlobalISel instruction selection framework. This feature
is still in experimental state for MIPS targets though.


Changes to the PowerPC Target
Optimization:

Improved register pressure estimates in the loop vectorizer based on type
Improved the PowerPC cost model for the vectorizer
Enabled vectorization of math routines on PowerPC using MASSV (Mathematical Acceleration SubSystem) library

compiler-rt:

Added/improved conversion functions from IBM long double to 128-bit integers

Codegen:

Optimized memory access instructions in loops (pertaining to update-form instructions and address computation)
Added options to disable hoisting instructions to hotter blocks based on statically or profile-based block hotness estimates
Code generation improvements (particularly with floating point and vector code as well as handling condition registers)
Various infrastructural improvements, code refactoring, and bug fixes
Optimized handling of control flow based on multiple comparison of same values

Tools:

llvm-readobj supports displaying file header, section headers, symbol table and relocation entries for XCOFF object files
llvm-objdump supports disassembling physical sections for XCOFF object files


Changes to the SystemZ Target

Added support for the -march=z15 and -mtune=z15 command line options
(as aliases to the existing -march=arch13 and -mtune=arch13 options).
Added support for the -march=native command line option.
Added support for the -mfentry, -mnop-mcount, and -mrecord-mcount
command line options.
Added support for the GHC calling convention.
Miscellaneous codegen enhancements, in particular to enable better
reuse of condition code values and improved use of conditional
move instructions.


Changes to the X86 Target

Less-than-128-bit vector types, v2i32, v4i16, v2i16, v8i8, v4i8, and v2i8, are
now stored in the lower bits of an xmm register and the upper bits are
undefined. Previously the elements were spread apart with undefined bits in
between them.
v32i8 and v64i8 vectors with AVX512F enabled, but AVX512BW disabled will now
be passed in ZMM registers for calls and returns. Previously they were passed
in two YMM registers. Old behavior can be enabled by passing
-x86-enable-old-knl-abi.

-mprefer-vector-width=256 is now the default behavior skylake-avx512 and
later Intel CPUs. This tries to limit the use of 512-bit registers which can
cause a decrease in CPU frequency on these CPUs. This can be re-enabled by
passing -mprefer-vector-width=512 to clang or passing
-mattr=-prefer-256-bit to llc.
Deprecated the mpx feature flag for the Intel MPX instructions. There were no
intrinsics for this feature. This change only this effects the results
returned by getHostCPUFeatures on CPUs that implement the MPX instructions.
The feature flag fast-partial-ymm-or-zmm-write which previously disabled
vzeroupper insertion has been removed. It has been replaced with a vzeroupper
feature flag which has the opposite polarity. So -vzeroupper has the same
effect as +fast-partial-ymm-or-zmm-write.


Changes to the WebAssembly Target


__attribute__((used)) no longer implies that a symbol is exported, for
consistency with other targets.
Multivalue function signatures are now supported in WebAssembly object files
The new atomic.fence instruction is now supported
Thread-Local Storage (TLS) is now supported.
SIMD support is significantly expanded.


Changes to the Windows Target

Fixed section relative relocations in .debug_frame in DWARF debug info


Changes to the RISC-V Target
New Features:

The Machine Outliner is now supported, but not enabled by default.
Shrink-wrapping is now supported.
The Machine Scheduler has been enabled and scheduler descriptions for the
Rocket micro-architecture have been added, covering both 32- and 64-bit Rocket
cores.
This release lays the groundwork for enabling LTO in a future LLVM release.
In particular, LLVM now uses a new target-abi module metadata item to
represent the chosen RISC-V psABI variant. Frontends should add this module
flag to prevent ABI lowering problems when LTO is enabled in a future LLVM
release.
Support has been added for assembling RVC HINT instructions.
Added code lowering for half-precision floats.
The fscsr and frcsr (fssr, frsr) obsolete aliases have been added to
the assembler for use in legacy code.
The stack can now be realigned even when there are variable-sized objects in
the same frame.
fastcc is now supported. This is a more efficient, unstandardised, calling
convention for calls to private leaf functions in the same IR module.
llvm-objdump now supports -M no-aliases and -M numeric for altering the
dumped assembly. These match the behaviour of GNU objdump, respectively
disabling instruction aliases and printing the numeric register names rather
than the ABI register names.

Improvements:

Trap and Debugtrap now lower to RISC-V-specific trap instructions.
LLVM IR Inline assembly now supports using ABI register names and using
floating point registers in constraints.
Stack Pointer adjustments have been changed to better match RISC-V's immediates.

ra (x1) can now be used as a callee-saved register.
The assembler now suggests spelling corrections for unknown assembly
mnemonics.
Stack offsets of greater than 32-bits are now accepted on RV64.
Variadic functions can now be tail-call optimised, as long as they do not use
stack memory for passing arguments.
Code generation has been changed for 32-bit arithmetic operations on RV64 to
reduce sign-extensions.

Bug Fixes:

There was an issue with register preservation after calls in interrupt
handlers, where some registers were marked as preserved even though they were
not being preserved by the call. This has been corrected, and now only
callee-saved registers are live over a function call in an interrupt handler
(just like calls in regular functions).
Atomic instructions now only accept GPRs (plus an offset) in memory operands.
Fixed some issues with evaluation of relocations and fixups.
The error messages around missing RISC-V extensions in the assembler have been
improved.
The error messages around unsupported relocations have been improved.
Non-PIC code no longer forces Local Exec TLS.
There have been some small changes to the code generation for atomic
operations.
RISC-V no longer emits incorrect CFI directives in function prologues and
epilogues.
RV64 no longer clears the upper bits when returning complex types from
libcalls using the LP64 psABI.

Compiler-RT:

RISC-V (both 64-bit and 32-bit) is now supported by compiler-rt, allowing
crtbegin and crtend to be built.
The Sanitizers now support 64-bit RISC-V on Linux.


Changes to the C API

C DebugInfo API LLVMDIBuilderCreateTypedef is updated to include an extra
argument AlignInBits, to facilitate / propagate specified Alignment information
present in a typedef to Debug information in LLVM IR.


Changes to the Go bindings

Go DebugInfo API CreateTypedef is updated to include an extra argument AlignInBits,
to facilitate / propagate specified Alignment information present in a typedef
to Debug information in LLVM IR.


Changes to LLDB

Improved support for building with MinGW
Initial support for debugging Windows ARM and ARM64 binaries
Improved error messages in the expression evaluator.
Tab completions for command options now also provide a description for each option.
Fixed that printing structs/classes with the expression command sometimes did not
print the members/contents of the class.
Improved support for using classes with bit-field members in the expression evaluator.
Greatly improved support for DWARF v5.


External Open Source Projects Using LLVM 10
Zig Programming Language
Zig  is a system programming language intended to be
an alternative to C. It provides high level features such as generics, compile
time function execution, and partial evaluation, while exposing low level LLVM
IR features such as aliases and intrinsics. Zig uses Clang to provide automatic
import of .h symbols, including inline functions and simple macros. Zig uses
LLD combined with lazily building compiler-rt to provide out-of-the-box
cross-compiling for all supported targets.

Additional Information
A wide variety of additional information is available on the LLVM web page, in particular in the documentation section.  The web page also contains versions of the
API documentation which is up-to-date with the Subversion version of the source
code.  You can access versions of these documents specific to this release by
going into the llvm/docs/ directory in the LLVM tree.
If you have any questions or comments about LLVM, please feel free to contact
us via the mailing lists.