LLVMGPU
-extract-address-computation-gpu
link
Extract address computations from memory accesses
This pass is similar to extract-address-computation
except it also
supports memory accesses that are specific to GPUs.
-iree-convert-to-nvvm
link
Perform final conversion from builtin/GPU/HAL/standard dialect to LLVM and NVVM dialects
-iree-convert-to-rocdl
link
Perform final conversion from builtin/GPU/HAL/standard dialect to LLVM and ROCDL dialects
-iree-llvmgpu-assign-constant-ordinals
link
Assigns executable constant ordinals across all LLVMGPU variants.
-iree-llvmgpu-cast-address-space-function
link
Cast address space to generic in CallOp and FuncOp
-iree-llvmgpu-cast-type-to-fit-mma
link
Perform type extension/truncation over vector.contract types to target GPU MMA intrinsics
-iree-llvmgpu-configure-tensor-layouts
link
Pass to set layouts on tensors for later vector distribution
-iree-llvmgpu-link-executables
link
Links LLVMGPU HAL executables within the top-level program module.
Optionslink
-target : Target backend name whose executables will be linked by this pass.
-iree-llvmgpu-lower-executable-target
link
Perform lowering of executable target using one of the IREE::HAL::DispatchLoweringPassPipeline
-iree-llvmgpu-pack-shared-memory-alloc
link
Pass pack shared memory allocation in order to reduce memory usage.
-iree-llvmgpu-prefetch-shared-memory
link
Rotate scf.for loops to prefetch shared memory with distance 1
-iree-llvmgpu-promote-matmul-to-fit-mma
link
Pass to promote contraction ops to fit mma shapes
-iree-llvmgpu-select-lowering-strategy
link
Select a IREE::HAL::DispatchLoweringPassPipeline for lowering the target variant
-iree-llvmgpu-tensor-pad
link
Pass to pad out tensors up to static dimensions.
-iree-llvmgpu-tensorcore-vectorization
link
Pass to convert linalg into Vector and transform it to a form that can be lowered to GPU MMA ops
-iree-llvmgpu-tile-and-distribute
link
Pass to tile and distribute linalg ops within a workgroup.
-iree-llvmgpu-vector-distribute
link
Pass to distribute vectorized functions.
-iree-llvmgpu-vector-lowering
link
Pass to lower Vector ops before conversion to LLVM.
-iree-llvmgpu-vector-to-gpu
link
Pass to convert vector to gpu.
-iree-test-llvmgpu-legalize-ops
link
Test pass for several legalization patterns.
-iree-test-llvmgpu-query-mma
link
Test pass for querying the supported mma intrinsic instructions.