[all-commits] [llvm/llvm-project] f4ace6: AMDGPU: Add target id and code object v4 support

Konstantin Zhuravlyov via All-commits all-commits at lists.llvm.org
Wed Mar 24 08:54:35 PDT 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: f4ace6373747a661ebdae7a14f9e510c7adfea4e
      https://github.com/llvm/llvm-project/commit/f4ace6373747a661ebdae7a14f9e510c7adfea4e
  Author: Konstantin Zhuravlyov <kzhuravl_dev at outlook.com>
  Date:   2021-03-24 (Wed, 24 Mar 2021)

  Changed paths:
    M lld/test/ELF/amdgpu-abi-version.s
    M lld/test/ELF/lto/amdgcn-oses.ll
    M llvm/include/llvm/BinaryFormat/ELF.h
    M llvm/include/llvm/MC/MCParser/MCTargetAsmParser.h
    M llvm/include/llvm/MC/MCSubtargetInfo.h
    M llvm/include/llvm/Support/AMDGPUMetadata.h
    M llvm/include/llvm/Support/AMDHSAKernelDescriptor.h
    M llvm/lib/MC/MCParser/AsmParser.cpp
    M llvm/lib/MC/MCParser/MasmParser.cpp
    M llvm/lib/MC/MCSubtargetInfo.cpp
    M llvm/lib/ObjectYAML/ELFYAML.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.h
    M llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUHSAMetadataStreamer.h
    M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
    M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
    M llvm/lib/Target/AMDGPU/AMDGPUPTNote.h
    M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
    M llvm/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.h
    M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
    M llvm/lib/Target/AMDGPU/SIISelLowering.h
    M llvm/lib/Target/AMDGPU/SIInstrInfo.td
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUPALMetadata.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUPALMetadata.h
    M llvm/test/CodeGen/AMDGPU/GlobalISel/lds-global-non-entry-func.ll
    M llvm/test/CodeGen/AMDGPU/and.ll
    M llvm/test/CodeGen/AMDGPU/attr-amdgpu-flat-work-group-size-v3.ll
    M llvm/test/CodeGen/AMDGPU/attr-amdgpu-flat-work-group-size.ll
    M llvm/test/CodeGen/AMDGPU/break-smem-soft-clauses.mir
    M llvm/test/CodeGen/AMDGPU/cluster-flat-loads-postra.mir
    M llvm/test/CodeGen/AMDGPU/directive-amdgcn-target.ll
    R llvm/test/CodeGen/AMDGPU/elf-header-flags-sram-ecc.ll
    A llvm/test/CodeGen/AMDGPU/elf-header-flags-sramecc.ll
    M llvm/test/CodeGen/AMDGPU/elf-header-flags-xnack.ll
    M llvm/test/CodeGen/AMDGPU/elf-header-osabi.ll
    M llvm/test/CodeGen/AMDGPU/elf-notes.ll
    M llvm/test/CodeGen/AMDGPU/fabs.ll
    M llvm/test/CodeGen/AMDGPU/flat-scratch-reg.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-enqueue-kernel-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-from-llvm-ir-full-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-hidden-args-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-hostcall-absent-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-hostcall-present-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-images-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-invalid-ocl-version-1-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-invalid-ocl-version-2-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-invalid-ocl-version-3-v3.ll
    M llvm/test/CodeGen/AMDGPU/hsa-metadata-wavefrontsize.ll
    M llvm/test/CodeGen/AMDGPU/hsa-note-no-func.ll
    M llvm/test/CodeGen/AMDGPU/hsa.ll
    A llvm/test/CodeGen/AMDGPU/kernarg-size.ll
    M llvm/test/CodeGen/AMDGPU/large-alloca-compute.ll
    M llvm/test/CodeGen/AMDGPU/lds-global-non-entry-func.ll
    M llvm/test/CodeGen/AMDGPU/lshr.v2i16.ll
    M llvm/test/CodeGen/AMDGPU/s_addk_i32.ll
    M llvm/test/CodeGen/AMDGPU/s_mulk_i32.ll
    M llvm/test/CodeGen/AMDGPU/sram-ecc-default.ll
    M llvm/test/CodeGen/AMDGPU/stack-realign-kernel.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-all-any.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-all-not-supported.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-all-off.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-all-on.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-any-off-1.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-any-off-2.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-any-on-1.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-any-on-2.ll
    A llvm/test/CodeGen/AMDGPU/tid-mul-func-xnack-invalid-any-off-on.ll
    A llvm/test/CodeGen/AMDGPU/tid-one-func-xnack-any.ll
    A llvm/test/CodeGen/AMDGPU/tid-one-func-xnack-not-supported.ll
    A llvm/test/CodeGen/AMDGPU/tid-one-func-xnack-off.ll
    A llvm/test/CodeGen/AMDGPU/tid-one-func-xnack-on.ll
    A llvm/test/CodeGen/AMDGPU/trap-abis.ll
    M llvm/test/MC/AMDGPU/hsa-diag-v3.s
    M llvm/test/MC/AMDGPU/hsa-gfx10-v3.s
    M llvm/test/MC/AMDGPU/hsa-v3.s
    A llvm/test/MC/AMDGPU/hsa-v4.s
    M llvm/test/MC/AMDGPU/hsa_isa_version_attrs.s
    M llvm/test/MC/AMDGPU/isa-version-hsa.s
    M llvm/test/MC/AMDGPU/isa-version-pal.s
    M llvm/test/MC/AMDGPU/isa-version-unk.s
    M llvm/test/MC/AMDGPU/round-trip.s
    R llvm/test/Object/AMDGPU/elf-header-flags-sram-ecc.yaml
    A llvm/test/Object/AMDGPU/elf-header-flags-sramecc.yaml
    M llvm/test/Object/AMDGPU/elf-header-flags-xnack.yaml
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-failure.s
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-sgpr.s
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-vgpr.s
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-zeroed-gfx10.s
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-zeroed-gfx9.s
    M llvm/test/tools/llvm-objdump/ELF/AMDGPU/kd-zeroed-raw.s
    M llvm/test/tools/llvm-readobj/ELF/amdgpu-elf-headers.test
    M llvm/test/tools/llvm-readobj/ELF/note-amd.s
    M llvm/tools/llvm-readobj/ELFDumper.cpp

  Log Message:
  -----------
  AMDGPU: Add target id and code object v4 support

  - Add target id support (https://clang.llvm.org/docs/ClangOffloadBundler.html#target-id)
  - Add code object v4 support (https://llvm.org/docs/AMDGPUUsage.html#elf-code-object)
    - Add kernarg_size to kernel descriptor
    - Change trap handler ABI to no longer move queue pointer into s[0:1]
  - Cleanup ELF definitions
    - Add V2, V3, V4 suffixes to make a clear distinction for code object version
    - Consolidate note names

Differential Revision: https://reviews.llvm.org/D95638




More information about the All-commits mailing list