[all-commits] [llvm/llvm-project] bd402e: AMDGPU: Start selecting flat/global atomicrmw fmin...
Matt Arsenault via All-commits
all-commits at lists.llvm.org
Fri Jun 21 11:44:30 PDT 2024
Branch: refs/heads/users/arsenm/amdgpu-select-flat-global-atomicrmw-fmin-fmax
Home: https://github.com/llvm/llvm-project
Commit: bd402eaa677735d794f6bd4a8c32dc7d9e2bf16c
https://github.com/llvm/llvm-project/commit/bd402eaa677735d794f6bd4a8c32dc7d9e2bf16c
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2024-06-21 (Fri, 21 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/FLATInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/flat-atomicrmw-fmax.ll
M llvm/test/CodeGen/AMDGPU/flat-atomicrmw-fmin.ll
M llvm/test/CodeGen/AMDGPU/global-atomicrmw-fmax.ll
M llvm/test/CodeGen/AMDGPU/global-atomicrmw-fmin.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmax.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmin.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.struct.ptr.buffer.atomic.fmax.f32.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.struct.ptr.buffer.atomic.fmax.f64.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.struct.ptr.buffer.atomic.fmin.f32.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.struct.ptr.buffer.atomic.fmin.f64.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-agent.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-system.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-agent.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-system.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-rmw-fmax.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-rmw-fmin.ll
Log Message:
-----------
AMDGPU: Start selecting flat/global atomicrmw fmin/fmax.
Define subtarget features for atomic fmin/fmax support.
The flat/global support is a real messe. We had float/double support at
the beginning in gfx6 and gfx7. gfx8 removed these. gfx10 reintroduced them.
gfx11 removed the f64 versions again.
gfx9 partially reintroduced them, in gfx90a and gfx940 but only for f64.
Commit: 05ea4c8158dc240c59d20ec6936a7df6e390d43b
https://github.com/llvm/llvm-project/commit/05ea4c8158dc240c59d20ec6936a7df6e390d43b
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2024-06-21 (Fri, 21 Jun 2024)
Changed paths:
M clang/test/CodeGenCUDA/amdgpu-atomic-ops.cu
Log Message:
-----------
fix clang test, depends unsafe attr handling
Commit: 58b72f3a83e025b9c3f05e802206b0bf688bf227
https://github.com/llvm/llvm-project/commit/58b72f3a83e025b9c3f05e802206b0bf688bf227
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2024-06-21 (Fri, 21 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/flat-atomicrmw-fmax.ll
M llvm/test/CodeGen/AMDGPU/flat-atomicrmw-fmin.ll
M llvm/test/CodeGen/AMDGPU/global-atomicrmw-fmax.ll
M llvm/test/CodeGen/AMDGPU/global-atomicrmw-fmin.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmax.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_scan_fmin.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-agent.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f32-system.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-agent.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-f64-system.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-rmw-fmax.ll
M llvm/test/Transforms/AtomicExpand/AMDGPU/expand-atomic-rmw-fmin.ll
Log Message:
-----------
Handle amdgpu-unsafe-fp-atomics attr
Compare: https://github.com/llvm/llvm-project/compare/5853be59be6f...58b72f3a83e0
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list