[all-commits] [llvm/llvm-project] 3aef52: [AMDGPU] Fix negative immediate offset for unbuffe...

Mon Jun 24 15:16:49 PDT 2024

  Branch: refs/heads/users/shawbyoung/spr/bolt-sync-doc-for-dropping-disparate-profiles
  Home:   https://github.com/llvm/llvm-project
  Commit: 3aef525aa4b9a5395b6ac4ae771e28e64b27a126
      https://github.com/llvm/llvm-project/commit/3aef525aa4b9a5395b6ac4ae771e28e64b27a126
  Author: vangthao95 <vang.thao at amd.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.h
    M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-load-constant.mir
    M llvm/test/CodeGen/AMDGPU/GlobalISel/smrd.ll
    M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
    M llvm/test/CodeGen/AMDGPU/gfx12_scalar_subword_loads.ll
    M llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
    M llvm/test/CodeGen/AMDGPU/llvm.prefetch.ll
    M llvm/test/CodeGen/AMDGPU/smrd.ll

  Log Message:
  -----------
  [AMDGPU] Fix negative immediate offset for unbuffered smem loads (#89165)

For unbuffered smem loads, it is illegal for the immediate offset to be
negative if the resulting IOFFSET + (SGPR[Offset] or M0 or zero) is
negative.

New PR of https://github.com/llvm/llvm-project/pull/79553.

  Commit: d2864d1716b5e5b5eabb3004b04a1d7f67803f0d
      https://github.com/llvm/llvm-project/commit/d2864d1716b5e5b5eabb3004b04a1d7f67803f0d
  Author: Louis Dionne <ldionne.2 at gmail.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M libcxx/CMakeLists.txt
    M libcxx/lib/abi/powerpc-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/powerpc64-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-freebsd.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.noexceptions.nonew.abilist

  Log Message:
  -----------
  [libc++] Build with -fsized-deallocation (#96217)

This patch makes libc++ build with -fsized-deallocation. That flag is
enabled by default in recent versions of Clang, so this patch will make
libc++ forward-compatible with ToT Clang.

  Commit: bea7f3d8739a12c599525c18c151f37b1293a080
      https://github.com/llvm/llvm-project/commit/bea7f3d8739a12c599525c18c151f37b1293a080
  Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M libc/include/llvm-libc-macros/CMakeLists.txt
    A libc/include/llvm-libc-macros/link-macros.h
    M libc/startup/linux/CMakeLists.txt
    M libc/startup/linux/do_start.cpp

  Log Message:
  -----------
  [libc][startup] create header for ElfW and use in startup (#96510)

This is necessary for 32b platforms such as ARM and i386.

Link: #94128

  Commit: dd8d978b7642b410f3a4895aa20f0aada4a0f7e4
      https://github.com/llvm/llvm-project/commit/dd8d978b7642b410f3a4895aa20f0aada4a0f7e4
  Author: Keith Smiley <keithbsmiley at gmail.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M utils/bazel/llvm-project-overlay/lldb/BUILD.bazel

  Log Message:
  -----------
  [bazel] Switch mach_gen to apple_genrule (#96551)

mig is a tool vendored with Xcode. Using apple_genrule makes sure that
the bazel selected version of Xcode is preferred, and that the action is
invalidated when that version changes.

  Commit: a030c8bc87be0599e861f5d8fe8254ac8d6e84ba
      https://github.com/llvm/llvm-project/commit/a030c8bc87be0599e861f5d8fe8254ac8d6e84ba
  Author: Keith Smiley <keithbsmiley at gmail.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    A utils/bazel/llvm-project-overlay/lldb/tools/lldb-dap/BUILD.bazel

  Log Message:
  -----------
  [bazel] Add support for building lldb-dap (#96556)

  Commit: 687d6fbfac6205e6f53ea58584278c09738ea708
      https://github.com/llvm/llvm-project/commit/687d6fbfac6205e6f53ea58584278c09738ea708
  Author: Akshay Deodhar <adeodhar at nvidia.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    M llvm/lib/Target/NVPTX/NVPTXLowerArgs.cpp
    M llvm/lib/Target/NVPTX/NVPTXUtilities.cpp
    M llvm/lib/Target/NVPTX/NVPTXUtilities.h
    A llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll

  Log Message:
  -----------
  [NVPTX] Basic support for "grid_constant" (#96125)

- Adds a helper function for checking whether an argument is a
[grid_constant](https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html#supported-properties).
- Adds support for cvta.param using changes from
https://github.com/llvm/llvm-project/pull/95289
- Supports escaped grid_constant pointers conservatively, by casting all
uses to the generic address space with cvta.param.

  Commit: 5ae50698a0d6a3022af2e79d405a7eb6c8c790f0
      https://github.com/llvm/llvm-project/commit/5ae50698a0d6a3022af2e79d405a7eb6c8c790f0
  Author: Ramkumar Ramachandra <ramkumar.ramachandra at codasip.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M llvm/lib/Analysis/LoopAccessAnalysis.cpp
    M llvm/test/Analysis/LoopAccessAnalysis/symbolic-stride.ll

  Log Message:
  -----------
  LAA: strip unnecessary getUniqueCastUse (#92119)

733b8b2 ([LAA] Simplify identification of speculatable strides [nfc])
refactored getStrideFromPointer() to compute directly on SCEVs, and
return an SCEV expression instead of a Value. However, it left behind a
call to getUniqueCastUse(), which is completely unnecessary. Remove
this, showing a positive test update, and simplify the surrounding
program logic.

  Commit: 16903ace180755b7558234ff2b2e8d89b00dcb88
      https://github.com/llvm/llvm-project/commit/16903ace180755b7558234ff2b2e8d89b00dcb88
  Author: lntue <35648136+lntue at users.noreply.github.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M libc/config/darwin/arm/entrypoints.txt
    M libc/config/linux/aarch64/entrypoints.txt
    M libc/config/linux/arm/entrypoints.txt
    M libc/config/linux/riscv/entrypoints.txt
    M libc/docs/math/index.rst
    M libc/src/__support/FPUtil/double_double.h
    M libc/src/__support/FPUtil/dyadic_float.h
    M libc/src/__support/macros/optimization.h
    M libc/src/math/generic/CMakeLists.txt
    A libc/src/math/generic/range_reduction_double_common.h
    A libc/src/math/generic/range_reduction_double_fma.h
    A libc/src/math/generic/range_reduction_double_nofma.h
    A libc/src/math/generic/sin.cpp
    A libc/src/math/generic/sincos_eval.h
    M libc/src/math/x86_64/CMakeLists.txt
    R libc/src/math/x86_64/sin.cpp
    M libc/test/src/math/sin_test.cpp
    M libc/test/src/math/smoke/CMakeLists.txt
    A libc/test/src/math/smoke/sin_test.cpp

  Log Message:
  -----------
  [libc][math] Implement double precision sin correctly rounded to all rounding modes. (#95736)

- Algorithm:
- Step 1 - Range reduction: for a double precision input `x`, return `k`
and `u` such that
    - k is an integer
    - u = x - k * pi / 128, and |u| < pi/256
- Step 2 - Calculate `sin(u)` and `cos(u)` in double-double using Taylor
polynomials with errors < 2^-70 with FMA or < 2^-66 w/o FMA.
- Step 3 - Calculate `sin(x) = sin(k*pi/128) * cos(u) + cos(k*pi/128) *
sin(u)` using look-up table for `sin(k*pi/128)` and `cos(k*pi/128)`.
- Step 4 - Use Ziv's rounding test to decide if the result is correctly
rounded.
- Step 4' - If the Ziv's rounding test failed, redo step 1-3 using
128-bit precision.
- Currently, without FMA instructions, the large range reduction only
works correctly for the default rounding mode (FE_TONEAREST).
- Provide `LIBC_MATH` flag so that users can set `LIBC_MATH =
LIBC_MATH_SKIP_ACCURATE_PASS` to build the `sin` function without step 4
and 4'.

  Commit: a32b7199f0c15ea1c6c9490b6166c019c9d4bd2b
      https://github.com/llvm/llvm-project/commit/a32b7199f0c15ea1c6c9490b6166c019c9d4bd2b
  Author: Chelsea Cassanova <chelsea_cassanova at apple.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M lldb/bindings/python/python-typemaps.swig
    M lldb/include/lldb/API/SBProcess.h
    M lldb/include/lldb/Core/AddressRangeListImpl.h
    M lldb/include/lldb/Target/Process.h
    M lldb/source/API/SBProcess.cpp
    M lldb/source/Target/Process.cpp
    R lldb/test/API/python_api/find_in_memory/Makefile
    R lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
    R lldb/test/API/python_api/find_in_memory/TestFindRangesInMemory.py
    R lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
    R lldb/test/API/python_api/find_in_memory/main.cpp

  Log Message:
  -----------
  Revert commits that add `TestFind(Ranges)InMemory.py` (#96560)

Reverting to unblock macOS buildbots which are currently failing on
these tests.
https://green.lab.llvm.org/job/llvm.org/view/LLDB/job/as-lldb-cmake/6377/

  Commit: 31b734da006ef0efde43a444ae2fd562be4cc909
      https://github.com/llvm/llvm-project/commit/31b734da006ef0efde43a444ae2fd562be4cc909
  Author: shaw young <58664393+shawbyoung at users.noreply.github.com>
  Date:   2024-06-24 (Mon, 24 Jun 2024)

  Changed paths:
    M libc/config/darwin/arm/entrypoints.txt
    M libc/config/linux/aarch64/entrypoints.txt
    M libc/config/linux/arm/entrypoints.txt
    M libc/config/linux/riscv/entrypoints.txt
    M libc/docs/math/index.rst
    M libc/include/llvm-libc-macros/CMakeLists.txt
    A libc/include/llvm-libc-macros/link-macros.h
    M libc/src/__support/FPUtil/double_double.h
    M libc/src/__support/FPUtil/dyadic_float.h
    M libc/src/__support/macros/optimization.h
    M libc/src/math/generic/CMakeLists.txt
    A libc/src/math/generic/range_reduction_double_common.h
    A libc/src/math/generic/range_reduction_double_fma.h
    A libc/src/math/generic/range_reduction_double_nofma.h
    A libc/src/math/generic/sin.cpp
    A libc/src/math/generic/sincos_eval.h
    M libc/src/math/x86_64/CMakeLists.txt
    R libc/src/math/x86_64/sin.cpp
    M libc/startup/linux/CMakeLists.txt
    M libc/startup/linux/do_start.cpp
    M libc/test/src/math/sin_test.cpp
    M libc/test/src/math/smoke/CMakeLists.txt
    A libc/test/src/math/smoke/sin_test.cpp
    M libcxx/CMakeLists.txt
    M libcxx/lib/abi/powerpc-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/powerpc64-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-freebsd.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.exceptions.nonew.abilist
    M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.noexceptions.nonew.abilist
    M lldb/bindings/python/python-typemaps.swig
    M lldb/include/lldb/API/SBProcess.h
    M lldb/include/lldb/Core/AddressRangeListImpl.h
    M lldb/include/lldb/Target/Process.h
    M lldb/source/API/SBProcess.cpp
    M lldb/source/Target/Process.cpp
    R lldb/test/API/python_api/find_in_memory/Makefile
    R lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
    R lldb/test/API/python_api/find_in_memory/TestFindRangesInMemory.py
    R lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
    R lldb/test/API/python_api/find_in_memory/main.cpp
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Analysis/LoopAccessAnalysis.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.h
    M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    M llvm/lib/Target/NVPTX/NVPTXLowerArgs.cpp
    M llvm/lib/Target/NVPTX/NVPTXUtilities.cpp
    M llvm/lib/Target/NVPTX/NVPTXUtilities.h
    M llvm/test/Analysis/LoopAccessAnalysis/symbolic-stride.ll
    M llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-load-constant.mir
    M llvm/test/CodeGen/AMDGPU/GlobalISel/smrd.ll
    M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
    M llvm/test/CodeGen/AMDGPU/gfx12_scalar_subword_loads.ll
    M llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
    M llvm/test/CodeGen/AMDGPU/llvm.prefetch.ll
    M llvm/test/CodeGen/AMDGPU/smrd.ll
    A llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll
    M utils/bazel/llvm-project-overlay/lldb/BUILD.bazel
    A utils/bazel/llvm-project-overlay/lldb/tools/lldb-dap/BUILD.bazel

  Log Message:
  -----------
  Merge branch 'main' into users/shawbyoung/spr/bolt-sync-doc-for-dropping-disparate-profiles

Compare: https://github.com/llvm/llvm-project/compare/456a7d095d62...31b734da006e

To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications