[all-commits] [llvm/llvm-project] 497a97: Address reviewer feedback: fix getWaitCountMax and...

Pankaj Dwivedi via All-commits all-commits at lists.llvm.org
Wed Nov 26 05:30:47 PST 2025


  Branch: refs/heads/users/Pankajdwivedi-25/Compiler_option_for_expanding_s_waitcnt_instructions
  Home:   https://github.com/llvm/llvm-project
  Commit: 497a973a76355e3221c1f4fc5c3481b2ec829e2a
      https://github.com/llvm/llvm-project/commit/497a973a76355e3221c1f4fc5c3481b2ec829e2a
  Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
  Date:   2025-11-26 (Wed, 26 Nov 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp

  Log Message:
  -----------
  Address reviewer feedback: fix getWaitCountMax and reduce code duplication

- Fix getWaitCountMax() to use correct bitmasks based on architecture:
  - Pre-GFX12: Use getVmcntBitMask/getLgkmcntBitMask for LOAD_CNT/DS_CNT
  - GFX12+: Use getLoadcntBitMask/getDscntBitMask for LOAD_CNT/DS_CNT
- Refactor repetitive if-blocks for LOAD_CNT, DS_CNT, EXP_CNT into
  a single loop using getCounterRef helper function
- Fix X_CNT to return proper getXcntBitMask(IV) instead of 0



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list