[all-commits] [llvm/llvm-project] 497a97: Address reviewer feedback: fix getWaitCountMax and...
Pankaj Dwivedi via All-commits
all-commits at lists.llvm.org
Wed Nov 26 05:30:47 PST 2025
Branch: refs/heads/users/Pankajdwivedi-25/Compiler_option_for_expanding_s_waitcnt_instructions
Home: https://github.com/llvm/llvm-project
Commit: 497a973a76355e3221c1f4fc5c3481b2ec829e2a
https://github.com/llvm/llvm-project/commit/497a973a76355e3221c1f4fc5c3481b2ec829e2a
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-11-26 (Wed, 26 Nov 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Log Message:
-----------
Address reviewer feedback: fix getWaitCountMax and reduce code duplication
- Fix getWaitCountMax() to use correct bitmasks based on architecture:
- Pre-GFX12: Use getVmcntBitMask/getLgkmcntBitMask for LOAD_CNT/DS_CNT
- GFX12+: Use getLoadcntBitMask/getDscntBitMask for LOAD_CNT/DS_CNT
- Refactor repetitive if-blocks for LOAD_CNT, DS_CNT, EXP_CNT into
a single loop using getCounterRef helper function
- Fix X_CNT to return proper getXcntBitMask(IV) instead of 0
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list