[all-commits] [llvm/llvm-project] 48c7b2: [AMDGPU] Add -amdgpu-expand-waitcnt-profiling opti...
Pankaj Dwivedi via All-commits
all-commits at lists.llvm.org
Tue Dec 2 05:03:33 PST 2025
Branch: refs/heads/users/Pankajdwivedi-25/Compiler_option_for_expanding_s_waitcnt_instructions
Home: https://github.com/llvm/llvm-project
Commit: 48c7b23636cf18645c1bc01b3f6f367130154e4a
https://github.com/llvm/llvm-project/commit/48c7b23636cf18645c1bc01b3f6f367130154e4a
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-12-02 (Tue, 02 Dec 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
A llvm/test/CodeGen/AMDGPU/expand-waitcnt-profiling.ll
Log Message:
-----------
[AMDGPU] Add -amdgpu-expand-waitcnt-profiling option for PC-sampling profiling
Commit: a28ab4e6c7b0079a0b7d38aab7ec4cc1a3d926af
https://github.com/llvm/llvm-project/commit/a28ab4e6c7b0079a0b7d38aab7ec4cc1a3d926af
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-12-02 (Tue, 02 Dec 2025)
Changed paths:
M llvm/test/CodeGen/AMDGPU/expand-waitcnt-profiling.ll
Log Message:
-----------
add run line for diff GPU Gen and counter types
Commit: 7e993fb33983b5a1912a840940f6d18d3ab14b06
https://github.com/llvm/llvm-project/commit/7e993fb33983b5a1912a840940f6d18d3ab14b06
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-12-02 (Tue, 02 Dec 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Log Message:
-----------
Address reviewer feedback: fix getWaitCountMax and reduce code duplication
- Fix getWaitCountMax() to use correct bitmasks based on architecture:
- Pre-GFX12: Use getVmcntBitMask/getLgkmcntBitMask for LOAD_CNT/DS_CNT
- GFX12+: Use getLoadcntBitMask/getDscntBitMask for LOAD_CNT/DS_CNT
- Refactor repetitive if-blocks for LOAD_CNT, DS_CNT, EXP_CNT into
a single loop using getCounterRef helper function
- Fix X_CNT to return proper getXcntBitMask(IV) instead of 0
Commit: 709640d569e7ea3c886dc98a3b51c04aaff4fd70
https://github.com/llvm/llvm-project/commit/709640d569e7ea3c886dc98a3b51c04aaff4fd70
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-12-02 (Tue, 02 Dec 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
M llvm/test/CodeGen/AMDGPU/expand-waitcnt-profiling.ll
Log Message:
-----------
skip expanding out-of-order events
Commit: 399166c09c9dfa699bef321e238e55d4501ba600
https://github.com/llvm/llvm-project/commit/399166c09c9dfa699bef321e238e55d4501ba600
Author: Pankaj kumar divedi <Pankajkumar.divedi at amd.com>
Date: 2025-12-02 (Tue, 02 Dec 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Log Message:
-----------
fix: resolve issue after rebase
Compare: https://github.com/llvm/llvm-project/compare/33d5f3413591...399166c09c9d
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list