[llvm-dev] Assign different RegClasses to a virtual register based on 'uniform' attribute?

Wed Dec 21 16:02:37 PST 2016

2016年12月21日星期三，Matt Arsenault <arsenm2 at gmail.com> 写道：
>
> On Dec 21, 2016, at 10:26, Ruiling Song <ruiling.song83 at gmail.com> wrote:
>
>
> 2016-12-20 22:14 GMT+08:00 Tom Stellard <tom at stellard.net>:
>>
>> On Tue, Dec 20, 2016 at 11:00:09AM +0800, Ruiling Song wrote:
>> > Hi,
>> >
>> > I am working on a new LLVM target for Intel GPU, which also has same
kind
>> > of scalar/vector register classes used in AMDGPU target. Like for a i32
>> > virtual register, it will be held in scalar register if its value is
>> > uniform across a wavefront/warp, otherwise it will be in a vector
register.
>> > Does AMDGPU already done this? I read the code, but I didn't figure
out how
>> > to do this. Anybody has idea on this?
>> >
>>
>> In the AMDGPU backend we select everything we can to scalar
>> instructions, and then after instruction selection, we move
>> non-uniform values to the vector ALU.  This is done by
>> the SIFixSGPRCopiesPass, which relies heavily on
>> SIInstrInfo::moveToVALU().
>
> Hi Tom,
>
> I take a look at the code, it looks like a good idea. It really helps me
a lot. Thanks Tom! I have a question for the code, why it only pass
copy-like instructions as TopInst to moveToALU()? Is there any special
reason to do like this? I thought that iterating through all the MIs and
fix regClass if needed would be ok. Am I thinking it too simple?
>
> - Ruiling
>>
>> -Tom
>>
>> > - Ruiling
>
>
>
>
> --
> - Ruiling
>
> The instruction selector will insert these copies to satisfy the register
operand constraints, so by finding all users (and users of users) of the
illegal copies you find the same thing.
Only checking copy-like MIs and their further user sounds more reasonable.
if iterating through all MIs and checking all the MIs' the Src&Dst register
class to find out instructions that need to fix, which is what I previously
thought, would just waste some compile time.

The instruction set is different, so we’re really replacing the
instructions and not exactly just changing the register classes.
> I think this process logically makes sense, moving things to vector as
forced. However I’m uncertain if this is the best approach. I’ve debated
going the other direction and selecting everything to vector instruction,
and having an optimization pass move parts to scalars.
I don't know what makes you thinking about the change. Is "the other
direction" more safe because it only optimize known pattern? Or do you ever
meet some situation that you cannot handle it smart using current solution?
I will do some try in my target, may be we can have some further discussion
after I know more about LLVM on this.
- Ruiling
This is what the AMD compiler does. There are different trade offs, but one
advantage is you immediately have something resembling a legal program to
begin with.
> -Matt

-- 
- Ruiling
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20161222/4519fb78/attachment.html>