[llvm-dev] RFC: New function attribute HasInaccessibleState
Joseph Tremoulet via llvm-dev
llvm-dev at lists.llvm.org
Fri Dec 11 11:16:33 PST 2015
<<<
I may misunderstand, but it seems to me that this solves only query for aliasing with a pointer known to be pointing only to globals defined in the current compilation unit.
For any pointer which "may point somewhere else”, you won’t be able to resolve the non-aliasing with the “internal state” for malloc/free, right?
To take the original example in this thread:
int *x = malloc(4);
*x = 2;
int *y = malloc(4);
*y = 4;
A pointer analysis can solve this case, but I’m not sure it scale inter procedurally and will have a limited impact outside of LTO anyway.
>>>
I think you're understanding correctly, but I don't understand what you're saying will go badly with the malloc example. Quoting the start of the thread:
<<<
The intention behind introducing this attribute is to relax the conditions in GlobalsAA as below:
(this code is in GlobalsAAResult::AnalyzeCallGraph)
if (F->isDeclaration()) {
// Try to get mod/ref behaviour from function attributes.
- if (F->doesNotAccessMemory()) {
+ if (F->doesNotAccessMemory() || F->onlyAccessesArgMemory()) {
// Can't do better than that!
} else if (F->onlyReadsMemory()) {
FunctionEffect |= Ref;
if (!F->isIntrinsic())
// This function might call back into the module and read a global -
// consider every global as possibly being read by this function.
FR.MayReadAnyGlobal = true;
} else {
FunctionEffect |= ModRef;
// Can't say anything useful unless it's an intrinsic - they don't
// read or write global variables of the kind considered here.
KnowNothing = !F->isIntrinsic();
}
continue;
}
This relaxation allows functions that (transitively) call library functions (such as printf/malloc) to still maintain and propagate GlobalsAA info. In general, this adds more precision to the description of these functions.
Concerns regarding impact on other optimizations (I'm repeating a few examples that Hal mentioned earlier).
1.
>A readnone function is one whose output is a function only of its inputs, and if you have this:
>
> int *x = malloc(4);
> *x = 2;
> int *y = malloc(4);
> *y = 4;
> you certainly don't want EarlyCSE to replace the second call to malloc with the result of the first (which it will happily do if you mark malloc as readnone).
>>>
It sounded like improving GlobalsAA (and thus disambiguation against globals) was the explicit goal, and that the concern with the malloc case was that you don't want EarlyCSE to start combining those two calls; I may be misunderstanding the code, but I wouldn't expect EarlyCSE to start combining those calls just because they have a new meaningful-only-to-GlobalsAA "almost-readnone" attribute.
To the larger point of whether there are other similar cases that extending GlobalsAA wouldn't allow us to optimize -- yes, certainly. I'm just saying that I think that the notion of "external state" is much easier to define in the context of a particular analysis than the IR as a whole, and that I'd expect that coordinating the notion across analyses would require methods on the analysis API explicitly for that coordination.
—
Mehdi
More information about the llvm-dev
mailing list