[PATCH] D73176: [ARM] Fix dropped dollar sign from symbols in branch targets
Momchil Velikov via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 24 06:32:04 PST 2020
chill added inline comments.
================
Comment at: llvm/lib/Target/ARM/AsmParser/ARMAsmParser.cpp:6051
+ // $42 -> immediate
+ // $foo -> symbol name
S = Parser.getTok().getLoc();
----------------
pratlucas wrote:
> pratlucas wrote:
> > chill wrote:
> > > What if we have here `$ foo`, i.e. whitespace after the dollar? We should not paste together two separate tokens `$` and `<whatever>` to form an identifier, `$foo`, `$12` are identifiers, but `$ 12` is not.
> > >
> > >
> > From what I've checked, `parseExpression(...)` takes care of this scenario:
> > ```
> > <stdin>:36:11: error: invalid token in expression
> > b $ foo
> > ^
> > ```
> > The change is not binding the two actual tokens together, but only refraining from removing the $ token from the expression.
> > Please let me know if you believe a more active handling of this scenario is necessary.
> Checking the behavior without the changes from the patch, expressions like `b $ foo` are currently accepted by the parser with no errors, as the $ token is dropped from the expression.
Well, if we have two separate tokens (`AsmToken::Dollar` and `AsmToken::Idenrifier`), but somehow end up with symbol name `$foo`, something must be combining those tokens into one.
That happens in `AsmParser::parseIdentifier` (https://github.com/llvm/llvm-project/blob/master/llvm/lib/MC/MCParser/AsmParser.cpp#L2844), but (unfortunately?) this function does not handle
a non-identifier immediately following the `$`.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D73176/new/
https://reviews.llvm.org/D73176
More information about the llvm-commits
mailing list