[PATCH] D73821: [yaml2obj] Add -D k=v to preprocess the input YAML

James Henderson via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Feb 5 06:30:34 PST 2020


jhenderson added inline comments.


================
Comment at: llvm/tools/yaml2obj/yaml2obj.cpp:76
+
+  return Preprocessed;
+}
----------------
MaskRay wrote:
> grimar wrote:
> > MaskRay wrote:
> > > grimar wrote:
> > > > I'd suggest something like the following
> > > > 
> > > > ```
> > > > std::vector<std::pair<std::string, StringRef>> Substitution;
> > > >   for (StringRef Define : D) {
> > > >     StringRef F, V;
> > > >     std::tie(F, V) = Define.split('=');
> > > >     if (!Define.count('=') || F.empty()) {
> > > >       ErrHandler("bad override, missing field name: " + Define);
> > > >       return {};
> > > >     }
> > > >     Substitution.push_back({(Twine("[[") + F + "]]").str(), V});
> > > >   }
> > > > 
> > > >   std::string Document = Buf.str();
> > > >   if (Substitution.empty())
> > > >     return Document;
> > > > 
> > > >   for (std::pair<std::string, StringRef> &P : Substitution) {
> > > >     while (true) {
> > > >       auto I = Document.find(P.first);
> > > >       if (I == std::string::npos)
> > > >         break;
> > > >       Document = Document.replace(I, P.first.size(), P.second.str());
> > > >     }
> > > >   }
> > > > ```
> > > Maybe we should avoid quadratic complexity behavior, just in case that there are a long run of `[[[[[[[[[[`.
> > > just in case that there are a long run of [[[[[[[[[[.
> > 
> > It reminded me about D36307. I believe that `[[[[[[` is a misuse of the feature and hence we probably should not care much.
> > The approach I suggested is just very simple and it is much easier to read. I think the common use case for this feature will be
> > "replace 1-2 defines in a short enough YAML", i.e. I am not sure we should think about an additional optimizations here.
> > 
> > Probably it is time to hear a third opinion though. @jhenderson, what do you think?
> > 
> Doing `replace` repeatedly may also have a non-convergence problem...
Recursive replacement causes issues. For example `-DFOO=[[FOO]]` would result in an infinite loop. Convergence issues are only a problem if we try to do this. I personally don't think we should in the first version.

Another issue is what to do about variables that expand to other variables. One example is `-D VAR1=FOO -D VAR2=[[VAR1]]BAR` which will result in `[[VAR2]]` becoming `[[VAR1]]BAR` if specified in that order, or `FOOBAR` if VAR2 is defined first. I don't think we want this behaviour, as it is unstable and unobvious.

(Relatedly, I think we should have a test showing that variables that expand to a pattern that could be another variable aren't recursively expanded)

As for the quadratic complexity, I'm not too bothered by it in this case, as I doubt we'll ever get a YAML doc in a test where it really matters (and even if we do, there won't be many, so the overall slow-down in the testsuite will be minimal).

FWIW, I think we can probably forbid names containing '[' or ']' themselves.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D73821/new/

https://reviews.llvm.org/D73821





More information about the llvm-commits mailing list