<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><br class=""><div><blockquote type="cite" class=""><div class="">On Feb 17, 2017, at 7:05 PM, Richard Smith <<a href="mailto:richard@metafoo.co.uk" class="">richard@metafoo.co.uk</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: normal; letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;" class=""><div class="gmail_extra"><div class="gmail_quote">On 17 February 2017 at 10:21, Akira Hatanaka via cfe-dev<span class="Apple-converted-space"> </span><span dir="ltr" class=""><<a href="mailto:cfe-dev@lists.llvm.org" target="_blank" class="">cfe-dev@lists.llvm.org</a>></span><span class="Apple-converted-space"> </span>wrote:<br class=""><blockquote class="gmail_quote" style="margin: 0px 0px 0px 0.8ex; border-left-width: 1px; border-left-style: solid; border-left-color: rgb(204, 204, 204); padding-left: 1ex;"><div style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class="">I’d like to propose a new attribute for enums.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">### The proposed attribute ###</div><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class="">The attribute is tentatively named “enum_style” and takes two arguments. The first argument determines the style of the enum, either “option” or “choice".</div></div></div></div></div></div></div></div></div></blockquote><div class=""><br class=""></div><div class="">These seem like a bad choice of names to me, since they are synonyms in English. Maybe call the flag form "flag"? But I don't actually see why we need or want to merge these two orthogonal arguments into a single attribute at all. Why not instead:<br class=""></div><div class=""><br class=""></div><div class=""><div class="">__attribute__((open_enum))</div><div class="">__attribute__((closed_enum))</div>__attribute__((flag_enum, open_enum))</div><div class="">__attribute__((flag_enum, closed_enum))<br class=""></div><div class=""><br class=""></div><div class="">?</div><div class=""><br class=""></div><blockquote class="gmail_quote" style="margin: 0px 0px 0px 0.8ex; border-left-width: 1px; border-left-style: solid; border-left-color: rgb(204, 204, 204); padding-left: 1ex;"><div style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class="">“option” implies that the enum can be used as a one-bit flag, just like “flag_enum", and it’s OK to OR the values to create a new value. “choice” implies the enum cannot be used like “option” enums. The second argument is used to indicate whether or not the enum can be extended. If an enum is marked “closed”, clang can assume a variable of the enum type always has a value that is in the range determined by the enumerators listed in the enum definition.</div></div></div></div></div></div></div></div></div></blockquote><div class=""><br class=""></div><div class="">What exactly do you mean by this? The C++ rule for unscoped enums is that the range of representable values is (roughly) the values that fit in the smallest bit-field that can contain the enum. I assume that would be the rule for flag-style enums; for enumeration-style enums, would you restrict the range further to just lowest-declared-value to highest-declared-value (inclusive)?</div><div class=""> </div><blockquote class="gmail_quote" style="margin: 0px 0px 0px 0.8ex; border-left-width: 1px; border-left-style: solid; border-left-color: rgb(204, 204, 204); padding-left: 1ex;"><div style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class="">If it’s marked “open”, it doesn’t have the restriction. </div></div></div></div></div></div></div></div></div></blockquote><blockquote class="gmail_quote" style="margin: 0px 0px 0px 0.8ex; border-left-width: 1px; border-left-style: solid; border-left-color: rgb(204, 204, 204); padding-left: 1ex;"><div style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class="">There are four possible combinations:</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div class="">1. "choice, closed"</div><div class="">2. "choice, open"</div><div class=""><div class="">3. "option, closed"</div><div class="">4. "option, open"</div></div><div class=""><br class=""></div><div class="">Attribute “flag_enum” we have today is equivalent to "option,closed".</div><div class=""><br class=""></div><div class="">In addition, I’m considering adding a command line option that specifies the default enum-style for unannotated enums.</div></div></div></div></div></div></div></div></div></blockquote><div class=""><br class=""></div><div class="">A command-line option amounts to adding a new language dialect; that seems like a bad idea to me.</div><div class=""> </div></div></div></div></div></blockquote><div><br class=""></div><div>The command line options is used to silence warnings for unannotated enums. When issuing -Wassign-enum warnings, clang currently classifies enums into two categories, flag_enums and everything else and treats the latter as if they are “choice,closed”. We don’t want to issue a warning when an out-of-range value is assigned to a variable of an unannotated enum type because we don’t really know whether the enum is meant to be closed/open or choice/option.</div><div><br class=""></div><blockquote type="cite" class=""><div class=""><div dir="ltr" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: normal; letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;" class=""><div class="gmail_extra"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin: 0px 0px 0px 0.8ex; border-left-width: 1px; border-left-style: solid; border-left-color: rgb(204, 204, 204); padding-left: 1ex;"><div style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><div class="">### Motivation for the new attribute ###</div></div><div dir="auto" style="word-wrap: break-word;" class="">There are several areas that can be improved using the new attribute and command line option.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">1. Warnings</div><div dir="auto" style="word-wrap: break-word;" class="">The new attribute can improve the accuracy of enum-related warnings such as -Wassign-enum and give better control over when the warnings are issued.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">-Wassign-enum currently warns whenever a value that is out of the range determined by the enumerators is assigned to an enum variable. For a flag-enum, a value is in range if it can be created by ORing the enumerators listed in the enum definition or is a complement of one of the in-range values. For enums that are not flag-enums, only the values of the enumerators listed in the enum definition are considered to be in range. The warning is helpful in catching out-of-range values that are unintentionally assigned, but would be too strict if a project intentionally extended an enum by defining out-of-range “private” values (this does happen often). If the compiler knows an enum is “open”, it can choose not to issue warnings.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">Another problem with the current approach is that all the enums the compiler sees have to be classified into flag-enums or non-flag-enums and the flag-enums have to be annotated. This requires a lot of work up front to determine whether or not an enum is a flag-enum, and sometimes it’s not even possible to annotate the enums if they are defined in a third party library that cannot be modified. With the command line option for specifying the default enum-style, users can instruct the compiler not to issue warnings if the enum is unannotated (the default can be either "choice,open” or “option,open”) and add the attributes to the enum definitions in an incremental fashion.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">2. Code-completion and debugging</div><div dir="auto" style="word-wrap: break-word;" class="">Code-completion tools can offer better suggestions based on whether the enum is a choice or an option. For example, if we had an "option" enum like this:</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">enum __attribute((enum_style(<wbr class="">option, closed)))__ MyEnum {</div><div dir="auto" style="word-wrap: break-word;" class=""> <span class="Apple-converted-space"> </span>E1 = 1, E2 = 2,</div><div dir="auto" style="word-wrap: break-word;" class="">};</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">and the user requests a code completion for this:</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">MyEnum = E1 | <esc></div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">code completion would recognize MyEnum is an “option" enum and could offer E2 as a suggestion. If MyEnum were a “choice" enum, it would offer no suggestions.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">Debugging experience can be improved too. For example, when a MyEnum variable is set to 3, the debugger could show “E1 | E2” instead of showing the raw value 3.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">3. clang importer</div><div dir="auto" style="word-wrap: break-word;" class="">In order to determine whether a C or ObjC enum maps to an enum or option set in swift, swift’s clang importer looks at whether the enum was declared using macros such as CF_ENUM or CF_OPTIONS (the macros are explained in the link below). With the proposed attribute, this hack can be removed.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class=""><div dir="auto" style="word-wrap: break-word;" class=""><a href="https://developer.apple.com/library/content/releasenotes/ObjectiveC/ModernizationObjC/AdoptingModernObjective-C/AdoptingModernObjective-C.html" target="_blank" class="">https://developer.apple.com/<wbr class="">library/content/releasenotes/<wbr class="">ObjectiveC/ModernizationObjC/<wbr class="">AdoptingModernObjective-C/<wbr class="">AdoptingModernObjective-C.html</a></div><div class=""><br class=""></div></div><div dir="auto" style="word-wrap: break-word;" class="">4. Optimization</div><div dir="auto" style="word-wrap: break-word;" class="">There is a command line option named -fstrict-enums, which allows the compiler to optimize code using the assumption that the value of an enum variable is in range. This option can safely be used only if it is known that the values the variables of enum types can take are always in range. With the new attribute, the compiler can focus on variables of enum types that are marked “closed” and optimize them and leave other variables unoptimized.</div><div dir="auto" style="word-wrap: break-word;" class=""><br class=""></div><div dir="auto" style="word-wrap: break-word;" class="">### What will change ###</div><div dir="auto" style="word-wrap: break-word;" class="">I plan to add support for the new attribute in Sema and change the code that issues warning. I’m not planning to work on the IRGen optimization that takes advantage of the information the new attribute provides. It will be left as future work.</div></div></div></div></div></div></div></div><br class="">______________________________<wbr class="">_________________<br class="">cfe-dev mailing list<br class=""><a href="mailto:cfe-dev@lists.llvm.org" class="">cfe-dev@lists.llvm.org</a><br class=""><a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev" rel="noreferrer" target="_blank" class="">http://lists.llvm.org/cgi-bin/<wbr class="">mailman/listinfo/cfe-dev</a></blockquote></div></div></div></div></blockquote></div><br class=""></body></html>