[cfe-dev] LibC++ v3.8 - Problems with ISO C wrapper headers

Wed Feb 3 02:35:28 PST 2016

The problem is that this *does* break existing code.

While I flagged the problem initially when I got ambiguities with '__fp16', this was simply due to the fact that every 6 months we align our out-of-tree implementation to the forthcoming formal numbered branch.  In this case I was actually updating from v3.7.1 to v3.8.

It is our intention to start tracking head, but we're not there yet or I would have caught this a lot earlier in its development.

When I am stabilising after one of these big-bang updates, I first of all address all the compile-time regressions in our test-suites.  The '__fp16' ambiguity simple revealed to me that ISO C names were now overloaded at the global namespace, even though '__fp16' is not meaningful to ISO C++.

But since then, I have been debugging runtime regressions, and these are far more difficult to debug.

A lot of the real-world application of our processor is math intensive, so the math libraries get hit on a lot.  But performance is also critical and programmers trade-off using 'float' and 'double' quite often for a variety of reasons.  Many of these math applications started off as FORTRAN and then were carefully ported and tuned to C.  It is with great difficulty that we try to get programmers to use C++, and when they do so, is usually so that they can get better abstraction for their existing code (encapsulation and templates collections mostly).

The main reasons that programmers chose between 'float' and 'double' (assume 'float' is IEEE FP32 and 'double' is IEEE FP64) are:

  o  Precision - when the problem needs to be more precise, they use 'double'
  o  Dynamic Range - when the numbers are very large, they use 'double'
  o  Space - large data sets take up a lot of memory, and programmers often
     choose to compromise precision for space by using 'float'
  o  Performance - on systems where FP32 arithmetic is more performant than
     FP64 arithmetic, programmers will sometimes sacrifice the precision of
     'double' for the performance of 'float'

I don't see a lot of programmers using FP128 'long double', not sure why not, but I guess that the loss of performance versus the gain in precision and dynamic range is not a viable trade-off for their programs.  We also have extensive use of FP16 where the precision and dynamic range take a back seat to raw "good enough" performance and probably space too.

But I will illustrate the kind of RWC that is broken by overloading the ISO C names in the global space.  The following is a reduction of a real example that regressed at runtime with this change, this is from a modelling library and is optimised for FP32.  I have culled the actual code, and left just the control-flow logic:

  // Inputs:  float inputA; float inputB;
  double upperRange = pow(inputA, inputB);

  if (isnan(upperRange))
    handleNanErrors(inputA, inputB);
  else if (isinf(upperRange))
    handleOutOfRangeErrors(inputA, inputB);
  else if ((upperRange <= __FLT_MAX__) && (upperRange >= __FLT_MIN__))
    useFP32OptimisedModel(upperRange, inputA, inputB);
  else
    useHighDynamicRangeModel(upperRange, inputA, inputB);

The observed problem is that the "useHighDynamicRangeModel" implementation was never being executed, and instead "handleOutOfRangeErrors" was being called far more often than previously.

When I analysed this, the problem is down to the overloading of '::pow'.  With ISO C, the function 'pow' takes two 'double' arguments, and returns a value of type 'double'.  The two input operands are 'float', so they are first promoted to 'double', and then 'pow(double,double)' is called, which yields a value of type 'double'.

With the overloading at the global namespace, the function '::pow(float,float)' is called, which in turn calls '::powf(float,float)' yielding a 'float' value which is then promoted to 'double' to initialise 'upperRange'.

But 'float' does not have the dynamic range of 'double', and many pairs of input operands of type 'float' yield results that exceed the dynamic range of 'float' and 'pow(float,float)' returns INFINITY which is retained on promotion to 'double'.  This in turn make 'isinf(upperRange)' evaluate to true when the value exceeds the dynamic range of FP32 and not when it exceeds the dynamic range of FP64 as was intended and expected by the program.

The original code was complaint ISO C, that was then migrated to C++, but as with the majority of legacy C programs, the changes made to operate with C++ are very minimal.

You could say "just put a cast to 'double' before the input operands" - and this will of course work in this case.  But it took me a few hours to find this problem buried as it was in the actual code, and of course I was blaming our target lowering as the probable cause until I realised that it was not a code-generation bug.  And I expect that related problems are going to be liberally strewn throughout legacy C code that is migrated or partially migrated to C++ in ways that will be very difficult to detect.

In my opinion, this was never the intent of the C++ Standard Committee, and I think that it is also an undesirable interpretation of the words in the Standard that will only server to further alienate C programmers from using C++; and this is already a difficult task in the embedded programming space where the majority of C programmers already hear enough FUD about C++ and performance.

My attachment of a patch at this stage is because I think that this is a significant enough issue for LibC++ v3.8 that it warrants urgent attention.  I am also aware that it will probably not get resolved unless somebody does some work to make it happen, and I have done some of that work which may serve as a starting point for others to complete (LibC++ maintainers).  I am not experiencing failures in the LibC++ test-suite from these changes with the exception of the 'depr.c.headers' test that I already mentioned and I have also not seen any failures in modules.

And these changes have addressed the regressions in the RWC that I use for verification.

While I admit the changes are not the ideal solution, I think that they represent an adequate minimalistic change to the sources that will offset the majority of the problems, but I am already working on a rewrite of '<cmath>/<math.h>' that I think will produce a better solution that addresses the objectives of C++ while retaining the intentions of ISO C compatibility.  And of course the other ISO C wrapper headers.  But I won't have this done in time for v3.8 code freeze.

All the best,

	MartinO

-----Original Message-----
From: metafoo at gmail.com [mailto:metafoo at gmail.com] On Behalf Of Richard Smith
Sent: 02 February 2016 21:23
To: James Knight
Cc: Richard Smith via cfe-dev; Martin J. O'Riordan
Subject: Re: [cfe-dev] LibC++ v3.8 - Problems with ISO C wrapper headers

On Tue, Feb 2, 2016 at 12:44 PM, James Knight <jyknight at google.com> wrote:
> The issue at hand is whether #include "math.h" may add the extra overloads to the global namespace, not whether #include <cmath> may do so. I've got no idea what the right answer is, but those do seem -- at least to me -- to be potentially distinct questions.

Yes, sorry, that issue doesn't say what I thought it did.

>> On Feb 2, 2016, at 3:22 PM, Richard Smith via cfe-dev <cfe-dev at lists.llvm.org> wrote:
>>
>> Also, see http://cplusplus.github.io/LWG/lwg-defects.html#2380 where 
>> LWG agreed that libc++'s current behavior is the desired behavior.
>>
>> On Tue, Feb 2, 2016 at 12:16 PM, Richard Smith <richard at metafoo.co.uk> wrote:
>>> This is not really the time to be sending patches; we have not yet 
>>> established what the desired direction is. Even if we agreed that 
>>> this was the right direction and that we wanted to regress our 
>>> conformance here, this patch is not acceptable as it breaks modules 
>>> support. (It also breaks the include guard optimization for many of 
>>> libc++'s
>>> headers.)
>>>
>>> On Tue, Feb 2, 2016 at 2:29 AM, Martin J. O'Riordan via cfe-dev 
>>> <cfe-dev at lists.llvm.org> wrote:
>>>> Sorry for the delay getting back to this.  I am attaching a revised 
>>>> patch (with respect to #259486) that addresses the issue that James commented on.
>>>> The test case:
>>>>
>>>>
>>>>
>>>> test/std/depr/depr.c.headers/math_h.pass.cpp
>>>>
>>>>
>>>>
>>>> fails with these changes, but this is because the test is expecting 
>>>> the names in ‘<math.h>’ to be overloaded in the global namespace.  
>>>> I also made some additional changes to ‘<stdio.h>’ to wrap the 
>>>> macros for ‘getc’, ‘putc’, etc. using the same pattern that is used 
>>>> to achieve the same task in ‘<math.h>’.  The ‘#undef’s for these 
>>>> was causing link failures for against our C library which does not 
>>>> provide callable functions for these and instead uses the macros 
>>>> provided by Newlib.  I liked the pattern used in ‘<math.h>’ and thought that it neatly addressed the requirements of C++.
>>>> Similar changes are probably a good idea for ‘<ctype.h>’ and 
>>>> ‘<wctype.h>’, but I did not implement these.
>>>>
>>>>
>>>>
>>>> Thanks,
>>>>
>>>>
>>>>
>>>>            MartinO
>>>>
>>>>
>>>>
>>>> From: Martin J. O'Riordan [mailto:martin.oriordan at movidius.com]
>>>> Sent: 27 January 2016 15:41
>>>> To: 'James Y Knight'
>>>> Cc: 'David Chisnall'; 'Clang Dev'
>>>> Subject: RE: [cfe-dev] LibC++ v3.8 - Problems with ISO C wrapper 
>>>> headers
>>>>
>>>>
>>>>
>>>> Hmm!  In order for it to matter, the program would have to do:
>>>>
>>>>
>>>>
>>>>    #include <math.h>
>>>>
>>>>    #include <cmath>
>>>>
>>>>
>>>>
>>>> which I think is unlikely, but:
>>>>
>>>>
>>>>
>>>>    #include <math.h>
>>>>
>>>>    #include “otherfile.h”
>>>>
>>>>
>>>>
>>>> where ‘otherfile.h’ then includes ‘<cmath>’, which is more likely 
>>>> to happen when ‘<cmath>’ is included by proxy after ‘<math.h>’.  
>>>> But you are correct, I hadn’t thought of this and the pattern would 
>>>> need to be refined to address this.  I did not come across this 
>>>> scenario in the LibC++ test-suite or my own.
>>>>
>>>>
>>>>
>>>> Thanks,
>>>>
>>>>
>>>>
>>>>           MartinO
>>>>
>>>>
>>>>
>>>> From: James Y Knight [mailto:jyknight at google.com]
>>>> Sent: 27 January 2016 15:29
>>>> To: Martin J. O'Riordan
>>>> Cc: David Chisnall; Clang Dev
>>>>
>>>>
>>>> Subject: Re: [cfe-dev] LibC++ v3.8 - Problems with ISO C wrapper 
>>>> headers
>>>>
>>>>
>>>>
>>>> This doesn't seem right -- won't it break code that does this:
>>>>
>>>> #include <math.h>
>>>>
>>>> #include <cmath>
>>>>
>>>> ? (since the "#ifdef _LIBCPP_INCLUDING_STDC_HEADER" is within the 
>>>> "#ifndef _LIBCPP_MATH_H" block, and thus only gets checked once)
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> On Wed, Jan 27, 2016 at 10:22 AM, Martin J. O'Riordan via cfe-dev 
>>>> <cfe-dev at lists.llvm.org> wrote:
>>>>
>>>> I suppose I am more focussed on embedded systems than hosted - thus 
>>>> the reference to 'newlib'; and I'm sure that there are many 
>>>> alternative STDC libraries that I have never heard of.  But I think 
>>>> that the LibC++ wrapper headers is still a good place to abstract 
>>>> such portability issues so that users of LibC++ are unaware of the 
>>>> ISO C header implementation that lies beneath.
>>>>
>>>>
>>>> I have also attached a patch file computed against the #258931 revision on:
>>>>
>>>>     
>>>> https://llvm.org/svn/llvm-project/libcxx/branches/release_38/includ
>>>> e
>>>>
>>>> Although many files have changed, the nature of the changes is quite simple.
>>>>
>>>>  o  In each of the '<cXXXX>' files I have inserted:
>>>>
>>>>       #define _LIBCPP_INCLUDING_STDC_HEADER
>>>>
>>>>     before each '#include <####.h>' ISO C header, and followed by:
>>>>
>>>>       #undef _LIBCPP_INCLUDING_STDC_HEADER
>>>>
>>>>     I have done this for all LibC++ headers that include an ISO C 
>>>> header even if
>>>>     it does not refer to a LibC++ wrapping header, in case one C header
>>>>     includes another in any particular ISO C header implementation.
>>>>
>>>>  o  In each of the '<XXXX.h>' ISO C header wrappers, where 
>>>> appropriate I have
>>>>     replaced:
>>>>
>>>>       #ifdef __cplusplus
>>>>       ...
>>>>       #endif // __cplusplus
>>>>
>>>>     with:
>>>>
>>>>       #ifdef _LIBCPP_INCLUDING_STDC_HEADER
>>>>       ...
>>>>       #endif // _LIBCPP_INCLUDING_STDC_HEADER
>>>>
>>>>  o  Before the C++ overloaded functions are introduced at global 
>>>> scope, I have
>>>>     inserted:
>>>>
>>>>       _LIBCPP_BEGIN_NAMESPACE_STD
>>>>
>>>>     and afterwards:
>>>>
>>>>       _LIBCPP_END_NAMESPACE_STD
>>>>
>>>>  o  Finally, in '<math.h>' in particular, I have prefixed the forwarded
>>>>     names with '::' since at this point in the 'using ::name;' has not yet
>>>>     been seen.
>>>>
>>>> The patch is a suggestion, and the '_LIBCPP_INCLUDING_STDC_HEADER' 
>>>> name I picked are just my idea for following the existing 
>>>> convention used by the
>>>> LibC++ maintainers.
>>>>
>>>>        MartinO
>>>>
>>>> -----Original Message-----
>>>> From: Dr D. Chisnall [mailto:dc552 at hermes.cam.ac.uk] On Behalf Of 
>>>> David Chisnall
>>>> Sent: 27 January 2016 9:25
>>>> To: Martin.ORiordan at Movidius.com
>>>> Cc: Craig, Ben; Clang Dev
>>>> Subject: Re: [cfe-dev] LibC++ v3.8 - Problems with ISO C wrapper 
>>>> headers
>>>>
>>>> On 26 Jan 2016, at 20:09, Martin J. O'Riordan via cfe-dev 
>>>> <cfe-dev at lists.llvm.org> wrote:
>>>>>
>>>>> And C++ also requires that some parts of the interfaces from C are 
>>>>> presented to C++ as functions - examples being ‘fpclassify’, ‘signbit’ etc.
>>>>> These have to be handled differently, and I think that the current 
>>>>> LibC++ approach to these is good and maintains semantic compatibility with C.
>>>>>
>>>>> I had intended to build a patch file of my changes today in case 
>>>>> anyone is interested, but other things got me busy.  I’ll do that 
>>>>> tomorrow and attach it to this thread.
>>>>>
>>>>> I think that it is a good idea to have LibC++ provide its own 
>>>>> wrapper headers for the C headers.  It is a logical place to deal 
>>>>> with portability issues that arise when referring to C headers 
>>>>> provided by newlib, uclibc or glibc (and other less well known 
>>>>> implementations).  The handling of these portability issues will 
>>>>> mean that the ‘c’ prefixed C++ headers will need little or no alteration and just maintain the ‘std’ namespace.
>>>>>
>>>>
>>>> I note that none of your listed libc implementations are the 
>>>> defaults on the operating systems that ship libc++ as the default 
>>>> C++ standard library implementation…
>>>>
>>>> It would be very helpful for people who are working on this to 
>>>> provide a set of recommendations for C library maintainers to make 
>>>> C++ interoperability easier.  That way, those of us who do have 
>>>> control over the libc headers can work from the other end to improve the situation.
>>>>
>>>> David
>>>>
>>>>
>>>> _______________________________________________
>>>> cfe-dev mailing list
>>>> cfe-dev at lists.llvm.org
>>>> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> cfe-dev mailing list
>>>> cfe-dev at lists.llvm.org
>>>> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>>>
>> _______________________________________________
>> cfe-dev mailing list
>> cfe-dev at lists.llvm.org
>> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>