[clang] 569faa4 - Add documentation for `__builtin_object_size`. (#96573)
via cfe-commits
cfe-commits at lists.llvm.org
Tue Jun 25 23:51:15 PDT 2024
Author: Richard Smith
Date: 2024-06-25T23:51:12-07:00
New Revision: 569faa4dc04cf6849057e56ef07b56ad27f6ef9e
URL: https://github.com/llvm/llvm-project/commit/569faa4dc04cf6849057e56ef07b56ad27f6ef9e
DIFF: https://github.com/llvm/llvm-project/commit/569faa4dc04cf6849057e56ef07b56ad27f6ef9e.diff
LOG: Add documentation for `__builtin_object_size`. (#96573)
Explicitly describe how the Clang builtin works, given that it's not
exactly the same as GCC's builtin of the same name -- but is drop-in
compatible.
Fixes #95635.
Added:
Modified:
clang/docs/LanguageExtensions.rst
Removed:
################################################################################
diff --git a/clang/docs/LanguageExtensions.rst b/clang/docs/LanguageExtensions.rst
index 9457e53304126..d9439d49a2e29 100644
--- a/clang/docs/LanguageExtensions.rst
+++ b/clang/docs/LanguageExtensions.rst
@@ -5422,16 +5422,85 @@ The ``#pragma comment(lib, ...)`` directive is supported on all ELF targets.
The second parameter is the library name (without the traditional Unix prefix of
``lib``). This allows you to provide an implicit link of dependent libraries.
-Evaluating Object Size Dynamically
-==================================
+Evaluating Object Size
+======================
+
+Clang supports the builtins ``__builtin_object_size`` and
+``__builtin_dynamic_object_size``. The semantics are compatible with GCC's
+builtins of the same names, but the details are slightly
diff erent.
+
+.. code-block:: c
+
+ size_t __builtin_[dynamic_]object_size(const void *ptr, int type)
+
+Returns the number of accessible bytes ``n`` past ``ptr``. The value returned
+depends on ``type``, which is required to be an integer constant between 0 and
+3:
+
+* If ``type & 2 == 0``, the least ``n`` is returned such that accesses to
+ ``(const char*)ptr + n`` and beyond are known to be out of bounds. This is
+ ``(size_t)-1`` if no better bound is known.
+* If ``type & 2 == 2``, the greatest ``n`` is returned such that accesses to
+ ``(const char*)ptr + i`` are known to be in bounds, for 0 <= ``i`` < ``n``.
+ This is ``(size_t)0`` if no better bound is known.
-Clang supports the builtin ``__builtin_dynamic_object_size``, the semantics are
-the same as GCC's ``__builtin_object_size`` (which Clang also supports), but
-``__builtin_dynamic_object_size`` can evaluate the object's size at runtime.
-``__builtin_dynamic_object_size`` is meant to be used as a drop-in replacement
-for ``__builtin_object_size`` in libraries that support it.
+.. code-block:: c
+
+ char small[10], large[100];
+ bool cond;
+ // Returns 100: writes of more than 100 bytes are known to be out of bounds.
+ int n100 = __builtin_object_size(cond ? small : large, 0);
+ // Returns 10: writes of 10 or fewer bytes are known to be in bounds.
+ int n10 = __builtin_object_size(cond ? small : large, 2);
+
+* If ``type & 1 == 0``, pointers are considered to be in bounds if they point
+ into the same storage as ``ptr`` -- that is, the same stack object, global
+ variable, or heap allocation.
+* If ``type & 1 == 1``, pointers are considered to be in bounds if they point
+ to the same subobject that ``ptr`` points to. If ``ptr`` points to an array
+ element, other elements of the same array, but not of enclosing arrays, are
+ considered in bounds.
+
+.. code-block:: c
+
+ struct X { char a, b, c; } x;
+ static_assert(__builtin_object_size(&x, 0) == 3);
+ static_assert(__builtin_object_size(&x.b, 0) == 2);
+ static_assert(__builtin_object_size(&x.b, 1) == 1);
+
+.. code-block:: c
-For instance, here is a program that ``__builtin_dynamic_object_size`` will make
+ char a[10][10][10];
+ static_assert(__builtin_object_size(&a, 1) == 1000);
+ static_assert(__builtin_object_size(&a[1], 1) == 900);
+ static_assert(__builtin_object_size(&a[1][1], 1) == 90);
+ static_assert(__builtin_object_size(&a[1][1][1], 1) == 9);
+
+The values returned by this builtin are a best effort conservative approximation
+of the correct answers. When ``type & 2 == 0``, the true value is less than or
+equal to the value returned by the builtin, and when ``type & 2 == 1``, the true
+value is greater than or equal to the value returned by the builtin.
+
+For ``__builtin_object_size``, the value is determined entirely at compile time.
+With optimization enabled, better results will be produced, especially when the
+call to ``__builtin_object_size`` is in a
diff erent function from the formation
+of the pointer. Unlike in GCC, enabling optimization in Clang does not allow
+more information about subobjects to be determined, so the ``type & 1 == 1``
+case will often give imprecise results when used across a function call boundary
+even when optimization is enabled.
+
+`The pass_object_size and pass_dynamic_object_size attributes <https://clang.llvm.org/docs/AttributeReference.html#pass-object-size-pass-dynamic-object-size>`_
+can be used to invisibly pass the object size for a pointer parameter alongside
+the pointer in a function call. This allows more precise object sizes to be
+determined both when building without optimizations and in the ``type & 1 == 1``
+case.
+
+For ``__builtin_dynamic_object_size``, the result is not limited to being a
+compile time constant. Instead, a small amount of runtime evaluation is
+permitted to determine the size of the object, in order to give a more precise
+result. ``__builtin_dynamic_object_size`` is meant to be used as a drop-in
+replacement for ``__builtin_object_size`` in libraries that support it. For
+instance, here is a program that ``__builtin_dynamic_object_size`` will make
safer:
.. code-block:: c
More information about the cfe-commits
mailing list