[all-commits] [llvm/llvm-project] 17f5b4: [OpenMP] Allocatable explicit member mapping fortr...

agozillon via All-commits all-commits at lists.llvm.org
Thu Nov 7 20:25:29 PST 2024


  Branch: refs/heads/users/agozillo/alloca-member-map-3
  Home:   https://github.com/llvm/llvm-project
  Commit: 17f5b48340e1b24146c8410e3c1e83a5e8aca2fa
      https://github.com/llvm/llvm-project/commit/17f5b48340e1b24146c8410e3c1e83a5e8aca2fa
  Author: agozillon <Andrew.Gozillon at amd.com>
  Date:   2024-11-07 (Thu, 07 Nov 2024)

  Changed paths:
    M offload/test/offloading/fortran/basic-target-region-1D-array-section.f90
    M offload/test/offloading/fortran/basic-target-region-3D-array-section.f90
    M offload/test/offloading/fortran/basic-target-region-3D-array.f90
    M offload/test/offloading/fortran/constant-arr-index.f90
    M offload/test/offloading/fortran/declare-target-vars-in-target-region.f90
    M offload/test/offloading/fortran/double-target-call-with-declare-target.f90
    M offload/test/offloading/fortran/dtype-array-constant-index-map.f90
    A offload/test/offloading/fortran/dtype-member-map-syntax-1.f90
    A offload/test/offloading/fortran/dtype-member-map-syntax-2.f90
    M offload/test/offloading/fortran/dump_map_tables.f90
    M offload/test/offloading/fortran/local-descriptor-map-regress.f90
    M offload/test/offloading/fortran/target-depend.f90
    M offload/test/offloading/fortran/target-map-all-common-block-members.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-alloca-array-of-dtype.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-alloca-array.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-and-alloca-array-v2.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-and-alloca-array.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-array-and-scalar.f90
    A offload/test/offloading/fortran/target-map-alloca-dtype-array-of-dtype.f90
    M offload/test/offloading/fortran/target-map-allocatable-array-section-1d-bounds.f90
    M offload/test/offloading/fortran/target-map-allocatable-array-section-3d-bounds.f90
    A offload/test/offloading/fortran/target-map-allocatable-dtype.f90
    M offload/test/offloading/fortran/target-map-allocatable-map-scopes.f90
    M offload/test/offloading/fortran/target-map-common-block.f90
    M offload/test/offloading/fortran/target-map-declare-target-link-common-block.f90
    M offload/test/offloading/fortran/target-map-derived-type-full-1.f90
    M offload/test/offloading/fortran/target-map-derived-type-full-2.f90
    M offload/test/offloading/fortran/target-map-derived-type-full-implicit-1.f90
    M offload/test/offloading/fortran/target-map-derived-type-full-implicit-2.f90
    M offload/test/offloading/fortran/target-map-double-large-nested-dtype-multi-member.f90
    M offload/test/offloading/fortran/target-map-double-nested-dtype-array-bounds.f90
    M offload/test/offloading/fortran/target-map-double-nested-dtype-double-array-bounds.f90
    M offload/test/offloading/fortran/target-map-double-nested-dtype-single-member.f90
    A offload/test/offloading/fortran/target-map-dtype-3d-alloca-array-with-bounds.f90
    A offload/test/offloading/fortran/target-map-dtype-alloca-and-non-alloca-array.f90
    A offload/test/offloading/fortran/target-map-dtype-alloca-array-and-non-alloca-dtype.f90
    A offload/test/offloading/fortran/target-map-dtype-alloca-array-of-dtype.f90
    A offload/test/offloading/fortran/target-map-dtype-alloca-array-with-bounds.f90
    A offload/test/offloading/fortran/target-map-dtype-allocatable-array.f90
    A offload/test/offloading/fortran/target-map-dtype-allocatable-scalar-and-array.f90
    M offload/test/offloading/fortran/target-map-dtype-arr-bounds-member-enter-exit-update.f90
    M offload/test/offloading/fortran/target-map-dtype-arr-bounds-member-enter-exit.f90
    M offload/test/offloading/fortran/target-map-dtype-explicit-individual-array-member.f90
    M offload/test/offloading/fortran/target-map-dtype-multi-explicit-array-3D-member-bounds.f90
    M offload/test/offloading/fortran/target-map-dtype-multi-explicit-array-member-bounds.f90
    M offload/test/offloading/fortran/target-map-dtype-multi-explicit-array-member.f90
    M offload/test/offloading/fortran/target-map-dtype-multi-explicit-member.f90
    M offload/test/offloading/fortran/target-map-enter-exit-allocatables.f90
    M offload/test/offloading/fortran/target-map-enter-exit-array-2.f90
    M offload/test/offloading/fortran/target-map-enter-exit-array-bounds.f90
    M offload/test/offloading/fortran/target-map-enter-exit-array.f90
    M offload/test/offloading/fortran/target-map-enter-exit-scalar.f90
    M offload/test/offloading/fortran/target-map-first-common-block-member.f90
    M offload/test/offloading/fortran/target-map-individual-dtype-member-map.f90
    M offload/test/offloading/fortran/target-map-large-nested-dtype-multi-member.f90
    M offload/test/offloading/fortran/target-map-mix-imp-exp-common-block-members.f90
    A offload/test/offloading/fortran/target-map-multi-alloca-dtypes-with-multi-alloca-members.f90
    A offload/test/offloading/fortran/target-map-multi-alloca-dtypes-with-multi-mixed-members.f90
    A offload/test/offloading/fortran/target-map-nested-alloca-dtype-3d-alloca-array-bounds.f90
    A offload/test/offloading/fortran/target-map-nested-alloca-dtype-alloca-array-bounds.f90
    A offload/test/offloading/fortran/target-map-nested-dtype-3d-alloca-array-with-bounds.f90
    A offload/test/offloading/fortran/target-map-nested-dtype-alloca-and-non-alloca-array.f90
    A offload/test/offloading/fortran/target-map-nested-dtype-alloca-array-and-non-alloca-dtype.f90
    A offload/test/offloading/fortran/target-map-nested-dtype-alloca-array-with-bounds.f90
    A offload/test/offloading/fortran/target-map-nested-dtype-alloca-array.f90
    M offload/test/offloading/fortran/target-map-nested-dtype-complex-member.f90
    M offload/test/offloading/fortran/target-map-nested-dtype-derived-member.f90
    M offload/test/offloading/fortran/target-map-nested-dtype-multi-member.f90
    M offload/test/offloading/fortran/target-map-nested-dtype-single-member.f90
    M offload/test/offloading/fortran/target-map-pointer-scopes-enter-exit.f90
    M offload/test/offloading/fortran/target-map-pointer-target-array-section-3d-bounds.f90
    M offload/test/offloading/fortran/target-map-pointer-target-scopes.f90
    A offload/test/offloading/fortran/target-map-pointer-to-dtype-allocatable-member.f90
    M offload/test/offloading/fortran/target-map-second-common-block-member.f90
    M offload/test/offloading/fortran/target-map-two-dtype-explicit-member.f90
    M offload/test/offloading/fortran/target-map-two-dtype-individual-member-array-1D-bounds.f90
    M offload/test/offloading/fortran/target-map-two-dtype-mixed-implicit-explicit-capture-1.f90
    M offload/test/offloading/fortran/target-map-two-dtype-mixed-implicit-explicit-capture-2.f90
    M offload/test/offloading/fortran/target-map-two-dtype-multi-member-array-1D-bounds.f90
    M offload/test/offloading/fortran/target-map-two-nested-dtype-member-array-map.f90
    M offload/test/offloading/fortran/target-nested-target-data.f90
    M offload/test/offloading/fortran/target-region-implicit-array.f90

  Log Message:
  -----------
  [OpenMP] Allocatable explicit member mapping fortran offloading tests

This PR is one in a series of 3 that aim to add support for explicit member mapping of
allocatable components in derived types within OpenMP+Fortran for Flang.

This PR provides all of the runtime tests that are currently upstreamable, unfortunately
some of the other tests would require linking of the fortran runtime for offload which
we currently do not do. But regardless, this is plenty to ensure that the mapping is
working in most cases.


  Commit: 7d27486647bbaee3936fe81351e18c89e192f368
      https://github.com/llvm/llvm-project/commit/7d27486647bbaee3936fe81351e18c89e192f368
  Author: agozillon <Andrew.Gozillon at amd.com>
  Date:   2024-11-07 (Thu, 07 Nov 2024)

  Changed paths:
    M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
    M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
    M mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
    M mlir/test/Dialect/OpenMP/ops.mlir
    R mlir/test/Target/LLVMIR/omptarget-fortran-allocatable-types-host.mlir
    A mlir/test/Target/LLVMIR/omptarget-nested-ptr-record-type-mapping-host.mlir
    M mlir/test/Target/LLVMIR/omptarget-nested-record-type-mapping-host.mlir
    A mlir/test/Target/LLVMIR/omptarget-record-type-with-ptr-member-host.mlir

  Log Message:
  -----------
  [OpenMP][MLIR] Descriptor explicit member map lowering changes

This is one of 3 PRs in a PR stack that aims to add support for explicit mapping of
allocatable members in derived types.

The primary changes in this PR are the OpenMPToLLVMIRTranslation.cpp changes,
which are small and seek to alter the current member mapping to add an
additional map insertion for pointers. Effectively, if the member is a pointer
(currently indicated by having a varPtrPtr field) we add an additional map for
the pointer and then alter the subsequent mapping of the member (the data)
to utilise the member rather than the parents base pointer. This appears to be
necessary in certain cases when mapping pointer data within record types to
avoid segfaulting on device (due to incorrect data mapping). In general this
record type mapping may be simplifiable in the future.

There are also additions of tests which should help to showcase the affect
of the changes above.


  Commit: 2a5db632ea848ed0e8541fc0c07ea4bfaa6a5761
      https://github.com/llvm/llvm-project/commit/2a5db632ea848ed0e8541fc0c07ea4bfaa6a5761
  Author: agozillon <Andrew.Gozillon at amd.com>
  Date:   2024-11-07 (Thu, 07 Nov 2024)

  Changed paths:
    M flang/include/flang/Optimizer/Builder/FIRBuilder.h
    M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
    M flang/lib/Lower/OpenMP/ClauseProcessor.h
    M flang/lib/Lower/OpenMP/Clauses.h
    M flang/lib/Lower/OpenMP/OpenMP.cpp
    M flang/lib/Lower/OpenMP/Utils.cpp
    M flang/lib/Lower/OpenMP/Utils.h
    M flang/lib/Optimizer/Builder/FIRBuilder.cpp
    M flang/lib/Optimizer/OpenMP/MapInfoFinalization.cpp
    M flang/lib/Optimizer/OpenMP/MapsForPrivatizedSymbols.cpp
    M flang/test/Fir/convert-to-llvm-openmp-and-fir.fir
    M flang/test/Integration/OpenMP/map-types-and-sizes.f90
    M flang/test/Lower/OpenMP/allocatable-array-bounds.f90
    M flang/test/Lower/OpenMP/allocatable-map.f90
    M flang/test/Lower/OpenMP/array-bounds.f90
    M flang/test/Lower/OpenMP/declare-target-link-tarop-cap.f90
    A flang/test/Lower/OpenMP/derived-type-allocatable-map.f90
    M flang/test/Lower/OpenMP/derived-type-map.f90
    M flang/test/Lower/OpenMP/target.f90
    M flang/test/Transforms/omp-map-info-finalization.fir

  Log Message:
  -----------
  [Flang][OpenMP] Derived type explicit allocatable member mapping

This PR is one of 3 in a PR stack, this is the primary change set which seeks
to extend the current derived type explicit member mapping support to
handle descriptor member mapping at arbitrary levels of nesting. The PR
stack seems to do this reasonably (from testing so far) but as you can
create quite complex mappings with derived types (in particular when adding
allocatable derived types or arrays of allocatable derived types) I imagine
there will be hiccups, which I am more than happy to address. There will
also be further extensions to this work to handle the implicit auto-magical
mapping of descriptor members in derived types and a few other changes
planned for the future (with some ideas on optimizing things).

The changes in this PR primarily occur in the OpenMP lowering and
the OMPMapInfoFinalization pass.

In the OpenMP lowering several utility functions were added or extended
to support the generation of appropriate intermediate member mappings
which are currently required when the parent (or multiple parents) of a
mapped member are descriptor types. We need to map the entirety of
these types or do a "deep copy" for lack of a better term, where we map
both the base address and the descriptor as without the copying of both
of these we lack the information in the case of the descriptor to access the
member or attach the pointers data to the pointer and in the latter case we
require the base address to map the chunk of data. Currently we do not
segment descriptor based derived types as we do with regular
non-descriptor derived types, we effectively map their entirety in all
cases at the moment, I hope to address this at some point in the future as
it adds a fair bit of a performance penalty to having nestings of allocatable
derived types as an example. The process of mapping all intermediate
descriptor members in a members path only occurs if a member has
an allocatable or object parent in its symbol path or the member itself
is a member or allocatable. This occurs in the
createParentSymAndGenIntermediateMaps function, which will also
generate the appropriate address for the allocatable member
within the derived type to use as a the varPtr field of the map (for
intermediate allocatable maps and final allocatable mappings). In
this case it's necessary as we can't utilise the usual Fortran::lower
functionality such as gatherDataOperandAddrAndBounds without
causing issues later in the lowering due to extra allocas being spawned
which seem to affect the pointer attachment (at least this is my
current assumption, it results in memory access errors on the device
due to incorrect map information generation). This is similar
to why we do not use the MLIR value generated for this and utilise
the original symbol provided when mapping descriptor types external
to derived types. Hopefully this can be rectified in the future so this
function can be simplified and more closely aligned to the other type
mappings. We also make use of fir::CoordinateOp as opposed to the
HLFIR version as the HLFIR version doesn't support the appropriate
lowering to FIR necessary at the moment, we also cannot use a
single CoordinateOp (similarly to a single GEP) as when we index
through a descriptor operation (BoxType) we encounter issues later
in the lowering, however in either case we need access to intermediate
descriptors so individual CoordinateOp's aid this (although, being
able to compress them into a smaller amount of CoordinateOp's may
simplify the IR and perhaps result in a better end product, something
to consider for the future).

The other large change area was in the OMPMapInfoFinalization pass,
where the pass had to be extended to support the expansion of box
types (or multiple nestings of box types) within derived types, or box
type derived types. This requires expanding each BoxType mapping
from one into two maps and then modifying all of the existing
member indices of the overarching parent mapping to account for
the addition of these new members alongside adjusting the existing
member indices to support the addition of these new maps which
extend the original member indices (as a base address of a box type
is currently considered a member of the box type at a position of
0 as when lowered to LLVM-IR it's a pointer contained at this position
in the descriptor type, however, this means extending mapped children
of this expanded descriptor type to additionally incorporate the new
member index in the correct location in its own index list). I believe
there is a reasonable amount of comments that should aid in
understanding this better, alongside the test alterations for the pass.

A subset of the changes were also aimed at making some of the utilities
for packing and unpacking the DenseIntElementsAttr
containing the member indices shareable across the lowering and
OMPMapInfoFinalization, this required moving some functions to the
Lower/Support/Utils.h header, and transforming the lowering structure
containing the member index data into something more similar to the
version used in OMPMapInfoFinalization. There we also some other
attempts at tidying things up in relation to the member index data
generation in the lowering, some of which required creating a logical
operator for the OpenMP ID class so it can be utilised as a map key
(it simply utilises the symbol address for the moment as ordering
isn't particularly important).

Otherwise I have added a set of new tests encompassing some of
the mappings currently supported by this PR (unfortunately as
you can have arbitrary nestings of all shapes and types it's not
very feasible to cover them all).


Compare: https://github.com/llvm/llvm-project/compare/c7016d22d27c...2a5db632ea84

To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list