.to_numpy(copy=False) Runtime Error if Device Memory #201

Open
ax3l opened this issue Oct 6, 2023 · 2 comments
Labels
backend: cuda Specific to CUDA execution (GPUs) enhancement New feature or request

Comments


ax3l commented Oct 6, 2023

We allow users to call .to_numpy(copy=False) on arbitrary memory.

This is fine even with pure GPU memory, to either:

  • transport pointers around w/o access from the host or
  • use as managed memory from the host (read/write)

If the pointer is in GPU memory and not managed, we should instead raise a runtime exception with a hint to use .to_numpy(copy=True) or .to_cupy(copy=False), or to activate managed memory.
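A minimal sketch of that guard, using a mock array class; `is_host_accessible` and `data` are hypothetical attributes for illustration, not the actual pyAMReX API:

```python
class MockArray:
    """Stand-in for a pyAMReX array. `is_host_accessible` and `data`
    are hypothetical attributes used for illustration only."""

    def __init__(self, data, is_host_accessible):
        self.data = data
        self.is_host_accessible = is_host_accessible


def to_numpy(array, copy=False):
    """Sketch of the proposed guard: a zero-copy view of pure device
    memory raises instead of handing out a host-unreadable pointer."""
    if not copy and not array.is_host_accessible:
        raise RuntimeError(
            "Array is in device memory and not host-accessible. Use "
            ".to_numpy(copy=True), .to_cupy(copy=False), or activate "
            "managed memory."
        )
    # copy=True would imply a device-to-host transfer in the real binding
    return list(array.data) if copy else array.data
```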

We can use AMReX_GpuUtility.H for its isManaged, isDevicePtr, and isPinnedPtr helpers. It wraps cudaPointerGetAttributes and, once supported, similar functions for HIP and SYCL.
https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__UNIFIED.html#group__CUDART__UNIFIED_1gd89830e17d399c064a2f3c3fa8bb4390

@ax3l ax3l added enhancement New feature or request backend: cuda Specific to CUDA execution (GPUs) labels Oct 6, 2023

ax3l commented Oct 6, 2023

These functions can be quite expensive, so we should either use them sparingly or, alternatively, check the arenas if we know them.

The following situations in AMReX can create this:

  • in the default Arena with managed memory allowed
  • in an explicit managed Arena
  • in an explicit device Arena with managed memory allowed

Then, Arena has:

    // isDeviceAccessible and isHostAccessible can both be true.
    [[nodiscard]] virtual bool isDeviceAccessible () const;
    [[nodiscard]] virtual bool isHostAccessible () const;

    // Note that isManaged, isDevice and isPinned are mutually exclusive.
    // For memory allocated by cudaMalloc* etc., one of them returns true.
    // Otherwise, neither is true.
    [[nodiscard]] virtual bool isManaged () const;
    [[nodiscard]] virtual bool isDevice () const;
    [[nodiscard]] virtual bool isPinned () const;

where isHostAccessible() is what we need.
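As a sketch of how those predicates relate, here is a Python mock of the query interface; `MockArena` is an illustration under stated assumptions, not the AMReX class:

```python
from dataclasses import dataclass


@dataclass
class MockArena:
    """Mock of the Arena query interface quoted above. At most one of
    managed/device/pinned is true; all false means plain pageable host
    memory (e.g., from malloc)."""

    managed: bool = False
    device: bool = False
    pinned: bool = False

    def is_host_accessible(self) -> bool:
        # managed, pinned, and plain host memory can be read/written
        # from the host; pure device memory cannot
        return not self.device

    def is_device_accessible(self) -> bool:
        # assumption: managed, device, and pinned (mapped) memory are
        # all reachable from the GPU; pageable host memory is not
        return self.managed or self.device or self.pinned
```

With such a flag read, the `.to_numpy(copy=False)` path would be allowed exactly when `is_host_accessible()` returns true, with no per-pointer CUDA query.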


ax3l commented Oct 7, 2023

For MultiFab.array(mfi).to_numpy() we could go to the MultiFab level and add a:

MultiFab.to_numpy(mfi, ...)

function; that way we still have access to the Arena (or we implement the more costly helper calls on the pointer via AMReX_GpuUtility.H, as above).
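A sketch of that idea with a mock class; the class, its attributes, and the `to_numpy(mfi, ...)` signature are all hypothetical:

```python
class MockMultiFab:
    """Mock MultiFab: it knows which Arena its data lives in, so the
    accessibility check is a cheap flag read rather than a per-pointer
    cudaPointerGetAttributes call."""

    def __init__(self, arrays, arena_host_accessible):
        self._arrays = arrays  # mfi index -> data
        self.arena_host_accessible = arena_host_accessible

    def array(self, mfi):
        return self._arrays[mfi]

    def to_numpy(self, mfi, copy=False):
        # proposed MultiFab-level entry point: consult the Arena first
        if not copy and not self.arena_host_accessible:
            raise RuntimeError(
                "MultiFab data is not host-accessible; use copy=True, "
                ".to_cupy(), or a managed/pinned Arena."
            )
        arr = self.array(mfi)
        return list(arr) if copy else arr
```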
