HIP: Heterogeneous-computing Interface for Portability
Functions | |
hipError_t | hipHccGetAccelerator (int deviceId, hc::accelerator *acc) |
Return hc::accelerator associated with the specified deviceId. | |
hipError_t | hipHccGetAcceleratorView (hipStream_t stream, hc::accelerator_view **av) |
Return hc::accelerator_view associated with the specified stream. | |
hipError_t | hipExtModuleLaunchKernel (hipFunction_t f, uint32_t globalWorkSizeX, uint32_t globalWorkSizeY, uint32_t globalWorkSizeZ, uint32_t localWorkSizeX, uint32_t localWorkSizeY, uint32_t localWorkSizeZ, size_t sharedMemBytes, hipStream_t hStream, void **kernelParams, void **extra, hipEvent_t startEvent=nullptr, hipEvent_t stopEvent=nullptr, uint32_t flags=0) |
Launches kernel f with the given launch parameters and shared memory on the stream, with kernel arguments passed through kernelParams or extra. | |
hipError_t | hipHccModuleLaunchKernel (hipFunction_t f, uint32_t globalWorkSizeX, uint32_t globalWorkSizeY, uint32_t globalWorkSizeZ, uint32_t localWorkSizeX, uint32_t localWorkSizeY, uint32_t localWorkSizeZ, size_t sharedMemBytes, hipStream_t hStream, void **kernelParams, void **extra, hipEvent_t startEvent=nullptr, hipEvent_t stopEvent=nullptr) __attribute__((deprecated("use hipExtModuleLaunchKernel instead"))) |
hipError_t hipExtModuleLaunchKernel | ( | hipFunction_t | f, |
uint32_t | globalWorkSizeX, | ||
uint32_t | globalWorkSizeY, | ||
uint32_t | globalWorkSizeZ, | ||
uint32_t | localWorkSizeX, | ||
uint32_t | localWorkSizeY, | ||
uint32_t | localWorkSizeZ, | ||
size_t | sharedMemBytes, | ||
hipStream_t | hStream, | ||
void ** | kernelParams, | ||
void ** | extra, | ||
hipEvent_t | startEvent = nullptr, | ||
hipEvent_t | stopEvent = nullptr, | ||
uint32_t | flags = 0 | ||
) |
Launches kernel f with the given launch parameters and shared memory on the stream, with kernel arguments passed through kernelParams or extra.
[in] | f | Kernel to launch. |
[in] | globalWorkSizeX | X grid dimension specified in work-items |
[in] | globalWorkSizeY | Y grid dimension specified in work-items |
[in] | globalWorkSizeZ | Z grid dimension specified in work-items |
[in] | localWorkSizeX | X block dimension specified in work-items |
[in] | localWorkSizeY | Y block dimension specified in work-items |
[in] | localWorkSizeZ | Z block dimension specified in work-items |
[in] | sharedMemBytes | Amount of dynamic shared memory to allocate for this kernel. The kernel can access this with HIP_DYNAMIC_SHARED. |
[in] | hStream | Stream where the kernel should be dispatched. May be 0, in which case the default stream is used with associated synchronization rules. |
[in] | kernelParams | |
[in] | extra | Pointer to kernel arguments. These are passed directly to the kernel and must be in the memory layout and alignment expected by the kernel. |
[in] | startEvent | If non-null, specified event will be updated to track the start time of the kernel launch. The event must be created before calling this API. |
[in] | stopEvent | If non-null, specified event will be updated to track the stop time of the kernel launch. The event must be created before calling this API. |
HIP/ROCm actually updates the start event when the associated kernel completes.
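The two points above that most often trip up callers are that the grid dimensions are given in work-items rather than in blocks, and that arguments passed through extra must already be laid out as the kernel expects. The following is a minimal host-side sketch of both, not part of the HIP API: globalWorkSize and KernelArgBuffer are hypothetical helper names, and the packing rule (each argument placed at its natural alignment) is an assumption that should be checked against the actual kernel signature.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical helper (not a HIP function): converts a launch described as
// "number of blocks x work-items per block" into the total work-item count
// along one axis, which is the unit the global work size parameters use.
// The caller is responsible for keeping the product within uint32_t range.
inline uint32_t globalWorkSize(uint32_t numBlocks, uint32_t localWorkSize) {
    return numBlocks * localWorkSize;
}

// Hypothetical helper (not a HIP type): packs kernel arguments into one
// contiguous byte buffer. Assumption: the kernel expects each argument at
// an offset aligned to that argument's natural alignment.
class KernelArgBuffer {
public:
    template <typename T>
    void push(const T& value) {
        static_assert(std::is_trivially_copyable<T>::value,
                      "kernel arguments must be trivially copyable");
        const std::size_t offset = alignUp(bytes_.size(), alignof(T));
        bytes_.resize(offset + sizeof(T));  // new padding bytes are zeroed
        std::memcpy(bytes_.data() + offset, &value, sizeof(T));
    }

    void* data() { return bytes_.data(); }
    std::size_t size() const { return bytes_.size(); }

private:
    // Rounds n up to the next multiple of alignment.
    static std::size_t alignUp(std::size_t n, std::size_t alignment) {
        return (n + alignment - 1) / alignment * alignment;
    }

    std::vector<unsigned char> bytes_;
};
```

With a buffer built this way, a caller would typically hand data() and the address of a size_t holding size() to the launch through extra, using the HIP_LAUNCH_PARAM_BUFFER_POINTER, HIP_LAUNCH_PARAM_BUFFER_SIZE and HIP_LAUNCH_PARAM_END markers from the HIP runtime headers, and pass globalWorkSize(numBlocks, localWorkSizeX) as globalWorkSizeX.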
hipError_t hipHccGetAccelerator | ( | int | deviceId, |
hc::accelerator * | acc | ||
) |
Return hc::accelerator associated with the specified deviceId.
hipError_t hipHccGetAcceleratorView | ( | hipStream_t | stream, |
hc::accelerator_view ** | av | ||
) |
Return hc::accelerator_view associated with the specified stream.
If stream is 0, the accelerator_view for the default stream is returned.