cuda

Here are 2,504 public repositories matching this topic...

numba
gmarkall commented Nov 3, 2020

PR #6447 adds a public API to get the maximum number of registers per thread (numba.cuda.Dispatcher.get_regs_per_thread()). Other attributes would also be useful to expose: shared memory per block, local memory per thread, constant memory usage, and maximum block size.

These are all available in the FuncAttr named tuple: https://github.com/numba/numba/blob/master/numba/cuda/cudadrv/drive
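
As a rough illustration of the request, the sketch below shows how accessors for the remaining attributes could sit next to get_regs_per_thread(). It does not use numba and needs no GPU. The FuncAttr field names (regs, shared, local, const, maxthreads) are assumed to mirror the named tuple referenced above, the KernelAttributes class and every accessor name other than get_regs_per_thread are hypothetical, and the numeric values are made up.

```python
from collections import namedtuple

# Assumption: field names mirror the FuncAttr named tuple referenced above.
FuncAttr = namedtuple(
    "FuncAttr", ["regs", "shared", "local", "const", "maxthreads"]
)


class KernelAttributes:
    """Hypothetical stand-in for a compiled kernel; not numba's Dispatcher."""

    def __init__(self, attrs):
        self._attrs = attrs

    def get_regs_per_thread(self):
        # Same name as the accessor described for PR #6447.
        return self._attrs.regs

    # The accessor names below are illustrative only.
    def get_shared_mem_per_block(self):
        return self._attrs.shared

    def get_local_mem_per_thread(self):
        return self._attrs.local

    def get_const_mem_size(self):
        return self._attrs.const

    def get_max_threads_per_block(self):
        return self._attrs.maxthreads


# Made-up attribute values, for demonstration only.
info = KernelAttributes(
    FuncAttr(regs=32, shared=4096, local=0, const=128, maxthreads=1024)
)
print(info.get_regs_per_thread(), info.get_max_threads_per_block())
```

Keeping each attribute behind its own method follows the pattern of the existing accessor, so callers would not need to reach into the driver-level tuple directly.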

futhark
