I can't seem to understand the wording for the CUDA kernel launch parameters <<<gridSize, blockSize>>>.
In the code I am reviewing, they are defined as:
const dim3 blockSize(1, 1, 1);
const dim3 gridSize(1, 1, 1);
If the hardcoded 1s were replaced with variables, would the following names be accurate:
const dim3 blockSize(nThreadsX, nThreadsY, nThreadsZ);
const dim3 gridSize(nBlocksX, nBlocksY, nBlocksZ);
where the maximum value of any argument to blockSize is set by the hardware (something like 512 or 1024?) and is the maximum number of threads that can run in a block along a single dimension?
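
For reference, the limits can be read from the device instead of guessed. The sketch below is only an illustration: the kernel name writeIndex, the sizes 8x4x2 and 2x2x1, and the use of device 0 are assumptions made for the example, not part of the code under review. It prints the per-dimension block limits (maxThreadsDim), the limit on total threads per block (maxThreadsPerBlock), and the per-dimension grid limits (maxGridSize), then launches with the naming proposed above. Note that a block has to satisfy both the per-dimension limits and the total limit, so the product nThreadsX * nThreadsY * nThreadsZ matters as well as each individual value.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread writes its flattened global index.
__global__ void writeIndex(int *out, int nx, int ny, int nz)
{
    // Global coordinates = block offset within the grid + thread offset within the block.
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    int z = blockIdx.z * blockDim.z + threadIdx.z;
    if (x < nx && y < ny && z < nz) {
        int i = (z * ny + y) * nx + x;
        out[i] = i;
    }
}

int main()
{
    // Query the hardware limits of device 0 (assumed to be the device in use).
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) {
        fprintf(stderr, "no usable CUDA device\n");
        return 1;
    }
    printf("max threads per block (total): %d\n", prop.maxThreadsPerBlock);
    printf("max block dims: %d x %d x %d\n",
           prop.maxThreadsDim[0], prop.maxThreadsDim[1], prop.maxThreadsDim[2]);
    printf("max grid dims:  %d x %d x %d\n",
           prop.maxGridSize[0], prop.maxGridSize[1], prop.maxGridSize[2]);

    // Example sizes (assumptions): 8*4*2 = 64 threads per block, 2*2*1 = 4 blocks.
    const int nThreadsX = 8, nThreadsY = 4, nThreadsZ = 2;
    const int nBlocksX = 2, nBlocksY = 2, nBlocksZ = 1;
    const dim3 blockSize(nThreadsX, nThreadsY, nThreadsZ);  // threads per block
    const dim3 gridSize(nBlocksX, nBlocksY, nBlocksZ);      // blocks per grid

    // Total extent covered by the launch in each dimension.
    const int nx = nThreadsX * nBlocksX;
    const int ny = nThreadsY * nBlocksY;
    const int nz = nThreadsZ * nBlocksZ;

    int *dOut = nullptr;
    if (cudaMalloc((void **)&dOut, sizeof(int) * nx * ny * nz) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    writeIndex<<<gridSize, blockSize>>>(dOut, nx, ny, nz);

    // A launch whose block or grid exceeds the limits is reported here as an error.
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess) err = cudaDeviceSynchronize();
    printf("launch status: %s\n", cudaGetErrorString(err));

    cudaFree(dOut);
    return err == cudaSuccess ? 0 : 1;
}
```

Compiled with nvcc, this should show whether a given blockSize is rejected on the hardware at hand: raising nThreadsX, nThreadsY, or nThreadsZ until the launch status changes makes the per-dimension and total limits visible.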