I'm looking at the code in question: How do I choose grid and block dimensions for CUDA kernels? Which is a followup question from: CUDA how to get grid, block, thread size and parallalize non square matrix calculation
const int n = 128 * 1024;
int blocksize = 512; // value usually chosen by tuning and hardware constraints
int nblocks = n / nthreads; // value determine by block size and total work
madd<<<nblocks,blocksize>>>mAdd(A,B,C,n);
What is the difference between blocksize and nthreads? I'm thinking they are one in the same. Is this just a typo or am I missing something?