I have gone through the CUDA Programming Guide, but it is still not clear to me where a CUDA kernel resides on the GPU. In other words, which memory segment does it reside in?
Also, how do I find out the maximum kernel size supported by my device? Does the maximum kernel size depend on the number of kernels loaded on the device simultaneously?