I'm trying to profile my code using the NVIDIA profiler, but I'm getting strange gaps in the timeline, as shown below:
Note: the operations on both edges of each gap are cudaMemcpyAsync (host-to-device) copies.
I'm running Ubuntu 14.04 with the latest version of CUDA (8.0.61) and the latest NVIDIA display driver.
The display is driven by the Intel integrated graphics, not the NVIDIA card, so the NVIDIA GPU is only running my code and nothing else.
I've also enabled CPU profiling to check these gaps, but nothing is shown.
Also, no debugging options are enabled (neither -G nor -g); this is a release build.
My laptop's specs:
- Intel Core i7-4720HQ
- NVIDIA GeForce GTX 960M
- 16 GB DDR3 RAM
- 1 TB Hard Drive
Is there any way to trace what's happening in these empty time slots?
Thanks,


