Regarding relationship between cores and ranks of a MPI program

Question

As far as I know, in a multiprocessor environment any thread/process can be allocated to any core/processor so, what is meant by following line:

the number of MPI ranks used on an Intel Xeon Phi coprocessor should be substantially fewer than the number of cores in no small part because of limited memory on the coprocessor.

I mean, what are the issues if #cores <= #MPI Ranks ?

Levi Morrison Levi Morrison · Accepted Answer · 2014-01-08T18:56:48

That quote is correct only when it is applied to a memory size constrained problem; in general it would be an incorrect statement. In general you should use more tasks than you have physical cores on the Xeon Phi in order to hide memory latency¹.

To answer your question "What are the issues if the number of cores is fewer than the number of MPI ranks?": you run the risk of having too much context switching. On many problems it is advantageous to use more tasks than you have cores to hide memory latency².

^{1. I don't even feel like I need to cite a reference for this because how loudly it is advertised; however, they do mention it in an article on the OpenCL design document: http://software.intel.com/en-us/articles/opencl-design-and-programming-guide-for-the-intel-xeon-phi-coprocessor}

^{2. This advice applies to the Xeon Phi specifically, not necessarily other pieces of hardware.}

Regarding relationship between cores and ranks of a MPI program

2 Answers