I need to program for parallel block matrix calculation (Element by Element wise) using MPI. I programmed before row wise allocation to nodes for calculation. In current problem i need to divide tasks according to elements of resultant matrix.
I keep trying how to distribute the data to nodes but have not reached the solution. Could you please help me in understanding the logic about how to do that?