I am currently trying to implement matrix multiplication in ARM assembler. I have already read some tutorials about matrix calculations in assembler, and they all use the NEON instructions on a 3x3 or 4x4 float matrix. But that's quite different from what I want to do. I have two square int matrices (the number of rows and columns is the same for both), and my assembler function gets the size of the matrices as a parameter, so I can't write a matMul function for just 3x3 or 4x4 matrices like in the tutorials.
So my question is: what is the best and easiest way to do that? Is it even possible to use the NEON instructions with a 100x100 or even bigger matrix? Also, the single- and double-precision floating-point registers seem to be of no use to me, because I only have int numbers.
Another problem is that I'm almost totally new to ARM assembler, so I don't understand the NEON instructions completely.
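
To make the goal concrete, here is a minimal C sketch of the computation the assembler function should perform. The name `mat_mul`, the 32-bit element type, and the row-major flat-array layout are assumptions for illustration only. The sketch also assumes that the sums fit in a 32-bit int.

```c
#include <stddef.h>
#include <stdint.h>

/* Reference version of C = A * B for two n x n int matrices.
   Assumption: each matrix is stored row-major in a flat array,
   so element (i, j) lives at index i * n + j. */
void mat_mul(const int32_t *a, const int32_t *b, int32_t *c, size_t n)
{
    for (size_t i = 0; i < n; i++) {          /* row of A and C */
        for (size_t j = 0; j < n; j++) {      /* column of B and C */
            int32_t sum = 0;
            for (size_t k = 0; k < n; k++) {  /* dot product of row i and column j */
                sum += a[i * n + k] * b[k * n + j];
            }
            c[i * n + j] = sum;
        }
    }
}
```

The size `n` is only known at run time, so the loops cannot be unrolled for a fixed 3x3 or 4x4 case as in the tutorials.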