I implement my own gaussian filter by C++ and neon.
Pseudo codeļ¼
oneDimensionBlur(src, temp1, width, height) //implement by C++
transposeMatrix(temp1, temp2, width, height) //implement by neon
oneDimensionBlur(temp2, temp1, height, width)
transposeMatrix(temp1, dst, height, width)
But the cv::GaussianBlur() is almost 8 times faster than my implementation! I almost desperate for the profiling result. So I want to ask if opencv use any acceleration technology on gaussian blur? I try to track the opencv source code but failed.