↧
Answer by Holmz for SIMD transpose when row size is greater than vector width
Perhaps one could use the fortran TRANSPOSE intrisic with ISO_C_BINDING and link that with C as a subroutine or function call. TRANSPOSE is pretty optimised in fortran. And mixed language skills are...
View ArticleAnswer by chtz for SIMD transpose when row size is greater than vector width
If all your matrix dimensions are a multiple of your packet-size you can do the operation block-wise and swap the blocks as needed. Example for 4x4 double matrix using SSE2: // transpose vectors i0 and...
View ArticleSIMD transpose when row size is greater than vector width
You can find a lot of good answers for transposing a matrix which falls with the natural size of the SIMD instruction set, in particular, where the size of one row is no more than the vector width....
View Article