#CodeePerformanceTip
Consider loop interchange to improve the locality of reference and enable vectorization.
Using loop interchange, the inefficient matrix access pattern is replaced with a more efficient one.
See the explanation: 🔗
zurl.co/uyjR
#performance