By: Eric Bron (eric.bron.delete@this.zvisuel.privatefortest.com), April 11, 2013 12:30 pm
Room: Moderated Discussions
> Your alternative of prefetching 1-2KB of data past the end of
it's not my alternative, this is the common way to do it in all the examples I have seen, including compiler-generated code (I have probably missed a lot of examples, though)
> the array does not sound preferable to ~2 branch misses.
there is also the extra compare and well-predicted branch for each prefetch instruction, at each loop iteration
(it's more effective to mix regular code in between prefetch instructions)
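
to make the comparison concrete, here is a minimal sketch of the two variants, not code from any real project. The names sum_unguarded, sum_guarded and PREFETCH_AHEAD_BYTES are made up for illustration, and it assumes GCC or Clang (for __builtin_prefetch) on a target where a prefetch of an invalid address is just a hint and does not fault:

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical prefetch distance; the thread talks about 1-2KB ahead. */
#define PREFETCH_AHEAD_BYTES 1024

/* Variant 1: unguarded. The last iterations prefetch up to
 * PREFETCH_AHEAD_BYTES past the end of the array. The target address is
 * computed as an integer and never dereferenced. For brevity this issues
 * one prefetch per element; real code would issue one per cache line. */
static long sum_unguarded(const int *a, size_t n)
{
    long sum = 0;
    for (size_t i = 0; i < n; i++) {
        __builtin_prefetch((const void *)((uintptr_t)&a[i] + PREFETCH_AHEAD_BYTES));
        sum += a[i];
    }
    return sum;
}

/* Variant 2: guarded. Never prefetches past the end, at the cost of one
 * extra compare and one (well-predicted) branch per prefetch. */
static long sum_guarded(const int *a, size_t n)
{
    const uintptr_t end = (uintptr_t)(a + n); /* one past the last element */
    long sum = 0;
    for (size_t i = 0; i < n; i++) {
        uintptr_t target = (uintptr_t)&a[i] + PREFETCH_AHEAD_BYTES;
        if (target < end)
            __builtin_prefetch((const void *)target);
        sum += a[i];
    }
    return sum;
}
```

both return the same sum; the only difference is the per-prefetch guard. In the guarded version the condition changes outcome once, near the end of the array, and with several prefetches interleaved with regular code each one would need its own guard.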