By: Jan Wassenberg (jan.wassenberg.delete@this.gmail.com), May 29, 2022 12:51 am
Room: Moderated Discussions
Adrian (a.delete@this.acm.org) on May 24, 2022 2:39 pm wrote:
> I have not tried this on more recent Intel CPUs, but in a measurement on Skylake Server CPUs
> (with 2 512-bit FMA units) done a few years ago, the ratio between the energies needed to
> compute some LINPACK benchmark in AVX-512 and in AVX2 (i.e. with 256-bit FMA/LD/ST) modes
> was around 5/6, so a little more than your maximum estimation, but not much more.
Interesting, can you share some pointers on how this was measured so I can try it for AVX-512 vs scalar? (I suspect that is a much larger difference than AVX2 vs AVX-512.)
> I have not tried this on more recent Intel CPUs, but in a measurement on Skylake Server CPUs
> (with 2 512-bit FMA units) done a few years ago, the ratio between the energies needed to
> compute some LINPACK benchmark in AVX-512 and in AVX2 (i.e. with 256-bit FMA/LD/ST) modes
> was around 5/6, so a little more than your maximum estimation, but not much more.
Interesting, can you share some pointers on how this was measured so I can try it for AVX-512 vs scalar? (I suspect that is a much larger difference than AVX2 vs AVX-512.)