By: Michael S (already5chosen.delete@this.yahoo.com), September 21, 2021 10:17 am
Room: Moderated Discussions
-.- (blarg.delete@this.mailinator.com) on September 20, 2021 4:49 pm wrote:
> Michael S (already5chosen.delete@this.yahoo.com) on September 19, 2021 4:58 pm wrote:
> > https://github.com/already5chosen/others/tree/master/aligned_add_test
>
> Replacing all '32' with '64' in the cpp file, and adding this,
> compiling with the supplied commands and running on a 7820X:
>
The buffer that you are using for the 512-bit sum is twice as big as the one used by the other variants.
IMHO, its size is too close to the size of Skylake's L1D cache to be sure that we have a 100% hit rate.