Article: Parallelism at HotPar 2010
By: Richard Cownie (tich.delete@this.pobox.com), August 3, 2010 10:33 am
Room: Moderated Discussions
Mark Roulo (nothanks@xxx.com) on 8/3/10 wrote:
---------------------------
>The Nehalem L2 caches (256 KB) are per-core and are not shared. The pre-Nehalem
>Intel chips had larger, shared L2s. On Nehalem, the trick is to not need the L3
>(which is shared, and which the cores *will* fight over).
Thanks, I didn't know that. A lot of good stuff has happened
in Nehalem which probably bends the CPU/GPU comparison
towards the CPU. But then maybe a lot of those CPU/GPU
comparisons in the literature are based on measuring older cpu's ?
---------------------------
>The Nehalem L2 caches (256 KB) are per-core and are not shared. The pre-Nehalem
>Intel chips had larger, shared L2s. On Nehalem, the trick is to not need the L3
>(which is shared, and which the cores *will* fight over).
Thanks, I didn't know that. A lot of good stuff has happened
in Nehalem which probably bends the CPU/GPU comparison
towards the CPU. But then maybe a lot of those CPU/GPU
comparisons in the literature are based on measuring older cpu's ?