Article: Parallelism at HotPar 2010
By: Mark Roulo (nothanks.delete@this.xxx.com), August 3, 2010 9:37 am
Room: Moderated Discussions
Richard Cownie (tich@pobox.com) on 8/3/10 wrote:
---------------------------
>Mark Roulo (nothanks@xxx.com) on 8/3/10 wrote:
>---------------------------
>
>>The Nehalem L2 caches (256 KB) are per-core and are not shared. The pre-Nehalem
>>Intel chips had larger, shared L2s. On Nehalem, the trick is to not need the L3
>>(which is shared, and which the cores *will* fight over).
>
>Thanks, I didn't know that. A lot of good stuff has happened
>in Nehalem which probably bends the CPU/GPU comparison
>towards the CPU. But then maybe a lot of those CPU/GPU
>comparisons in the literature are based on measuring older cpu's ?
>
They might be. That would be interesting to know, too.
-Mark Roulo
---------------------------
>Mark Roulo (nothanks@xxx.com) on 8/3/10 wrote:
>---------------------------
>
>>The Nehalem L2 caches (256 KB) are per-core and are not shared. The pre-Nehalem
>>Intel chips had larger, shared L2s. On Nehalem, the trick is to not need the L3
>>(which is shared, and which the cores *will* fight over).
>
>Thanks, I didn't know that. A lot of good stuff has happened
>in Nehalem which probably bends the CPU/GPU comparison
>towards the CPU. But then maybe a lot of those CPU/GPU
>comparisons in the literature are based on measuring older cpu's ?
>
They might be. That would be interesting to know, too.
-Mark Roulo