Article: Escape From the Planet of x86
By: Paul DeMone (pdemone.delete@this.igs.net), June 19, 2003 7:53 pm
Room: Moderated Discussions
Bill Todd (billtodd@metrocast.net) on 6/19/03 wrote:
---------------------------
>One more interesting tidbit that I noticed in that presentation was the L3 latency
>of 14 clock cycles. My recollection is that McKinley's L3 latency was 12 cycles
>(though it may have been more in some situations - ISTR the range 12 - 15 cycles
>being mentioned once): does this indicate a slightly sub-linear improvement in
>L3 performance for Madison?
I found an old spreadsheet in which I modeled the Merced and McKInley cache
and memory hierarchies. The representative figure for average memory access
for a SPECint style workload for a 1 GHz McKinley was 2.34 cycles.
If you simply increase clock rate to 1.5 GHz this figure increases to 2.82 cycles.
Increasing L3 latency from 12 to 14 cycles increases this to 2.84 cycles. Now
if you reduce the L3 global miss rate by 29% to account for the capacity doubling
to 6 MB using the inverse square root rule of thumb then average memory latency
falls back to 2.42, a bit more than the McKinley figure we started with. Extending
that to the reported Madison+ (1.8 GHz, 9 MB) and the latency falls to 2.40 cycles.
And if the Madison+'s 9 MB L3 latency goes up another 2 cycles, to 16, we stay
at 2.42 cycles. All in all a remarkably scalable memory hierarchy that preserves
the system level infrastructure (a 1.8 GHz Madison+ will generate roughly the
same level of memory traffic as a 1 GHz McKinley yet yield perhaps 70% or
more higher performance on most CPU intensive workloads).
The specific numbers will vary from code to code of course but I think it is clear
that the increase in L3 latency is completely negligible compared to CPU clock/
memory access time scaling mismatch. And that in turn isn't a big issue since
another analysis I did based on published I2 CPI breakdown data for SPEC CPU
2k suggested McKinley's performance scaling factors were calculated at above
90% for both SPECInt and SPECfp and actually improved for Madison despite its
higher clock rate.
---------------------------
>One more interesting tidbit that I noticed in that presentation was the L3 latency
>of 14 clock cycles. My recollection is that McKinley's L3 latency was 12 cycles
>(though it may have been more in some situations - ISTR the range 12 - 15 cycles
>being mentioned once): does this indicate a slightly sub-linear improvement in
>L3 performance for Madison?
I found an old spreadsheet in which I modeled the Merced and McKInley cache
and memory hierarchies. The representative figure for average memory access
for a SPECint style workload for a 1 GHz McKinley was 2.34 cycles.
If you simply increase clock rate to 1.5 GHz this figure increases to 2.82 cycles.
Increasing L3 latency from 12 to 14 cycles increases this to 2.84 cycles. Now
if you reduce the L3 global miss rate by 29% to account for the capacity doubling
to 6 MB using the inverse square root rule of thumb then average memory latency
falls back to 2.42, a bit more than the McKinley figure we started with. Extending
that to the reported Madison+ (1.8 GHz, 9 MB) and the latency falls to 2.40 cycles.
And if the Madison+'s 9 MB L3 latency goes up another 2 cycles, to 16, we stay
at 2.42 cycles. All in all a remarkably scalable memory hierarchy that preserves
the system level infrastructure (a 1.8 GHz Madison+ will generate roughly the
same level of memory traffic as a 1 GHz McKinley yet yield perhaps 70% or
more higher performance on most CPU intensive workloads).
The specific numbers will vary from code to code of course but I think it is clear
that the increase in L3 latency is completely negligible compared to CPU clock/
memory access time scaling mismatch. And that in turn isn't a big issue since
another analysis I did based on published I2 CPI breakdown data for SPEC CPU
2k suggested McKinley's performance scaling factors were calculated at above
90% for both SPECInt and SPECfp and actually improved for Madison despite its
higher clock rate.
Topic | Posted By | Date |
---|---|---|
New Silicon Insider Article | David Kanter | 2003/06/17 03:39 PM |
Srockholm Syndrome | anonymous | 2003/06/17 03:50 PM |
Srockholm Syndrome | Nate Begeman | 2003/06/17 04:32 PM |
Srockholm Syndrome | anonymous | 2003/06/18 02:23 PM |
Srockholm Syndrome | Scott Robinson | 2003/06/20 08:25 AM |
New Silicon Insider Article | Bill Todd | 2003/06/17 09:51 PM |
New Silicon Insider Article | Alberto | 2003/06/18 07:29 AM |
New Silicon Insider Article | José Javier Zarate | 2003/06/18 10:16 AM |
New Silicon Insider Article | Bill Todd | 2003/06/18 03:10 PM |
New Silicon Insider Article | Nate Begeman | 2003/06/18 03:25 PM |
New Silicon Insider Article | Tvar' | 2003/06/18 03:41 PM |
New Silicon Insider Article | Alberto | 2003/06/18 03:58 PM |
New Silicon Insider Article | Tvar' | 2003/06/18 04:04 PM |
New Silicon Insider Article | Alberto | 2003/06/18 04:24 PM |
New Silicon Insider Article | Tvar' | 2003/06/18 04:32 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/18 04:13 PM |
New Silicon Insider Article | Tvar' | 2003/06/18 04:23 PM |
New Silicon Insider Article | mas | 2003/06/18 04:11 PM |
New Silicon Insider Article | Alberto | 2003/06/18 03:45 PM |
New Silicon Insider Article | Bill Todd | 2003/06/18 11:46 PM |
New Silicon Insider Article | David Wang | 2003/06/19 12:13 AM |
New Silicon Insider Article | Bill Todd | 2003/06/19 01:14 AM |
New Silicon Insider Article | David Wang | 2003/06/19 10:52 AM |
New Silicon Insider Article | Paul DeMone | 2003/06/18 04:04 PM |
New Silicon Insider Article | Bill Todd | 2003/06/18 11:28 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/19 12:43 AM |
New Silicon Insider Article | Rob Young | 2003/06/19 10:23 AM |
New Silicon Insider Article | Bill Todd | 2003/06/19 04:53 PM |
New Silicon Insider Article | David Wang | 2003/06/18 11:29 PM |
New Silicon Insider Article | Bill Todd | 2003/06/19 12:03 AM |
New Silicon Insider Article | José Javier Zarate | 2003/06/19 05:33 AM |
New Silicon Insider Article | mas | 2003/06/19 06:37 AM |
New Silicon Insider Article | Bill Todd | 2003/06/19 04:40 PM |
New Silicon Insider Article | David Wang | 2003/06/19 05:25 PM |
New Silicon Insider Article | Bill Todd | 2003/06/19 06:00 PM |
New Silicon Insider Article | Alberto | 2003/06/19 06:29 PM |
New Silicon Insider Article | Speedy | 2003/06/19 06:48 PM |
New Silicon Insider Article | Alberto | 2003/06/20 04:57 AM |
New Silicon Insider Article | David Wang | 2003/06/19 06:52 PM |
New Silicon Insider Article | Bill Todd | 2003/06/19 09:00 PM |
New Silicon Insider Article | Anonymous | 2003/06/20 02:20 AM |
New Silicon Insider Article | Paul DeMone | 2003/06/20 09:11 AM |
New Silicon Insider Article | Anonymous | 2003/06/22 04:48 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/22 05:49 PM |
New Silicon Insider Article | Vincent Diepeveen | 2003/06/22 06:25 PM |
New Silicon Insider Article | José Javier Zarate | 2003/06/22 07:55 PM |
New Silicon Insider Article | Anonymous | 2003/06/23 09:59 AM |
New Silicon Insider Article | Paul DeMone | 2003/06/19 07:53 PM |
New Silicon Insider Article | Bill Todd | 2003/06/19 08:53 PM |
New Silicon Insider Article | David Wang | 2003/06/19 09:08 PM |
New Silicon Insider Article | Bill Todd | 2003/06/20 02:28 AM |
New Silicon Insider Article | David Wang | 2003/06/20 11:35 AM |
New Silicon Insider Article | Paul DeMone | 2003/06/20 12:29 PM |
New Silicon Insider Article | Bill Todd | 2003/06/20 07:10 PM |
New Silicon Insider Article | Marc M. | 2003/06/21 06:06 AM |
New Silicon Insider Article | Bill Todd | 2003/06/21 12:07 PM |
New Silicon Insider Article | Bill Todd | 2003/06/20 07:01 PM |
New Silicon Insider Article | David Wang | 2003/06/20 07:52 PM |
New Silicon Insider Article | Bill Todd | 2003/06/20 08:53 PM |
New Silicon Insider Article | David Wang | 2003/06/20 09:14 PM |
New Silicon Insider Article | Vincent Diepeveen | 2003/06/20 09:52 PM |
New Silicon Insider Article | Marc M. | 2003/06/21 08:16 AM |
New Silicon Insider Article | Vincent Diepeveen | 2003/06/22 05:24 PM |
New Silicon Insider Article | Singh, S.R. | 2003/06/21 04:39 AM |
New Silicon Insider Article | David Wang | 2003/06/21 09:10 AM |
IPF Compilers | Nate Begeman | 2003/06/21 10:10 AM |
IPF Compilers | Paul DeMone | 2003/06/21 10:45 AM |
Use ILP to extract more ILP | Paul DeMone | 2003/06/20 11:48 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/20 09:06 AM |
New Silicon Insider Article | Singh, S.R. | 2003/06/20 10:41 AM |
New Silicon Insider Article | David Kanter | 2003/06/21 04:34 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/22 03:22 PM |
New Silicon Insider Article | Bill Todd | 2003/06/20 06:52 PM |
New Silicon Insider Article | Marc M. | 2003/06/21 08:54 AM |
New Silicon Insider Article | Daniel Gustafsson | 2003/06/19 12:12 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/20 03:20 PM |
New Silicon Insider Article | Bryan Gregory | 2003/06/20 02:14 PM |
New Silicon Insider Article | mas | 2003/06/20 02:43 PM |
New Silicon Insider Article | Paul DeMone | 2003/06/25 11:29 AM |
New Silicon Insider Article | José Javier Zarate | 2003/06/25 11:43 AM |
New Silicon Insider Article | Paul DeMone | 2003/06/25 11:52 AM |
lol, amazing coincidence :-) (NT) | mas | 2003/06/25 04:15 PM |
New Silicon Insider Article | Yoav | 2015/04/01 04:43 AM |