Performance analysis of K8 and Core online

Article: Performance Analysis for Core 2 and K8: Part 1
By: Michael S (already5chosen.delete@this.yahoo.com), October 30, 2008 6:32 pm
Room: Moderated Discussions
David Kanter (dkanter@realworldtech.com) on 10/30/08 wrote:
---------------------------
>James (alan@devonex.webhop.org) on 10/29/08 wrote:
>---------------------------
>>First quibble:
>>
>>> The next step was to collect the actual event-based
>>> sampling data with a 1MHz resolution (sampling every 1ms).
>>
>>Wouldn't 1 ms lead to a 1 kHz resolution? For 1 MHz, you'd want a sample every 1 microsecond (µs).
>
>Yup, that was a mistake that made it past the proof-reading : (
>
>That's the problem with having smart readers - they notice all your mistakes!
>
>Of course, that's the best way to learn as well.
>
>David
>

O.k. Then more nitpicks:

In Chart 2 (System Settings/System Bus) you use clock frequency for AMD vs. transfer rate for Intel. That's inconsistent.
It would be better to mention either clock rate vs. clock rate, i.e. 1000 MHz vs. 266 MHz, or transfer rate vs. transfer rate, i.e. 2000 MT/s vs. 1066 MT/s.
Also, in the same chart you mention nForce 590 SLI as a Northbridge. I'd rather say that on the AMD platform the Northbridge is integrated. The nForce chip better fits the Southbridge moniker.
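
To make the clock rate vs. transfer rate point concrete, here is a minimal sketch of the conversion. It assumes two transfers per clock for AMD's HyperTransport link (double data rate) and four per clock for Intel's quad-pumped FSB; the function name is made up for illustration, and the nominal 266 MHz is taken as 800/3 MHz.

```python
def transfer_rate_mts(clock_mhz, transfers_per_clock):
    """Return the bus transfer rate in MT/s for a clock given in MHz."""
    return clock_mhz * transfers_per_clock

# Assumption: HyperTransport moves data on both clock edges (2 per clock).
amd_mts = transfer_rate_mts(1000, 2)

# Assumption: the Intel FSB is quad-pumped (4 per clock); the nominal
# 266 MHz clock is really 800/3 MHz, usually marketed as "1066 MT/s".
intel_mts = transfer_rate_mts(800 / 3, 4)

print(f"AMD:   1000 MHz   -> {amd_mts:.0f} MT/s")
print(f"Intel: 266.67 MHz -> {intel_mts:.2f} MT/s")
```

Either pair of numbers is fine in the chart, as long as both columns use the same one.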

Same page, Figure 1:
Merom:
The 6MB L2 is Penryn, not the Merom that you tested.

Execution port 0 can't do SSE Shuffles.

Execution port 1 can't do SSE MUL, it can do FP/SSE MOVE and Logic + 64-bit fixpoint shuffle (or 128bit fixpoint shuffle at reduced performance).

Execution port 2 does the complete Integer/FP load, not just Load Address. Memory data arrives at the inner core through writeback port 2.

K8:
Unlike Intel, where all external memory/IO accesses travel through the L2 cache, on K8 the SRQ is actually attached directly to all three caches.

IMHO, you should draw the L1 TLBs on the right (system) side of the respective caches. This way you make clear that AMD L1 caches, while physically tagged, are virtually indexed. AMD itself certainly draws L1 TLBs on the system side.
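
A rough sketch of why the virtually-indexed part matters: it counts how many set-index bits lie above the page offset and therefore cannot come from the untranslated part of the address. The geometry used (64 KB, 2-way, 64-byte lines, 4 KB pages) is the commonly cited K8 L1 configuration and is an assumption here, not something taken from the article; the function name is hypothetical and all sizes are assumed to be powers of two.

```python
def index_bits_above_page_offset(cache_bytes, ways, line_bytes, page_bytes=4096):
    """Count set-index bits that fall above the page offset.

    A result greater than zero means the cache cannot be indexed with
    page-offset bits alone, so it must either wait for the TLB or be
    indexed with virtual address bits. Sizes must be powers of two.
    """
    sets = cache_bytes // (ways * line_bytes)
    line_offset_bits = line_bytes.bit_length() - 1   # log2(line size)
    index_bits = sets.bit_length() - 1               # log2(number of sets)
    page_offset_bits = page_bytes.bit_length() - 1   # log2(page size)
    top_index_bit = line_offset_bits + index_bits    # exclusive upper bound
    return max(0, top_index_bit - page_offset_bits)

# Assumed K8 L1 geometry: 64 KB, 2-way, 64-byte lines, 4 KB pages.
k8_extra_bits = index_bits_above_page_offset(64 * 1024, 2, 64)
print("index bits above the page offset:", k8_extra_bits)
```

With those assumed numbers the index spans address bits 6 to 14 while the page offset covers only bits 0 to 11, which is why the TLB sits beside the cache lookup rather than in front of it.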

The arrows between the L1D and LSU_1 create an impression that the LSUs are fully symmetric and can sustain any combination of loads and stores. That's incorrect. The K8 L1D cache can sustain at most one store per clock.