Good point

Article: AMD's Cayman GPU Architecture
By: David Kanter (dkanter.delete@this.realworldtech.com), December 17, 2010 4:45 pm
Room: Moderated Discussions
Triskaine (whatever@who?cares.com) on 12/17/10 wrote:
---------------------------
>David Kanter (dkanter@realworldtech.com) on 12/17/10 wrote:
>---------------------------
>>Heikki Kultala (hkultala@iki.NOSPAM.fi) on 12/17/10 wrote:
>>---------------------------
>>>Page 11
>>>
>>>"The projected Cayman has 17% higher memory bandwidth, roughly 2.5X the raw single
>>>precision FLOP/s and 26% higher raw double precision FLOP/s than the Tesla C2070.
>>>However, AMD's VLIW microarchitecture is inherently less efficient, and the memory
>>>hierarchy is also incredibly sensitive to the workload. For single precision applications
>>>that are primarily regular computation and regular memory access patterns, Cayman
>>>should have good utilization within each VLIW4 and offer incredibly attractive performance.
>>>Even in the case of a bandwidth bound application, Cayman will be on-par or slightly
>>>ahead of the Tesla. For double precision though, Fermi is likely to be the higher
>>>performance option. The two GPUs have similar raw performance, but the inefficiencies
>>>of the VLIW will drag down AMD's actual performance."
>>>
>>>But when AMD is running double-precision code, there is no VLIW-inefficiency as
>>>all the units are together executing same DP operation, and there is no need to
>>>find parallel operations to execute, it's acting like a scalar processor.
>>
>>That's a very good point. There are no VLIW multi-issue difficulties with 64-bit
>>instructions really. The rough edges around the rest of the chip would still be there...
>>
>>DK
>
>To further prove his point, here is a DGEMM implementation on Cypress that reaches 91% of peak DP performance.
>
>[url]http://www.phoronix.com/scan.php?page=news_item&px=ODg5OA[/url]
>
>
>

DGEMM is seriously the most pointless benchmark ever, especially for a GPU. It's totally trivial and tells you nothing about performance for real workloads.

Anyway, I changed the article a bit to use more consistent logic.

David