Article: CELL Microprocessor III
By: David Wang (dwang.delete@this.RWTexpanded.com), August 3, 2005 11:28 am
Room: Moderated Discussions
Panajev2001a (g_marocchi@yahoo.com) on 8/3/05 wrote:
---------------------------
>David Wang (dwang@RWTexpanded.com) on 8/2/05 wrote:
>---------------------------
>>fastpathguru (fastpathguru@yahoo.com) on 8/2/05 wrote:
>>---------------------------
>>
>>>Re: The Mystery of the Missing FLOPS, is it possible that it's solved by accounting
>>>for the extra meat in the PPE? I didn't see that mentioned as a possibility.
>>
>>You need 68 flops per cycle to get 218 GFlops @ 3.2 GHz.
>>
>>It's easy to get 56 of those flops from 7 SPE's, using FMADD, but then you'd need
>>12 flops per cycle out of the PPE to get to 68.
>>
>>Again, assume that you can crank 8 Flops per cycle out with FMADD, you still need
>>4 flops per cycle. The PPE is multi-threaded, but the impression that I got from
>>the IBM folks is that the scalar and vector FP units are actually shared so you
>>really can't get more flops out of the PPE either. Regardless, even if the FP units
>>aren't shared, you still can't get 4 flops per cycle out of the scalar side of thing.
>>
>>So no, the beefier PPE can't get to 12 flops per cycle even if it had independent scalar/vector FP units.
>
>Why not ? See Gekko ;).
I've forgotten about Gekko entirely.
http://www.hotchips.org/archives/hc13/2_Mon/06ibm-gekko.pdf
So the 12 Flops/cycle can be attained by the PPE if the PPE's scalar FPU can do a 2 wide SP FMADD in addition to a separate and independent Altivec unit. That does make things quite curious. I'm not sure how many programmers will be able to make use of this capability, but I think you're right, the Gekko ISA extension to the DD2 PPE explanation sounds to be the most reasonable explanation that I have come across about how Sony was able to claim the 218 GFlops rating with only 7 SPE's @ 3.2 GHz.
Perhaps you can write a rebuttal article and we can link it to the current article. I think the readership in general will benefit from this discussion.
---------------------------
>David Wang (dwang@RWTexpanded.com) on 8/2/05 wrote:
>---------------------------
>>fastpathguru (fastpathguru@yahoo.com) on 8/2/05 wrote:
>>---------------------------
>>
>>>Re: The Mystery of the Missing FLOPS, is it possible that it's solved by accounting
>>>for the extra meat in the PPE? I didn't see that mentioned as a possibility.
>>
>>You need 68 flops per cycle to get 218 GFlops @ 3.2 GHz.
>>
>>It's easy to get 56 of those flops from 7 SPE's, using FMADD, but then you'd need
>>12 flops per cycle out of the PPE to get to 68.
>>
>>Again, assume that you can crank 8 Flops per cycle out with FMADD, you still need
>>4 flops per cycle. The PPE is multi-threaded, but the impression that I got from
>>the IBM folks is that the scalar and vector FP units are actually shared so you
>>really can't get more flops out of the PPE either. Regardless, even if the FP units
>>aren't shared, you still can't get 4 flops per cycle out of the scalar side of thing.
>>
>>So no, the beefier PPE can't get to 12 flops per cycle even if it had independent scalar/vector FP units.
>
>Why not ? See Gekko ;).
I've forgotten about Gekko entirely.
http://www.hotchips.org/archives/hc13/2_Mon/06ibm-gekko.pdf
So the 12 Flops/cycle can be attained by the PPE if the PPE's scalar FPU can do a 2 wide SP FMADD in addition to a separate and independent Altivec unit. That does make things quite curious. I'm not sure how many programmers will be able to make use of this capability, but I think you're right, the Gekko ISA extension to the DD2 PPE explanation sounds to be the most reasonable explanation that I have come across about how Sony was able to claim the 218 GFlops rating with only 7 SPE's @ 3.2 GHz.
Perhaps you can write a rebuttal article and we can link it to the current article. I think the readership in general will benefit from this discussion.
Topic | Posted By | Date |
---|---|---|
New CELL Article Online | David Kanter | 2005/08/02 11:32 AM |
New CELL Article Online | mas | 2005/08/02 12:46 PM |
New CELL Article Online | mas | 2005/08/02 12:53 PM |
New CELL Article Online | David Wang | 2005/08/02 01:46 PM |
New CELL Article Online | fastpathguru | 2005/08/02 04:05 PM |
New CELL Article Online | David Wang | 2005/08/02 06:27 PM |
New CELL Article Online | Panajev2001a | 2005/08/03 03:26 AM |
New CELL Article Online | David Wang | 2005/08/03 11:28 AM |
New CELL Article Online | Deadmeat | 2005/08/04 01:05 PM |
New CELL Article Online | David Wang | 2005/08/04 05:47 PM |
New CELL Article Online | Deadmeat | 2005/08/04 07:04 PM |
New CELL Article Online | john evans | 2005/08/04 08:30 PM |
New CELL Article Online | Deadmeat | 2005/08/05 12:10 PM |
New CELL Article Online | Linus Torvalds | 2005/08/05 06:21 PM |
New CELL Article Online | Deadmeat | 2005/08/05 07:33 PM |
New CELL Article Online | fastpathguru | 2005/08/05 10:36 PM |
New CELL Article Online | john evans | 2005/08/05 10:51 PM |
New CELL Article Online | Deadmeat | 2005/08/06 04:09 AM |
New CELL Article Online | fastpathguru | 2005/08/06 06:29 AM |
New CELL Article Online | Deadmeat | 2005/08/07 04:06 PM |
New CELL Article Online | David Wang | 2005/08/04 09:03 PM |
New CELL Article Online | Deadmeat | 2005/08/05 12:21 PM |
New CELL Article Online | David Wang | 2005/08/05 11:51 PM |
New CELL Article Online | David Wang | 2005/08/06 12:00 AM |
New CELL Article Online | Deadmeat | 2005/08/07 03:39 PM |
New CELL Article Online | David Wang | 2005/08/08 01:57 PM |
New CELL Article Online | Deadmeat | 2005/08/08 02:55 PM |
New CELL Article Online | David Wang | 2005/08/08 03:37 PM |
New CELL Article Online | Deadmeat | 2005/08/08 05:05 PM |
New CELL Article Online | David Wang | 2005/08/08 05:47 PM |
New CELL Article Online | Deadmeat | 2005/08/08 06:25 PM |
Implausible at best, irrational most likely... | David Kanter | 2005/08/08 06:51 PM |
Implausible at best, irrational most likely... | Deadmeat | 2005/08/09 10:26 AM |
New CELL Article Online | David Wang | 2005/08/08 07:46 PM |
New CELL Article Online | Deadmeat | 2005/08/09 10:36 AM |
New CELL Article Online | David Wang | 2005/08/09 11:12 AM |
New CELL Article Online | Deadmeat | 2005/08/09 01:26 PM |
New CELL Article Online | David Wang | 2005/08/09 02:36 PM |
New CELL Article Online | Aaron Spink | 2005/08/09 02:57 PM |
New CELL Article Online | David Wang | 2005/08/10 10:06 AM |
New CELL Article Online | Serge Monkewitz | 2005/08/09 01:18 PM |
New CELL Article Online | Deadmeat | 2005/08/09 01:30 PM |
New CELL Article Online | Vitaly Vidmirov | 2005/08/11 01:36 AM |
New CELL Article Online | Anonymous | 2005/08/03 04:11 PM |
New CELL Article Online | fastpathguru | 2005/08/03 05:19 PM |
New CELL Article Online | mas | 2005/08/03 07:59 PM |
New CELL Article Online | José Javier Zarate | 2005/08/04 05:20 AM |
New CELL Article Online | mas | 2005/08/04 05:27 AM |
New CELL Article Online | mas | 2005/08/05 06:50 AM |
New CELL Article Online | PiedPiper | 2005/08/02 09:02 PM |