By: David Kanter (dkanter.delete@this.realworldtech.com), August 11, 2012 9:25 pm
Room: Moderated Discussions
jp (jasdfasdf.delete@this.gmail.com) on August 11, 2012 1:21 pm wrote:
> David Kanter (dkanter.delete@this.realworldtech.com) on July 25, 2012 1:37 am
> wrote:
> > New computational efficiency data shows GPUs with a clear edge over
> CPUs, but
> > the gap is narrowing as CPUs adopt wide vectors (e.g. AVX).
> Surprisingly, a
> > throughput CPU is the most energy efficient processor,
> offering hope for future
> > architectures. Our data also shows some
> advantages of AMD's Bulldozer, and the
> > overhead associated with highly
> scalable server CPUs.
> >
> > Comments and feedback
> > welcome!
> >
>
> > David
>
> Nvidia claims that GK110 (K20) will have a 3x improvement in DP
> performance, see first
> slide:
> http://www.brightsideofnews.com/news/2012/8/9/can-the-new-nvidia-quadro-
> k5000-become-the-most-profitable-graphics-card.aspx
>
> That would mean 3x665 DP
> GFLOPS (M2090) => 1.995 TFLOP DP, which sounds almost too good to be true and
> would require a frequency of roughly 1 Ghz...
Honestly I would expect around 1.3 TFLOP/s. But the real question is at what cost in terms of power?
Achieving 2 TFLOP/s might be feasible, but within 300W? That seems unlikely.
David
> David Kanter (dkanter.delete@this.realworldtech.com) on July 25, 2012 1:37 am
> wrote:
> > New computational efficiency data shows GPUs with a clear edge over
> CPUs, but
> > the gap is narrowing as CPUs adopt wide vectors (e.g. AVX).
> Surprisingly, a
> > throughput CPU is the most energy efficient processor,
> offering hope for future
> > architectures. Our data also shows some
> advantages of AMD's Bulldozer, and the
> > overhead associated with highly
> scalable server CPUs.
> >
> > Comments and feedback
> > welcome!
> >
>
> > David
>
> Nvidia claims that GK110 (K20) will have a 3x improvement in DP
> performance, see first
> slide:
> http://www.brightsideofnews.com/news/2012/8/9/can-the-new-nvidia-quadro-
> k5000-become-the-most-profitable-graphics-card.aspx
>
> That would mean 3x665 DP
> GFLOPS (M2090) => 1.995 TFLOP DP, which sounds almost too good to be true and
> would require a frequency of roughly 1 Ghz...
Honestly I would expect around 1.3 TFLOP/s. But the real question is at what cost in terms of power?
Achieving 2 TFLOP/s might be feasible, but within 300W? That seems unlikely.
David



