By: Alberto (git.delete@this.git.it), August 26, 2015 6:26 am
Room: Moderated Discussions
juanrga (nospam.delete@this.juanrga.com) on August 26, 2015 5:49 am wrote:
> Anon (nope.delete@this.nope.com) on August 26, 2015 1:23 am wrote:
> > juanrga (nospam.delete@this.juanrga.com) on August 25, 2015 6:01 pm wrote:
> > > Nvidia talk at ISC2015 was much more interesting. They compared
> > > two KNL CPUs against Power+CUDA using Amdahl's
> > > law. At 98% parallel the KNL was competitive. At 90% parallel
> > > work the KNL system was about two times slower than
> > > the Power+CUDA system: ~2 min vs 4.5 min. At 70% parallel work, the KNL system was more than 3x slower.
> > >
> > > Wider vector units and less cores had worked better.
> >
> > KNL will still be available as a PCIe card- SKX & KNL combo systems
> > are entirely feasible, for workloads that fit that paradigm.
>
> The card version is for legacy customers. New systems will favor the CPU version.
> Nvidia was comparing Summit and Aurora configurations and how Aurora will require
> one order of magnitude more nodes to achieve similar performance.
Even the Host version of KNL can be paired with Xeons in the node :), Skylake Xeon and hopefully even Broadwell Xeon will have the right dedicated fabric.
Why to rise the node number??
> Anon (nope.delete@this.nope.com) on August 26, 2015 1:23 am wrote:
> > juanrga (nospam.delete@this.juanrga.com) on August 25, 2015 6:01 pm wrote:
> > > Nvidia talk at ISC2015 was much more interesting. They compared
> > > two KNL CPUs against Power+CUDA using Amdahl's
> > > law. At 98% parallel the KNL was competitive. At 90% parallel
> > > work the KNL system was about two times slower than
> > > the Power+CUDA system: ~2 min vs 4.5 min. At 70% parallel work, the KNL system was more than 3x slower.
> > >
> > > Wider vector units and less cores had worked better.
> >
> > KNL will still be available as a PCIe card- SKX & KNL combo systems
> > are entirely feasible, for workloads that fit that paradigm.
>
> The card version is for legacy customers. New systems will favor the CPU version.
> Nvidia was comparing Summit and Aurora configurations and how Aurora will require
> one order of magnitude more nodes to achieve similar performance.
Even the Host version of KNL can be paired with Xeons in the node :), Skylake Xeon and hopefully even Broadwell Xeon will have the right dedicated fabric.
Why to rise the node number??