By: jp (asdasdf.delete@this.gmail.com), August 13, 2012 2:02 am
Room: Moderated Discussions
EBFE (x.delete@this.y.com) on August 12, 2012 8:27 pm wrote:
> jp (asdfasdf.delete@this.gmail.com) on August 10, 2012 1:23 am wrote:
> > Eric
> (eric.kjellen.delete@this.gmail.com) on August 9, 2012 5:12 pm wrote:
> >
> >
> > jp (asdfasdf.delete@this.gmail.com) on August 9, 2012 1:58 pm
> wrote:
> > > >
> > EBFE
> > > (x.delete@this.y.com) on July 25,
> 2012 3:25 am wrote:
> > > >
> > > GCN
> > > >
> > >
>
> > > >
> > > > Yes I
> > > > didn't see
> > AMD:s
> GCN arch in the charts?
> > > >
> > > >
> > > GCN is the
> first
> > single chip ever to
> > > > push 1 TFLOP DP. And the new
>
> > > professional
> > edition has
> > > > ECC.
> > >
> >
> > > >
> > >
> >
> http://www.hpcwire.com/hpcwire/2012-08-08/amd_unveils_teraflop_gpu_with_
> >
> >
> > >
> > > ecc_support.html?featured=top
> > >
> >
> > That's very impressive,
> > particularly with TDP
> > > at 274W.
> Though we should probably expect Tesla K20
> > to arrive soon and deliver
>
> > > higher performance in a comparable power
> > envelope. I wonder
> why AMD hasn't
> > > released or announced an HPC oriented
> > compute
> card based on the same GPU. Could
> > > the reason be that AMD
> software
> > tools are still immature in comparison with
> > > Nvidia's
> and Intel's
> > offerings?
> >
> >
> > Well I've been working with
> their tools using OpenCL on AMD
> > cards and it is currently far more
> immature when compared to working with CUDA
> > on Nvidia cards when it comes
> to driver reliability, developer environment,
> > documentation, and feature
> set.
>
> Can you elaborate it? esp. 'feature set'
On feature set:
- Direct access to texture features for fast interpolations
- Warp voting functions
- certain floating point atomic instructions*
- Register shuffle instructions (direct exchange of data via registers instead of via smem )
- GPUDirect, DMA between GPUs
I'm sure I can lengthen this list if I spend some more time on it.
* ex atomicAdd(float* ), might have been introduced recently
> jp (asdfasdf.delete@this.gmail.com) on August 10, 2012 1:23 am wrote:
> > Eric
> (eric.kjellen.delete@this.gmail.com) on August 9, 2012 5:12 pm wrote:
> >
> >
> > jp (asdfasdf.delete@this.gmail.com) on August 9, 2012 1:58 pm
> wrote:
> > > >
> > EBFE
> > > (x.delete@this.y.com) on July 25,
> 2012 3:25 am wrote:
> > > >
> > > GCN
> > > >
> > >
>
> > > >
> > > > Yes I
> > > > didn't see
> > AMD:s
> GCN arch in the charts?
> > > >
> > > >
> > > GCN is the
> first
> > single chip ever to
> > > > push 1 TFLOP DP. And the new
>
> > > professional
> > edition has
> > > > ECC.
> > >
> >
> > > >
> > >
> >
> http://www.hpcwire.com/hpcwire/2012-08-08/amd_unveils_teraflop_gpu_with_
> >
> >
> > >
> > > ecc_support.html?featured=top
> > >
> >
> > That's very impressive,
> > particularly with TDP
> > > at 274W.
> Though we should probably expect Tesla K20
> > to arrive soon and deliver
>
> > > higher performance in a comparable power
> > envelope. I wonder
> why AMD hasn't
> > > released or announced an HPC oriented
> > compute
> card based on the same GPU. Could
> > > the reason be that AMD
> software
> > tools are still immature in comparison with
> > > Nvidia's
> and Intel's
> > offerings?
> >
> >
> > Well I've been working with
> their tools using OpenCL on AMD
> > cards and it is currently far more
> immature when compared to working with CUDA
> > on Nvidia cards when it comes
> to driver reliability, developer environment,
> > documentation, and feature
> set.
>
> Can you elaborate it? esp. 'feature set'
On feature set:
- Direct access to texture features for fast interpolations
- Warp voting functions
- certain floating point atomic instructions*
- Register shuffle instructions (direct exchange of data via registers instead of via smem )
- GPUDirect, DMA between GPUs
I'm sure I can lengthen this list if I spend some more time on it.
* ex atomicAdd(float* ), might have been introduced recently



