By: Linus Torvalds (torvalds.delete@this.linux-foundation.org), April 20, 2012 1:24 pm
Room: Moderated Discussions
bakaneko (nyan@hyan.wan) on 4/20/12 wrote:
>
>Or the much simpler explanation that the FPU doubled in the
>amount of registers and better opcodes which made it
>faster...
Yeah, looking some more at the particular benchmarks, it
looks like the ones that improved in a big way are all
things that might just be AVX or FMA or something.
So maybe it's not so much a fragile uarch as just
specialized benchmarks (and perhaps a sign that gcc-4.7
does a reasonable job of vectorization).
Linus



