By: Jörn Engel (joern.delete@this.purestorage.com), May 15, 2022 11:34 pm
Room: Moderated Discussions
Simon Farnsworth (simon.delete@this.farnz.org.uk) on May 14, 2022 5:20 am wrote:
>
> SVE2's big advantage over NEON is not wider vectors, but all the compiler-convenience features
> it has that allow a compiler to be more aggressive about auto-vectorization. For people who
> are hand-tuning codes for peak performance, SVE2 at 128 bit and NEON are about the same, but
> SVE2 pulls ahead handily (due to the FFR register and associated instructions) when you're
> writing "serial" code and relying on the compiler doing something sensible to it.
Why do you think that tail handling is less of a problem for manually written vector code?
Asking for a friend.
>
> SVE2's big advantage over NEON is not wider vectors, but all the compiler-convenience features
> it has that allow a compiler to be more aggressive about auto-vectorization. For people who
> are hand-tuning codes for peak performance, SVE2 at 128 bit and NEON are about the same, but
> SVE2 pulls ahead handily (due to the FFR register and associated instructions) when you're
> writing "serial" code and relying on the compiler doing something sensible to it.
Why do you think that tail handling is less of a problem for manually written vector code?
Asking for a friend.