By: Eric Fink (eric.delete@this.anon.com), June 5, 2022 4:31 am
Room: Moderated Discussions
Linus Torvalds (torvalds.delete@this.linux-foundation.org) on June 4, 2022 10:17 am wrote:
> I have the exact reverse reaction.
>
> Text sequences are usually quite short. The whole "I have
> gigabytes of JSON" seems a very artificial example.
>
Oh, I fully agree with everything you said, I should have been more precise what I mean with "longish". When I was expression my concerns about viability of vector-based ISA for low-latency computations, I was thinking about really small data structures, like geometric primitives that often fit into a couple of floats. Bulk-processed strings are often at least some orders of magnitudes longer, which is what I had in mind when I said "longish".
But then again that might not even be the case. JSON requests can be quite short. Same for UTF-8 validation. It's just of you have to validate a 12-byte string, the performance impact might not be large enough to justify crazy optimisations. But if you are loading a larger (several KBs+) text file, a vector approach will probably help out a lot, even if there is a non-trivial setup cost.
> I have the exact reverse reaction.
>
> Text sequences are usually quite short. The whole "I have
> gigabytes of JSON" seems a very artificial example.
>
Oh, I fully agree with everything you said, I should have been more precise what I mean with "longish". When I was expression my concerns about viability of vector-based ISA for low-latency computations, I was thinking about really small data structures, like geometric primitives that often fit into a couple of floats. Bulk-processed strings are often at least some orders of magnitudes longer, which is what I had in mind when I said "longish".
But then again that might not even be the case. JSON requests can be quite short. Same for UTF-8 validation. It's just of you have to validate a 12-byte string, the performance impact might not be large enough to justify crazy optimisations. But if you are loading a larger (several KBs+) text file, a vector approach will probably help out a lot, even if there is a non-trivial setup cost.