Thoughts on "Improving the Utilization of µop Caches..."

By: anon, October 15, 2020 10:57 am
Room: Moderated Discussions
anon on October 15, 2020 11:56 am wrote:
> Paul A. Clayton on October 14, 2020 12:11 pm wrote:


> > With µop cache entries being extended basic blocks, I would have guessed that rename optimization
> > would be applied. (One can determine which sources are actually written by previous instructions
> > in the extended basic block and replace them with destination numbers, removing the need to check
> > dependencies. One could also re-order µops, at least within a basic block, to reduce routing
> > energy or provide some other benefit. If a banked register alias table was used, bank conflicts
> > could be cached [such might also introduce another cause of entry termination — if bandwidth
> > is limited at the RAT, higher µop cache fetch bandwidth would be less useful].)
> >
> > I do not mean to be unfairly negative (and I hope my negative
> > comments are not cruel or even just insensitive).
> > I know it is easier to find fault than to provide superior alternatives, easy to ignore limited resources
> > available for research, and easy to see creative ideas as less stunningly creative in hindsight. Since
> > I have not written a single research paper (not even a survey),
> > I am likely less sensitive to the difficulties.
> > My expectations may also be unrealistic; not every expert in the field is more competent than I am in
> > every aspect of the field (even though I am just a hobbyist), not every paper can be so clearly written
> > that readers feel like geniuses because it was so easy to understand, not every paper is so thorough
> > and well organized that readers feel the authors are reading their minds when objections or enhancements
> > are addressed just as they come to the reader's mind.
> >
> > Perhaps something positive will come from this post.
> Thank you for this detailed analysis. I haven't read the paper thoroughly yet, but I wanted
> to discuss one of your comments about reordering uops and rename optimizations.
> Regarding reordering, the problem is that you cannot generally rename out of order. Although
> this might not seem to have any impact at first glance (what is the difference between "add rax, rcx;
> ld rbx, [rdx]; add r12, rax" and "ld rbx, [rdx]; add rax, rcx; add r12, rax"?), I think it gets messy
> if you want precise exceptions/interrupts. So you probably can rename out of order, but you need
> to map those out-of-order mappings back to a ROB-like structure that is allocated earlier than rename
> in the pipeline, and that might be weird. However, it is an interesting thought because I know of some
> designs where the RAT is port-limited, so rename groups with at most x reads/writes have to be formed,
> which may not match what is coming out of Decode (the compiler could help, though).
> On the rename optimization thing ("rewriting"): I am not
> sure I followed the idea. Could you please elaborate?
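
For what it's worth, here is one possible reading of the "rewriting" idea quoted above, as a minimal Python sketch rather than anything taken from the paper or a real design. It assumes a µop can be modeled as a (destination, sources) pair, and the name `prelink_sources` is made up for illustration. When a µop cache entry is built, each source that is written earlier in the same entry is replaced by the index of its producer, so only live-in sources would still need a RAT lookup at rename time.

```python
from typing import Dict, List, Optional, Tuple, Union

# Hypothetical µop model: (destination register or None, list of source registers).
Uop = Tuple[Optional[str], List[str]]


def prelink_sources(block: List[Uop]) -> List[List[Union[int, str]]]:
    """Replace each source written earlier in the block by its producer's index.

    Sources that stay as register names are live-ins to the block and would
    still have to be looked up in the register alias table.
    """
    last_writer: Dict[str, int] = {}  # register -> index of latest producer in block
    linked: List[List[Union[int, str]]] = []
    for idx, (dest, srcs) in enumerate(block):
        # Read sources before recording this µop's own write to its destination.
        linked.append([last_writer.get(s, s) for s in srcs])
        if dest is not None:
            last_writer[dest] = idx
    return linked


# The three-instruction example from the post, in both orders.
in_order = [
    ("rax", ["rax", "rcx"]),  # add rax, rcx
    ("rbx", ["rdx"]),         # ld  rbx, [rdx]
    ("r12", ["r12", "rax"]),  # add r12, rax
]
reordered = [in_order[1], in_order[0], in_order[2]]

print(prelink_sources(in_order))   # [['rax', 'rcx'], ['rdx'], ['r12', 0]]
print(prelink_sources(reordered))  # [['rdx'], ['rax', 'rcx'], ['r12', 1]]
```

In both orders the final add depends on the same producer; only the index of that producer within the entry changes. The sketch says nothing about how precise exceptions or ROB allocation would be handled, which is the concern raised above.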
