MEM : ALU ratio

Article: Knights Landing Details
By: Nicolas Capens (nicolas.capens.delete@this.gmail.com), January 10, 2014 11:24 pm
Room: Moderated Discussions
David Kanter (dkanter.delete@this.realworldtech.com) on January 10, 2014 4:06 pm wrote:
> Nicolas Capens (nicolas.capens.delete@this.gmail.com) on January 10, 2014 2:22 pm wrote:
> > David Kanter (dkanter.delete@this.realworldtech.com) on January 9, 2014 6:42 pm wrote:
> > > > > I'll further add that I'm willing to wager that I'm correct about the 2 load
> > > > > pipelines, and that Nicolas is wrong about a virtually addressed L0 cache.
> > > >
> > > > What exactly do you mean by 2 load pipelines? If you mean that both vector units would be
> > > > able to use a memory operand in the same cycle, then yes, based on Eric's code that seems
> > > > to be a necessity. However due to the duplicate loading of the same memory this can be provided
> > > > by a dual-ported L0 cache, while the L1 cache itself only requires a single load port.
> > >
> > > I mean that the KNL core has two AGUs, both VPUs can execute
> > > load+op every cycle and that the L1D cache can fill
> > > two hits per cycle. I do not believe the cache is dual ported, as I believe it will be heavily banked.
> >
> > As far as I know that counts as a dual-ported cache.
>
> No it doesn't.

It's what my professor called it: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.3602

What did your professor call the thingies that determine the maximum number of simultaneous accesses, if not ports?

> > Anyway, a multi-ported multi-banked L1 cache is a reasonable possibility. I just don't see why it "must" be
> > the only possibility, especially with x86 being a load-op
> > architecture as the only explanation in your article.
>
> > Given that 1 byte/FLOP would suffice and that the code generated
> > by ICC has a lot of duplicate memory accesses,
> > it also seems reasonable to me that there's a single-ported L1 cache and a dual-ported L0 cache.
>
> First, 1B/FLOP isn't sufficient. Look at KNC, it can source 1x64B operand from memory
> and 2 from registers while performing 8 FMAs. That's 8B for 2 operations or 4B/FLOP.

I was talking about single-precision FLOPs. And it's obvious that KNC had 2 bytes/SPFLOP, since providing any less would have required cutting that bus in half and sequencing things, which adds more complexity overall. So KNC isn't the gold standard here. Doubling the number of execution units does not mean the number of memory load ports has to be doubled. Kepler has one load port per six FMA units. I'm not saying that's the best ratio, but it's a strong indication that 2 bytes/SPFLOP is completely unnecessary.
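To make the bytes/FLOP figures concrete, here is a back-of-the-envelope sketch in Python. It assumes 512-bit vector units, one 64-byte memory operand per L1 load port per cycle, and an FMA counted as two FLOPs. The function name and the two KNL configurations are illustrative assumptions, not disclosed specifications.

```python
def bytes_per_flop(load_bytes_per_cycle, vector_units, vector_bits, element_bits):
    """L1 load bandwidth per FLOP, assuming every lane performs one FMA per cycle."""
    lanes = vector_bits // element_bits
    flops_per_cycle = vector_units * lanes * 2  # one FMA = 2 FLOPs
    return load_bytes_per_cycle / flops_per_cycle

# KNC: one 512-bit VPU, one 64-byte memory operand per cycle.
knc_dp = bytes_per_flop(64, 1, 512, 64)   # 4.0 bytes per double-precision FLOP
knc_sp = bytes_per_flop(64, 1, 512, 32)   # 2.0 bytes per single-precision FLOP

# Hypothetical KNL options: two VPUs fed by one or by two L1 load ports.
knl_sp_one_port = bytes_per_flop(64, 2, 512, 32)    # 1.0 byte/SPFLOP
knl_sp_two_ports = bytes_per_flop(128, 2, 512, 32)  # 2.0 bytes/SPFLOP

print(knc_dp, knc_sp, knl_sp_one_port, knl_sp_two_ports)
```

Under these assumptions, the 4B/FLOP and 2 bytes/SPFLOP figures describe the same KNC hardware, and a single L1 load port feeding two VPUs works out to 1 byte/SPFLOP.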

> Haswell also sources 64B/cycle while performing 8 FMAs.

Yes, and it has a third and fourth arithmetic execution port, neither of which can take a memory source operand if the other two already use the two L1 read ports. So that's a 2:4 ratio, while I'm suggesting a 1:2 ratio for KNL, augmented with an L0 cache so you can actually have a 1:1 ratio to lower register pressure while staying power efficient.

> So if you look at these designs, it's very clear that Intel believes that 4B/FLOP is the right
> amount of L1D bandwidth. Frankly, you can analyze compiler output all you want...but Intel
> invested millions in making the right decision and that's infinitely more convincing.

So what do you think NVIDIA and AMD invested in making the right decision?

Looking at typical x86 assembly output teaches us a lot of things. Not every instruction takes a memory source operand. The ratio is rarely over 1:2 when the accesses are unique, and having 32 registers helps a lot to ensure that for AVX-512. However, Eric's experiments show that the Intel compiler for KNL isn't shy about using as many memory operands as possible. And knowing how much an L1 access costs in power, that doesn't make sense unless they have 'something' to make it cheap for very locally reused data. I call that something an L0 cache.

> > L1 accesses cost a considerable amount of power (http://www.eecs.harvard.edu/~shao/papers/shao2013-islped.pdf),
> >
> > and if Intel wants to win the exascale race then they have to consider anything that might
> > offer an improvement in performance/Watt. An efficient L0 cache and tuning the compiler
> > to maximize its use might be part of such innovations.
>
> It won't be. L1 caches are fine for KNL and the code you're
> seeing is probably the result of an immature compiler.

The Intel compiler has been around for a long time, and AVX-512 is a very minor variation on KNC's 512-bit ISA extension. Also, as someone pointed out elsewhere, you need a somewhat representative compiler to generate traces that can be used during the design of the new architecture. And lastly, this behavior isn't coming from some complicated experimental optimization. This is part of basic register allocation, and for decades the overall goal has been to minimize the number of memory operands. To suddenly see that basic rule reversed cannot possibly be something Intel has overlooked. It's a deliberate choice and we have to find the explanation in the micro-architecture.

> Caches might cost power, but the real problem is large scale coherency. The L1D is fine.

The paper above says it takes 40% more power. That's a huge deal. Even if large scale coherency is a bigger problem, you can't ignore this. 40% is a lot of opportunity for doing something a little different when doubling the number of execution units, like, an L0 cache perhaps? Even if the L0 cache itself costs 10% of power per access, that's a huge saving over a second L1 access.
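A rough sketch of that trade-off, in Python. It reads the 40% and 10% figures as the extra energy of a memory operand relative to a register-only instruction, and it assumes an L0 miss pays for the L0 lookup plus the full L1 access. The hit-rate parameter and both function names are hypothetical, introduced only for illustration.

```python
def extra_energy_per_access(l0_hit_rate, l0_cost=0.10, l1_cost=0.40):
    """Expected extra energy of a memory operand when an L0 sits in front of the L1.

    Every access pays the L0 lookup; only misses also pay the L1 access.
    Costs are fractions of the energy of a register-only instruction.
    """
    return l0_cost + (1.0 - l0_hit_rate) * l1_cost

def break_even_hit_rate(l0_cost=0.10, l1_cost=0.40):
    """L0 hit rate above which the L0 path costs less than always going to the L1."""
    return l0_cost / l1_cost

for hit_rate in (0.0, 0.25, 0.5, 1.0):
    print(hit_rate, extra_energy_per_access(hit_rate))
```

With these assumed numbers the L0 pays for itself once more than a quarter of the memory operands hit in it, and a perfect hit rate brings the overhead down from 40% to 10%.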

> > > I'm willing to wager money that there is no L0 as you have described it.
> >
> > Gamblers wager a lot of money even though they know they have less than a 50% chance of winning
> > but choose to ignore it. So your money doesn't mean anything to me even though you say this
> > must be the only option for KNL. I won't wager anything because to me it's a coin on its
> > side. I think a dual-ported L1 is not the only option. That doesn't mean I think the other
> > option is more likely. And neither does that doubt mean I think it's any less likely.
>
> What I hear is that you aren't very confident of the L0 being the right
> solution, whereas I'm highly confident that it is the wrong solution.

There's a difference between it being the wrong solution and two L1 read ports being the right solution. The L0 cache is just one way Intel might avoid a 40% increase in power consumption, but it's the most likely one I've been able to come up with so far based on the compiler output. That you think that output is "probably" the result of an immature compiler, which it clearly isn't, is a much more interesting expression of doubt. So what's your other explanation?
                                                  evil questionMichael S2014/01/08 01:37 PM
                                                    stupid question (was: evil question)hobold2014/01/09 04:41 AM
                                                      stupid question (was: evil question)Eric Bron2014/01/09 04:52 AM
                                                        stupid question (was: evil question)Michael S2014/01/09 07:00 AM
                                                          stupid question (was: evil question)Michael S2014/01/09 07:12 AM
                                                            stupid question (was: evil question)Eric Bron2014/01/09 09:47 AM
                                                              stupid question (was: evil question)Michael S2014/01/09 10:48 AM
                                                                more decisive (hopefully) test caseMichael S2014/01/09 11:01 AM
                                                                  more decisive (hopefully) test caseEric Bron2014/01/09 11:08 AM
                                                                    more decisive (hopefully) test caseMichael S2014/01/09 11:24 AM
                                                                      more decisive (hopefully) test caseEric Bron2014/01/09 11:27 AM
                                                                        more decisive (hopefully) test caseMichael S2014/01/09 11:33 AM
                                                                  AVX2Eric Bron2014/01/09 11:14 AM
                                                                    AVX2Michael S2014/01/09 11:30 AM
                                                                      AVX2Eric Bron2014/01/09 11:40 AM
                                                                  another tryMichael S2014/01/09 02:02 PM
                                                                    another tryEric Bron2014/01/09 02:33 PM
                                                                      another tryMichael S2014/01/09 03:20 PM
                                                                      another try - ignore misformated mess aboveMichael S2014/01/09 03:24 PM
                                                                        another try - ignore misformated mess aboveGabriele Svelto2014/01/10 12:01 AM
                                                                          another try - ignore misformated mess aboveEric Bron2014/01/10 02:05 AM
                                                                            another try - ignore misformated mess aboveMichael S2014/01/11 09:23 AM
                                                                              another try - ignore misformated mess aboveEric Bron2014/01/11 10:08 AM
                                                                                another try - ignore misformated mess aboveMichael S2014/01/11 11:09 AM
                                                                                  another try - ignore misformated mess aboveMichael S2014/01/11 11:12 AM
                                                                                    another try - ignore misformated mess aboveEric Bron2014/01/11 11:24 AM
                                                                                      another try - ignore misformated mess aboveMichael S2014/01/11 12:24 PM
                                                                                        another try - ignore misformated mess aboveEric Bron2014/01/11 01:11 PM
                                                                                          another try - ignore misformated mess aboveMichael S2014/01/11 01:18 PM
                                                                                            another try - ignore misformated mess aboveEric Bron2014/01/11 01:27 PM
                                                                                              another try - ignore misformated mess aboveMichael S2014/01/11 01:29 PM
                                                                                                another try - ignore misformated mess aboveEric Bron2014/01/11 01:46 PM
                                                                                                  another try - ignore misformated mess aboveEric Bron2014/01/11 01:46 PM
                                                                                                  another try - ignore misformated mess aboveMichael S2014/01/11 02:28 PM
                                                                                        another try - ignore misformated mess aboveEric Bron2014/01/11 01:17 PM
                                                                                          another try - ignore misformated mess aboveMichael S2014/01/11 01:24 PM
                                                                    KNC versionMichael S2014/01/11 04:19 PM
                                                                      KNC versionEric Bron nli2014/01/12 01:59 AM
                                                                        KNC versionGabriele Svelto2014/01/12 08:06 AM
                                                  evil questionEric Bron2014/01/08 01:41 PM
              Knights Landing L/S bandwidthPatrick Chase2014/01/05 10:20 PM
                Knights Landing L/S bandwidthEric Bron2014/01/06 01:45 AM
                  Knights Landing L/S bandwidthanon2014/01/06 03:12 AM
                    Knights Landing L/S bandwidthMichael S2014/01/06 03:17 AM
                      Knights Landing L/S bandwidthanon2014/01/06 04:20 AM
          Knights Landing L/S bandwidthNicolas Capens2014/01/04 04:34 PM
            Knights Landing L/S bandwidthEric Bron2014/01/04 04:44 PM
              Knights Landing L/S bandwidthNicolas Capens2014/01/05 11:25 AM
                Knights Landing L/S bandwidthEric Bron2014/01/05 12:50 PM
                  Knights Landing L/S bandwidthNicolas Capens2014/01/05 02:34 PM
                    Might even help with gatherNicolas Capens2014/01/05 02:40 PM
                      What is an L0 cache?David Kanter2014/01/05 09:44 PM
                        What is an L0 cache?anon2014/01/06 04:57 AM
                          What is an L0 cache?Nicolas Capens2014/01/06 11:57 AM
                            What is an L0 cache?anon2014/01/06 01:18 PM
    Knights Landing L/S bandwidthDavid Kanter2014/01/04 09:58 AM
      Knights Landing L/S bandwidthNicolas Capens2014/01/04 03:24 PM
        Knights Landing L/S bandwidthEric Bron2014/01/04 03:46 PM
          Knights Landing L/S bandwidthKonrad Schwarz2014/01/07 11:48 PM
            Knights Landing L/S bandwidthMichael S2014/01/08 01:45 AM
        Knights Landing L/S bandwidthDavid Kanter2014/01/05 12:44 AM
          Knights Landing L/S bandwidthEric Bron2014/01/05 02:55 AM
          Knights Landing L/S bandwidthNicolas Capens2014/01/05 11:18 AM
            Knights Landing L/S bandwidthMaynard Handley2014/01/05 10:33 PM
              Knights Landing L/S bandwidthEric Bron2014/01/06 03:02 AM
                Knights Landing L/S bandwidthMichael S2014/01/06 03:23 AM
                  Knights Landing L/S bandwidthEric Bron2014/01/06 03:35 AM
                    Knights Landing L/S bandwidthMichael S2014/01/06 04:20 AM
                      Knights Landing L/S bandwidthMichael S2014/01/06 04:32 AM
                      Knights Landing L/S bandwidthEric Bron2014/01/06 04:36 AM
                        Knights Landing L/S bandwidthMichael S2014/01/06 05:00 AM
                          Knights Landing L/S bandwidthEric Bron2014/01/06 05:07 AM
                          Knights Landing L/S bandwidthEric Bron2014/01/06 05:14 AM
                            editsEric Bron2014/01/06 05:22 AM
                              optimized versionEric Bron2014/01/06 05:35 AM
                                yet more optimized versionEric Bron2014/01/06 05:42 AM
                                  latest version for todayEric Bron2014/01/06 05:51 AM
                                    Probably just L2 bandwith limitedNicolas Capens2014/01/06 10:48 AM
                                  yet more optimized versionMaynard Handley2014/01/06 05:54 PM
                                optimized versionMaynard Handley2014/01/06 05:52 PM
                                  optimized versionMichael S2014/01/07 09:42 AM
                                    optimized versionNicolas Capens2014/01/07 11:36 AM
                                      optimized versionMichael S2014/01/07 02:41 PM
                                        optimized versionNicolas Capens2014/01/07 09:52 PM
                                          optimized versionMichael S2014/01/08 01:10 AM
                                    optimized versionEric Bron2014/01/07 01:34 PM
                                      optimized versionMichael S2014/01/07 02:18 PM
                                        optimized versionEric Bron2014/01/07 02:30 PM
                                          optimized versionEric Bron2014/01/07 02:33 PM
                                            optimized versionMichael S2014/01/07 02:57 PM
                                    optimized versionMaynard Handley2014/01/07 05:50 PM
                                      optimized versionMichael S2014/01/08 01:39 AM
                Knights Landing L/S bandwidthMaynard Handley2014/01/06 05:47 PM
              Knights Landing L/S bandwidthNicolas Capens2014/01/06 08:18 AM
                Knights Landing L/S bandwidthMaynard Handley2014/01/06 05:56 PM
                  Knights Landing L/S bandwidthNicolas Capens2014/01/07 11:18 AM
        Knights Landing L/S bandwidthNoSpammer2014/01/05 12:15 PM
          Knights Landing L/S bandwidthNicolas Capens2014/01/05 02:06 PM
            Knights Landing L/S bandwidthNoSpammer2014/01/06 03:20 AM
              Knights Landing L/S bandwidthNicolas Capens2014/01/06 10:54 AM
                Knights Landing L/S bandwidthNoSpammer2014/01/06 12:24 PM
                  Knights Landing L/S bandwidthNicolas Capens2014/01/06 08:15 PM
                    Knights Landing L/S bandwidthNoSpammer2014/01/07 02:58 AM
                      Knights Landing L/S bandwidthNicolas Capens2014/01/07 02:18 PM
                        Knights Landing L/S bandwidthNoSpammer2014/01/08 12:38 PM
                          Knights Landing L/S bandwidthNicolas Capens2014/01/08 10:14 PM
  AVX512F questionMichael S2014/01/06 09:18 AM
    AVX512F questionNicolas Capens2014/01/06 11:01 AM