better test desired. This one is not pushing FPU to the limits

By: Per Hesselgren (perhesselgren.delete@this.yahoo.se), March 19, 2017 8:49 am
Room: Moderated Discussions
Michael S (already5chosen.delete@this.yahoo.com) on March 19, 2017 4:28 am wrote:
> Per Hesselgren (perhesselgren.delete@this.yahoo.se) on March 19, 2017 3:06 am wrote:
> > Michael S (already5chosen.delete@this.yahoo.com) on March 18, 2017 10:21 am wrote:
> > > Per Hesselgren (perhesselgren.delete@this.yahoo.se) on March 17, 2017 8:49 am wrote:
> > > >
> > > > Now I have got a Ryzen 1700 myself so I have some results.
> > > > This is the single thread matrix multiply:
> > > >
> > > > Algorithm Ivy Bridge Excavator Ryzen
> > > > ----n 8,09 8,25 5,22
> > > > ----v 7,91 7,59 5,58
> > > > ----u 7,79 4,56 2,5
> > > > ----p 8,06 7,74 5,27
> > > > ----t 3,08 6,35 4,94
> > > > ----i 1,58 2,5 1,31
> > > > ----b 4,19 6,26 3,87
> > > > ----m 1,39 3,08 1,15
> > > > ----w 2,22 3,66 1,97
> > > > ----r 3,09 6,2 4,94
> > > >
> > > > The times in secs are not so interesting as the clocks are all different.
> > > > But if we use the -n algorithm time as 100% index we get:
> > > > Algorithm Ivy Bridge Excavator Ryzen
> > > > -----n 100 100 100
> > > > -----v 98 92 107
> > > > -----u 96 55 48
> > > > -----p 100 94 101
> > > > -----t 38 77 95
> > > > -----i 20 30 25
> > > > -----b 52 76 74
> > > > -----m 17 37 22
> > > > -----w 27 44 38
> > > > -----r 38 75 95
> > >
> > > Can you report it in FLOPs/core and FLOPs/(core*Hz) ?
> > >
> > > Results for Algorithm m will be sufficient, the rest of them are obviously doing something wrong.
> > >
> > As we have N³ multiplies and N³-N² adds 1 sec means around 2 GFLOPS.
>
> So, the time reported is for a single multiplication of 1000x1000 matrices?
>
> > -m Ivy Bridge=1.44 Excavator=0.65 Ryzen=1,74
> > This is all single thread with clocks 3.3, 3.5 and 3.7 GHz
> > -m Ivy Bridge=0.44 Excavator=0.19 Ryzen=0.47 GFLOPS/GHz
>
> Thank you.
>
> >
> > This is an old compiler so I made some tests with 32-bit GCC for Ryzen:
> > -i was the best with 0.78 secs
>
> Still only 0.69 FLOPs/(core*Hz)
>
> > For Raspberry Pi (1.2 GHz)
> > -r was the best with 5.03 secs followed by
> > -t at 5.04 secs
> >
>
> All x86 scores are EXTREMELY low. The test appears to not push IvyB/Excavator/Ryzen FPUs at all. The
> bottleneck is somewhere else. Most likely, compiler does not utilize SIMD at all. But even without
> SIMD and without FMA at FPU level all this cores shell be capable of ~1.5-1.8 FLOPs/(core*Hz)
>
> As to Raspberry Pi, I don't know if it is pushing FPU or not.
> Is your Raspberry Pi a BCM2837 running in 64-bit mode? ARM Cortex A53?
> I didn't find FPU throughput numbers in A53 TRM. I am sure that
> other RWT posters (Wilco? none? Exophase?) can tell us.
>
>

Yes this is not like a flops test. The effect of -msse2 in Linux is very small.
It is more like a test of cache performance if DN=1000. You can test DN=100 for example... Here you have a better flops test for L1 cache:
http://home.claranet.nl/users/mhx/flops.c
You also have all the alternative timers.
< Previous Post in ThreadNext Post in Thread >
TopicPosted ByDate
Is Ryzen 6-wide?Travis2017/03/03 06:27 PM
  Is Ryzen 6-wide?Maynard Handley2017/03/03 06:50 PM
  AMD describes it as 6 wideVertexMaster2017/03/03 11:27 PM
    AMD describes it as 6 wideExophase2017/03/04 12:21 PM
      AMD describes it as 6 wideanon2017/03/04 12:54 PM
        AMD describes it as 6 wideTravis2017/03/04 01:12 PM
          AMD describes it as 6 wideTravis2017/03/04 02:05 PM
            AMD describes it as 6 wideExophase2017/03/04 05:16 PM
              AMD describes it as 6 wideMichael S2017/03/05 10:27 AM
              AMD describes it as 6 wideTravis2017/03/05 12:45 PM
                AMD Ryzen inst lat and tputPer Hesselgren2017/03/07 12:00 AM
                  AMD Ryzen inst lat and tputTravis2017/03/07 01:50 PM
                    AMD Ryzen inst lat and tputGabriele Svelto2017/03/08 04:37 AM
                      AMD Ryzen inst lat and tputTravis2017/03/08 12:03 PM
                  AMD Ryzen inst lat and tputTravis2017/03/07 02:01 PM
                    Sorry, failed to properly close code tag :( (NT)Travis2017/03/07 02:02 PM
                    AMD Ryzen inst lat and tputPer Hesselgren2017/03/07 02:48 PM
                      AMD Ryzen inst lat and tputTravis2017/03/07 03:16 PM
                      AMD Ryzen inst lat and tputPer Hesselgren2017/03/07 03:19 PM
                        AMD Ryzen inst lat and tputPer Hesselgren2017/03/18 07:50 AM
                          AMD Ryzen inst lat and tputPer Hesselgren2017/03/18 08:05 AM
                            AMD Ryzen inst lat and tput-7zipPer Hesselgren2017/03/25 03:48 AM
                  FMA tput looks like a mistakeMichael S2017/03/12 03:24 AM
                    FMA tput looks like a mistakeGian-Carlo Pascutto2017/03/14 01:27 AM
                      FMA tput looks like a mistakeMichael S2017/03/14 03:39 AM
                        FMA tput looks like a mistakeGian-Carlo Pascutto2017/03/14 07:36 AM
                          FMA tput looks like a mistakeMichael S2017/03/14 08:45 AM
                      FMA tput looks like a mistakeEmil Briggs2017/03/14 04:24 PM
                        FMA tput looks like a mistakeGian-Carlo Pascutto2017/03/15 01:32 AM
        AMD describes it as 6 wideBrett2017/03/04 01:39 PM
      AMD describes it as 6 wideTravis2017/03/04 01:06 PM
        FP & Int pipelines, AVX-256VertexMaster2017/03/04 02:09 PM
          FP & Int pipelines, AVX-256Travis2017/03/04 03:26 PM
          No Ryzen BKDG yetg c2017/03/19 07:43 AM
            No Ryzen BKDG yetGroo2017/03/20 02:02 PM
              No Ryzen BKDG & revision guide yetBrendan2017/03/22 12:37 AM
                No Ryzen BKDG & revision guide yetmuziqaz2017/03/22 03:26 AM
                No Ryzen BKDG & revision guide yetGroo2017/03/22 06:54 AM
                  No Ryzen BKDG & revision guide yetAdrian2017/03/22 07:54 AM
                  No Ryzen BKDG & revision guide yetblue2017/03/22 07:58 AM
                    FMA3 bug possibly connected to power managementhobold2017/03/22 09:14 AM
                      FMA3 bug possibly connected to power managementMatthias Waldhauer2017/03/28 07:20 AM
                        FMA3 bug only in WindowsPer Hesselgren2017/04/05 06:29 AM
                    No Ryzen BKDG & revision guide yetGroo2017/03/23 06:50 AM
                  No Ryzen BKDG & revision guide yetBrendan2017/03/23 11:31 PM
                    No Ryzen BKDG & revision guide yetMatthias Waldhauer2017/03/24 06:08 PM
        AMD describes it as 6 wideDavid Kanter2017/03/04 04:19 PM
          AMD describes it as 6 wide (how to test this?)Domaldel2017/03/22 09:58 AM
            AMD describes it as 6 wide (how to test this?)Travis2017/03/22 03:17 PM
              AMD describes it as 6 wide (how to test this?)Domaldel2017/03/23 12:49 AM
        AMD describes it as 6 wideanon2017/03/04 05:16 PM
          AMD describes it as 6 wideTravis2017/03/05 12:55 PM
            AMD describes it as 6 wideDavid Kanter2017/03/05 06:31 PM
              AMD describes it as 6 wideTravis2017/03/05 07:23 PM
                AMD describes it as 6 wideDavid Kanter2017/03/05 10:00 PM
                  AMD describes it as 6 wideTravis2017/03/06 11:15 AM
                    AMD describes it as 6 wideRobert David Graham2017/03/06 11:53 AM
                      AMD describes it as 6 wideTravis2017/03/06 04:09 PM
                    AMD describes it as 6 wideTIm McCaffrey2017/03/06 02:56 PM
                  AMD describes it as 6 wideDomaldel2017/03/08 04:36 PM
                1800x system availablemuziqaz2017/03/13 09:22 AM
                  1800x system availablePer Hesselgren2017/03/16 03:14 AM
                    1800x system availablemuziqaz2017/03/17 03:53 AM
                      1800x system availableAdrian2017/03/17 07:25 AM
                        1800x system availableAdrian2017/03/17 08:00 AM
                          1800x system availablemuziqaz2017/03/17 11:48 AM
                          1800x system availablemuziqaz2017/03/17 01:42 PM
                            1800x system availableAdrian2017/03/17 02:42 PM
                              1800x system availablemuziqaz2017/03/17 03:07 PM
                                1800x system availableAdrian2017/03/17 03:18 PM
                            1800x system availableAdrian2017/03/17 03:07 PM
                            1800x system availableDomaldel2017/03/18 06:40 AM
                              1800x system availablemuziqaz2017/03/18 06:54 AM
                                1800x system availablemuziqaz2017/03/18 02:28 PM
                                  1800x system availablePer Hesselgren2017/03/21 12:30 AM
                                    1800x system availablePer Hesselgren2017/03/21 02:58 AM
                                    1800x system availableGian-Carlo Pascutto2017/03/21 12:25 PM
                                      1800x system availableMichael_S2017/03/21 02:24 PM
                                        1800x system availableGian-Carlo Pascutto2017/03/21 02:56 PM
                                          1800x system availableMichael S2017/03/21 03:36 PM
                                            done (NT)Michael S2017/03/22 03:59 AM
                                      1800x system availableanonymou52017/03/21 03:27 PM
                                        1800x system availableWilco2017/03/22 06:17 PM
                                          1800x system availableDomaldel2017/03/23 12:52 AM
                                          1800x system availableanonymou52017/03/23 12:24 PM
                                            1800x system availableKlimax2017/04/03 01:35 AM
                                    1800x system availablemuziqaz2017/03/22 03:25 AM
                                      1800x system availablemuziqaz2017/03/24 03:24 AM
                              1800x system availableAdrian2017/03/18 08:36 AM
                    1800x system availablePer Hesselgren2017/03/17 08:49 AM
                      1800x system available-16TPer Hesselgren2017/03/17 01:10 PM
                      1800x system availableMichael S2017/03/18 10:21 AM
                        1800x system availablePer Hesselgren2017/03/19 03:06 AM
                          better test desired. This one is not pushing FPU to the limitsMichael S2017/03/19 04:28 AM
                            better test desired. This one is not pushing FPU to the limitsPer Hesselgren2017/03/19 08:49 AM
                              better test desired. This one is not pushing FPU to the limitsPer Hesselgren2017/03/19 10:47 AM
                                better test desired. This one is not pushing FPU to the limitsMichael S2017/03/19 11:45 AM
                                  better test desired. This one is not pushing FPU to the limitsMichael S2017/03/19 11:47 AM
        AMD describes it as 6 widePoindexter2017/03/05 07:55 PM
          AMD describes it as 6 wideTravis2017/03/05 08:05 PM
            AMD describes it as 6 wideDavid Kanter2017/03/05 10:03 PM
              AMD describes it as 6 wideTravis2017/03/06 11:19 AM
                AMD describes it as 6 widePoindexter2017/03/07 08:54 AM
                  AMD describes it as 6 wideTravis2017/03/07 01:54 PM
                  AMD describes it as 6 wideanon2017/03/08 04:19 AM
                    AMD describes it as 6 wideTravis2017/03/08 12:35 PM
                      AMD describes it as 6 wideanon2017/03/08 02:48 PM
                        AMD describes it as 6 wideTravis2017/03/09 12:02 PM
                          AMD describes it as 6 wideanon2017/03/09 05:44 PM
                            AMD describes it as 6 wideanon2017/03/12 08:28 AM
                            AMD describes it as 6 wideTravis2017/06/20 04:48 PM
                              AMD describes it as 6 wideAdrian2017/06/21 07:48 AM
Reply to this Topic
Name:
Email:
Topic:
Body: No Text
How do you spell green?