better test desired. This one is not pushing FPU to the limits

By: Michael S (already5chosen.delete@this.yahoo.com), March 19, 2017 4:28 am
Room: Moderated Discussions
Per Hesselgren (perhesselgren.delete@this.yahoo.se) on March 19, 2017 3:06 am wrote:
> Michael S (already5chosen.delete@this.yahoo.com) on March 18, 2017 10:21 am wrote:
> > Per Hesselgren (perhesselgren.delete@this.yahoo.se) on March 17, 2017 8:49 am wrote:
> > >
> > > Now I have got a Ryzen 1700 myself so I have some results.
> > > This is the single thread matrix multiply:
> > >
> > > Algorithm Ivy Bridge Excavator Ryzen
> > > ----n 8,09 8,25 5,22
> > > ----v 7,91 7,59 5,58
> > > ----u 7,79 4,56 2,5
> > > ----p 8,06 7,74 5,27
> > > ----t 3,08 6,35 4,94
> > > ----i 1,58 2,5 1,31
> > > ----b 4,19 6,26 3,87
> > > ----m 1,39 3,08 1,15
> > > ----w 2,22 3,66 1,97
> > > ----r 3,09 6,2 4,94
> > >
> > > The times in secs are not so interesting as the clocks are all different.
> > > But if we use the -n algorithm time as 100% index we get:
> > > Algorithm Ivy Bridge Excavator Ryzen
> > > -----n 100 100 100
> > > -----v 98 92 107
> > > -----u 96 55 48
> > > -----p 100 94 101
> > > -----t 38 77 95
> > > -----i 20 30 25
> > > -----b 52 76 74
> > > -----m 17 37 22
> > > -----w 27 44 38
> > > -----r 38 75 95
> >
> > Can you report it in FLOPs/core and FLOPs/(core*Hz) ?
> >
> > Results for Algorithm m will be sufficient, the rest of them are obviously doing something wrong.
> >
> As we have N³ multiplies and N³-N² adds 1 sec means around 2 GFLOPS.

So, the time reported is for a single multiplication of 1000x1000 matrices?

> -m Ivy Bridge=1.44 Excavator=0.65 Ryzen=1,74
> This is all single thread with clocks 3.3, 3.5 and 3.7 GHz
> -m Ivy Bridge=0.44 Excavator=0.19 Ryzen=0.47 GFLOPS/GHz

Thank you.

>
> This is an old compiler so I made some tests with 32-bit GCC for Ryzen:
> -i was the best with 0.78 secs

Still only 0.69 FLOPs/(core*Hz)

> For Raspberry Pi (1.2 GHz)
> -r was the best with 5.03 secs followed by
> -t at 5.04 secs
>

All x86 scores are EXTREMELY low. The test appears to not push IvyB/Excavator/Ryzen FPUs at all. The bottleneck is somewhere else. Most likely, compiler does not utilize SIMD at all. But even without SIMD and without FMA at FPU level all this cores shell be capable of ~1.5-1.8 FLOPs/(core*Hz)

As to Raspberry Pi, I don't know if it is pushing FPU or not.
Is your Raspberry Pi a BCM2837 running in 64-bit mode? ARM Cortex A53?
I didn't find FPU throughput numbers in A53 TRM. I am sure that other RWT posters (Wilco? none? Exophase?) can tell us.

< Previous Post in ThreadNext Post in Thread >
TopicPosted ByDate
Is Ryzen 6-wide?Travis03/03/17 06:27 PM
  Is Ryzen 6-wide?Maynard Handley03/03/17 06:50 PM
  AMD describes it as 6 wideVertexMaster03/03/17 11:27 PM
    AMD describes it as 6 wideExophase03/04/17 12:21 PM
      AMD describes it as 6 wideanon03/04/17 12:54 PM
        AMD describes it as 6 wideTravis03/04/17 01:12 PM
          AMD describes it as 6 wideTravis03/04/17 02:05 PM
            AMD describes it as 6 wideExophase03/04/17 05:16 PM
              AMD describes it as 6 wideMichael S03/05/17 10:27 AM
              AMD describes it as 6 wideTravis03/05/17 12:45 PM
                AMD Ryzen inst lat and tputPer Hesselgren03/07/17 12:00 AM
                  AMD Ryzen inst lat and tputTravis03/07/17 01:50 PM
                    AMD Ryzen inst lat and tputGabriele Svelto03/08/17 04:37 AM
                      AMD Ryzen inst lat and tputTravis03/08/17 12:03 PM
                  AMD Ryzen inst lat and tputTravis03/07/17 02:01 PM
                    Sorry, failed to properly close code tag :( (NT)Travis03/07/17 02:02 PM
                    AMD Ryzen inst lat and tputPer Hesselgren03/07/17 02:48 PM
                      AMD Ryzen inst lat and tputTravis03/07/17 03:16 PM
                      AMD Ryzen inst lat and tputPer Hesselgren03/07/17 03:19 PM
                        AMD Ryzen inst lat and tputPer Hesselgren03/18/17 07:50 AM
                          AMD Ryzen inst lat and tputPer Hesselgren03/18/17 08:05 AM
                            AMD Ryzen inst lat and tput-7zipPer Hesselgren03/25/17 03:48 AM
                  FMA tput looks like a mistakeMichael S03/12/17 03:24 AM
                    FMA tput looks like a mistakeGian-Carlo Pascutto03/14/17 01:27 AM
                      FMA tput looks like a mistakeMichael S03/14/17 03:39 AM
                        FMA tput looks like a mistakeGian-Carlo Pascutto03/14/17 07:36 AM
                          FMA tput looks like a mistakeMichael S03/14/17 08:45 AM
                      FMA tput looks like a mistakeEmil Briggs03/14/17 04:24 PM
                        FMA tput looks like a mistakeGian-Carlo Pascutto03/15/17 01:32 AM
        AMD describes it as 6 wideBrett03/04/17 01:39 PM
      AMD describes it as 6 wideTravis03/04/17 01:06 PM
        FP & Int pipelines, AVX-256VertexMaster03/04/17 02:09 PM
          FP & Int pipelines, AVX-256Travis03/04/17 03:26 PM
          No Ryzen BKDG yetg c03/19/17 07:43 AM
            No Ryzen BKDG yetGroo03/20/17 02:02 PM
              No Ryzen BKDG & revision guide yetBrendan03/22/17 12:37 AM
                No Ryzen BKDG & revision guide yetmuziqaz03/22/17 03:26 AM
                No Ryzen BKDG & revision guide yetGroo03/22/17 06:54 AM
                  No Ryzen BKDG & revision guide yetAdrian03/22/17 07:54 AM
                  No Ryzen BKDG & revision guide yetblue03/22/17 07:58 AM
                    FMA3 bug possibly connected to power managementhobold03/22/17 09:14 AM
                      FMA3 bug possibly connected to power managementMatthias Waldhauer03/28/17 07:20 AM
                        FMA3 bug only in WindowsPer Hesselgren04/05/17 06:29 AM
                    No Ryzen BKDG & revision guide yetGroo03/23/17 06:50 AM
                  No Ryzen BKDG & revision guide yetBrendan03/23/17 11:31 PM
                    No Ryzen BKDG & revision guide yetMatthias Waldhauer03/24/17 06:08 PM
        AMD describes it as 6 wideDavid Kanter03/04/17 04:19 PM
          AMD describes it as 6 wide (how to test this?)Domaldel03/22/17 09:58 AM
            AMD describes it as 6 wide (how to test this?)Travis03/22/17 03:17 PM
              AMD describes it as 6 wide (how to test this?)Domaldel03/23/17 12:49 AM
        AMD describes it as 6 wideanon03/04/17 05:16 PM
          AMD describes it as 6 wideTravis03/05/17 12:55 PM
            AMD describes it as 6 wideDavid Kanter03/05/17 06:31 PM
              AMD describes it as 6 wideTravis03/05/17 07:23 PM
                AMD describes it as 6 wideDavid Kanter03/05/17 10:00 PM
                  AMD describes it as 6 wideTravis03/06/17 11:15 AM
                    AMD describes it as 6 wideRobert David Graham03/06/17 11:53 AM
                      AMD describes it as 6 wideTravis03/06/17 04:09 PM
                    AMD describes it as 6 wideTIm McCaffrey03/06/17 02:56 PM
                  AMD describes it as 6 wideDomaldel03/08/17 04:36 PM
                1800x system availablemuziqaz03/13/17 09:22 AM
                  1800x system availablePer Hesselgren03/16/17 03:14 AM
                    1800x system availablemuziqaz03/17/17 03:53 AM
                      1800x system availableAdrian03/17/17 07:25 AM
                        1800x system availableAdrian03/17/17 08:00 AM
                          1800x system availablemuziqaz03/17/17 11:48 AM
                          1800x system availablemuziqaz03/17/17 01:42 PM
                            1800x system availableAdrian03/17/17 02:42 PM
                              1800x system availablemuziqaz03/17/17 03:07 PM
                                1800x system availableAdrian03/17/17 03:18 PM
                            1800x system availableAdrian03/17/17 03:07 PM
                            1800x system availableDomaldel03/18/17 06:40 AM
                              1800x system availablemuziqaz03/18/17 06:54 AM
                                1800x system availablemuziqaz03/18/17 02:28 PM
                                  1800x system availablePer Hesselgren03/21/17 12:30 AM
                                    1800x system availablePer Hesselgren03/21/17 02:58 AM
                                    1800x system availableGian-Carlo Pascutto03/21/17 12:25 PM
                                      1800x system availableMichael_S03/21/17 02:24 PM
                                        1800x system availableGian-Carlo Pascutto03/21/17 02:56 PM
                                          1800x system availableMichael S03/21/17 03:36 PM
                                            done (NT)Michael S03/22/17 03:59 AM
                                      1800x system availableanonymou503/21/17 03:27 PM
                                        1800x system availableWilco03/22/17 06:17 PM
                                          1800x system availableDomaldel03/23/17 12:52 AM
                                          1800x system availableanonymou503/23/17 12:24 PM
                                            1800x system availableKlimax04/03/17 01:35 AM
                                    1800x system availablemuziqaz03/22/17 03:25 AM
                                      1800x system availablemuziqaz03/24/17 03:24 AM
                              1800x system availableAdrian03/18/17 08:36 AM
                    1800x system availablePer Hesselgren03/17/17 08:49 AM
                      1800x system available-16TPer Hesselgren03/17/17 01:10 PM
                      1800x system availableMichael S03/18/17 10:21 AM
                        1800x system availablePer Hesselgren03/19/17 03:06 AM
                          better test desired. This one is not pushing FPU to the limitsMichael S03/19/17 04:28 AM
                            better test desired. This one is not pushing FPU to the limitsPer Hesselgren03/19/17 08:49 AM
                              better test desired. This one is not pushing FPU to the limitsPer Hesselgren03/19/17 10:47 AM
                                better test desired. This one is not pushing FPU to the limitsMichael S03/19/17 11:45 AM
                                  better test desired. This one is not pushing FPU to the limitsMichael S03/19/17 11:47 AM
        AMD describes it as 6 widePoindexter03/05/17 07:55 PM
          AMD describes it as 6 wideTravis03/05/17 08:05 PM
            AMD describes it as 6 wideDavid Kanter03/05/17 10:03 PM
              AMD describes it as 6 wideTravis03/06/17 11:19 AM
                AMD describes it as 6 widePoindexter03/07/17 08:54 AM
                  AMD describes it as 6 wideTravis03/07/17 01:54 PM
                  AMD describes it as 6 wideanon03/08/17 04:19 AM
                    AMD describes it as 6 wideTravis03/08/17 12:35 PM
                      AMD describes it as 6 wideanon03/08/17 02:48 PM
                        AMD describes it as 6 wideTravis03/09/17 12:02 PM
                          AMD describes it as 6 wideanon03/09/17 05:44 PM
                            AMD describes it as 6 wideanon03/12/17 08:28 AM
                            AMD describes it as 6 wideTravis06/20/17 04:48 PM
                              AMD describes it as 6 wideAdrian06/21/17 07:48 AM
Reply to this Topic
Name:
Email:
Topic:
Body: No Text
How do you spell green?