Article: Medfield, Intel's x86 Phone Chip
By: Wilco (Wilco.Dijkstra.delete@this.ntlworld.com), January 26, 2012 7:25 am
Room: Moderated Discussions
observer (no.thanks@for.now) on 1/26/12 wrote:
---------------------------
>A predicted branch would be predicted either taken or not taken, so fetching would
>start working on either of those paths, right? High performance "deeply" pipelined
>instruction fetch costs power in my experience... not much difference in power for
>correct / incorrect predictions, but correct predictions reduce execution time ->
>lowering the energy consumption for a given task. Could you elaborate why this would
>be very different for the core you had in mind? Disregarding execution unit power that is.
We were discussing max instantaneous power for a CPU, not energy consumption for a given task. It doesn't make sense to talk about maximizing energy use, you'd just turn off the caches... You actually want to minimize the total energy by lowering frequency and voltage to the optimal perf/Watt point (for a low leakage CPU that would typically be its lowest frequency if there is no time limit). That means average power consumption will be extremely low.
To get max power use, you have to maximise the number of transistor switches per cycle, especially the expensive transistors in critical paths such as cache lookup. Stalling a modern CPU doesn't use much power as there are no transistor switches in the majority of the core due to clock gating. Therefore you need to minimize interlocks, branch mispredicts, cachemisses etc. You need a branch every cycle as sequential fetch avoids TLB and cache lookups and just reads the next block from a preselected cacheline.
Wilco
---------------------------
>A predicted branch would be predicted either taken or not taken, so fetching would
>start working on either of those paths, right? High performance "deeply" pipelined
>instruction fetch costs power in my experience... not much difference in power for
>correct / incorrect predictions, but correct predictions reduce execution time ->
>lowering the energy consumption for a given task. Could you elaborate why this would
>be very different for the core you had in mind? Disregarding execution unit power that is.
We were discussing max instantaneous power for a CPU, not energy consumption for a given task. It doesn't make sense to talk about maximizing energy use, you'd just turn off the caches... You actually want to minimize the total energy by lowering frequency and voltage to the optimal perf/Watt point (for a low leakage CPU that would typically be its lowest frequency if there is no time limit). That means average power consumption will be extremely low.
To get max power use, you have to maximise the number of transistor switches per cycle, especially the expensive transistors in critical paths such as cache lookup. Stalling a modern CPU doesn't use much power as there are no transistor switches in the majority of the core due to clock gating. Therefore you need to minimize interlocks, branch mispredicts, cachemisses etc. You need a branch every cycle as sequential fetch avoids TLB and cache lookups and just reads the next block from a preselected cacheline.
Wilco
Topic | Posted By | Date |
---|---|---|
Medfield article online | David Kanter | 2012/01/23 02:51 PM |
server error | bakaneko | 2012/01/24 04:00 AM |
Fixed | David Kanter | 2012/01/24 05:02 AM |
Fixed | Joel | 2012/01/24 08:43 AM |
Fixed | Ricardo B | 2012/01/24 12:25 PM |
Fixed | David Kanter | 2012/01/24 06:29 PM |
Fixed | Gabriele Svelto | 2012/01/24 02:07 PM |
Fixed | David Kanter | 2012/01/24 06:30 PM |
Reference platform battery life | Doug Siebert | 2012/01/24 03:03 PM |
standby time | Foo_ | 2012/01/25 07:58 AM |
standby time | Anon | 2012/01/26 04:42 AM |
standby time | Foo_ | 2012/01/26 05:02 AM |
standby time | Doug Siebert | 2012/01/26 01:39 PM |
standby time | Anon | 2012/01/26 02:22 PM |
standby time | anon | 2012/01/26 03:08 PM |
standby time | Anon | 2012/01/26 07:03 PM |
standby time | anon | 2012/01/26 09:57 PM |
standby time | anon | 2012/01/26 10:01 PM |
standby time | Anon | 2012/01/27 10:32 PM |
standby time | Doug Siebert | 2012/01/27 03:15 PM |
standby time | anon | 2012/01/27 03:41 PM |
Reference platform battery life | David Kanter | 2012/01/27 11:09 AM |
Performance analysis laughable | Wilco | 2012/01/24 04:23 PM |
Performance analysis laughable | David Kanter | 2012/01/24 06:19 PM |
Performance analysis laughable | IntelUser2000 | 2012/01/24 08:30 PM |
Performance analysis laughable | IntelUser2000 | 2012/01/24 08:32 PM |
Performance analysis laughable | David Kanter | 2012/01/25 12:34 AM |
Performance analysis laughable | IntelUser2000 | 2012/01/25 12:56 AM |
Performance analysis laughable | David Kanter | 2012/01/25 03:07 AM |
Performance analysis laughable | Alberto | 2012/01/25 01:54 PM |
Atom HT gain | Wilco | 2012/01/25 06:43 AM |
Atom HT gain | IntelUser2000 | 2012/01/25 07:53 AM |
Atom HT gain | none | 2012/01/25 08:04 AM |
Atom HT gain | IntelUser2000 | 2012/01/25 08:35 AM |
Atom HT gain | Foo_ | 2012/01/25 08:06 AM |
Performance analysis laughable | Wilco | 2012/01/24 09:21 PM |
Performance analysis laughable | David Kanter | 2012/01/24 11:13 PM |
Performance analysis laughable | Wilco | 2012/01/25 05:30 AM |
Performance analysis laughable | none | 2012/01/25 07:14 AM |
Performance analysis laughable | Wilco | 2012/01/25 08:18 AM |
Performance analysis laughable | observer | 2012/01/26 05:17 AM |
Performance analysis laughable | Wilco | 2012/01/26 07:25 AM |
Process numbers | Alberto | 2012/01/26 10:29 AM |
Performance analysis laughable | David Kanter | 2012/02/02 01:38 AM |
Performance analysis laughable | tupper | 2012/01/25 05:27 PM |
Performance analysis laughable | Linus Torvalds | 2012/01/25 09:37 PM |
Performance analysis laughable | Doug Siebert | 2012/01/26 03:12 PM |
Medfield article online | Andreas | 2012/01/25 04:10 AM |
Medfield article online | Alberto | 2012/01/25 10:44 AM |
Medfield article online | IntelUser2000 | 2012/01/25 11:24 AM |
Medfield article online | David Kanter | 2012/01/25 10:58 PM |
Medfield article online | Doug Siebert | 2012/01/26 02:20 PM |
Medfield article online | Eric | 2012/01/26 07:10 PM |
Medfield article online | Doug Siebert | 2012/01/27 03:40 PM |
64-bit | Ingeneer | 2012/01/25 10:28 AM |
64-bit | Foo_ | 2012/01/25 11:23 AM |
64-bit | Ingeneer | 2012/01/25 03:34 PM |
64-bit | Ungo | 2012/01/25 05:08 PM |
64-bit | EduardoS | 2012/01/26 01:55 PM |
Saltwell memcpy | SHK | 2012/01/26 03:41 AM |
Medfield WiFi & Bluetooth | Rob Thorpe | 2012/01/26 04:09 AM |
Medfield WiFi & Bluetooth | David Kanter | 2012/01/27 06:54 PM |
Medfield WiFi & Bluetooth | Rob Thorpe | 2012/01/28 03:22 PM |
Medfield article online (NT) | Anil | 2012/01/26 06:57 PM |
Medfield article online | Anil | 2012/01/26 07:11 PM |
Medfield article online | Mr. Camel | 2012/01/26 07:26 PM |
Medfield article online | none | 2012/01/27 02:41 AM |