Is x86 architecture hampering performance?

Page 3 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

CTho9305

Elite Member
Jul 26, 2000
9,214
1
81
Originally posted by: Sahakiel
Originally posted by: imgod2u
Unfortunately, whether the cache blocks or not, the processor will still run into problems when the data needed is not there. This is the major bottleneck in modern systems with processor idle time being in the 200+ cycles. That's way more of a hit than any branch mispredict.
I think you missed the significance of a nonblocking cache, but you do bring up another good point. Although stalls on memory accesses are pretty rare (ignoring compulsory misses), their associated penalties are large enough to be a problem.
The thing is, even with a non-blocking cache, if you have, say, a 50-entry re-order buffer, any delays longer than 50 cycles are guarateend to stall you (assuming none of those intructions take multiplies or otherwise slow you down), because you can't track more instructions than the missing memory access and the next 49 instructions/uops. You also have to consider that it's very likely that within a few instructions, you'll hit something that depends on the result of the memory access, and you have to delay that instruction, and therefore any later ones that depend on it.
 

Sahakiel

Golden Member
Oct 19, 2001
1,746
0
86
Originally posted by: CTho9305
The thing is, even with a non-blocking cache, if you have, say, a 50-entry re-order buffer, any delays longer than 50 cycles are guarateend to stall you (assuming none of those intructions take multiplies or otherwise slow you down), because you can't track more instructions than the missing memory access and the next 49 instructions/uops. You also have to consider that it's very likely that within a few instructions, you'll hit something that depends on the result of the memory access, and you have to delay that instruction, and therefore any later ones that depend on it.

Yes, I know there are no magic pills, only a large number of anesthetics. Like I said, once the program is up and running, fetches to main memory are rare, but the penalties are high enough to be a problem. Non-blocking caches simply help alleviate the problem to a certain degree. With a OOOE design, the processor can continue with other calculations until it is forced to stall. Once the data is finally retrieved, the processor can then play catch up and pick up some of the slack, represented by pipeline bubbles. Depending on the processor design (based on speed and costs), you can pick up anywhere from 0-100+% of the slack (probably closer to 20% IRL). That's the flexibility of OOOE.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |