Intel processors crashing Unreal engine games (and others)

Page 20 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

coercitiv

Diamond Member
Jan 24, 2014
6,257
12,196
136
That one I admit I didn't know about, but . . . it looks like all it does is affect CPU duty cycle, which isn't the AMD approach.
The main mechanic is to lower frequency & voltage in order to meet a max temperature target.

All they seemed to have done is upped turbo limits a bit and given the CPU a very short period of time in which it can hit those boost clocks provided thermal limits haven't already been hit.
TVB is not just about higher clocks, it enables lower voltage for a given clock target based on temperature. It also lowers max clocks based on temperature by limiting max turbo ratios for a given temperature threshold.

Quotes from the links I already provided you with:
Reaching the maximum operating temperature activates the Thermal Control Circuit (TCC). When activated the TCC causes both the processor IA core and graphics core to reduce frequency and voltage adaptively. The Adaptive Thermal Monitor will remain active as long as the package temperature remains at its specified limit. Therefore, the Adaptive Thermal Monitor will continue to reduce the package frequency and voltage until the TCC is de-activated.
The Adaptive Thermal Monitor does not require any additional hardware, software drivers, or interrupt handling routines.
Thermal Velocity Boost does two things.
  1. First, it decreases the operating voltage if the CPU temperature is below the TjMax.
  2. Two, it opportunistically increases the clock frequency above the Turbo Boost 2.0 and 3.0 frequencies if the CPU operates below a part-specific temperature threshold. This threshold is usually 70 degrees Celsius on desktop CPUs.
in 2020, Intel opened up the TVB configuration to motherboard vendors. The feature is named OverClocking Thermal Velocity Boost, or OCTVB for short.

The easiest way to think of OCTVB is limiting, or clipping, the maximum allowed CPU ratio based on the CPU operating temperature. The hotter the CPU, the more you clip the CPU ratio. OCTVB is based on the by core usage Turbo Ratio configuration. For each number of active cores, you can define two temperature points, each with a unique number of “down-bins’. A down-bin is essentially the number of ratios you want to drop.
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,639
14,630
136
The main mechanic is to lower frequency & voltage in order to meet a max temperature target.


TVB is not just about higher clocks, it enables lower voltage for a given clock target based on temperature. It also lowers max clocks based on temperature by limiting max turbo ratios for a given temperature threshold.

Quotes from the links I already provided you with:
My only input to this, is that Intel has lied about so many things that I don't trust anything they say. That includes all the quotes in the links below this.
 
Reactions: spursindonesia

lakedude

Platinum Member
Mar 14, 2009
2,580
329
126
My only input to this, is that Intel has lied about so many things that I don't trust anything they say.
That is it! Trust has been broken and not just with Intel.

Companies with good reputations don't seem to understand how important their reputation is anymore.

This absolutely played a role in choosing AMD for our latest build. The Intel chip beat our choice in most benchmarks but the power draw was so high. It wouldn't even be so bad if they said up front that their chips drew up to xxx watts and that they were designed to handle it. But instead there is all this obfuscation.

Ran into a little of this enabling XMP with our AMD but it has been rock solid.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,072
2,585
106
Lol poor Steve. Immediately after he posts this video news breaks that Intel is enforcing that all mobo makers must roll out a BIOS that enforces "Default" default settings of PL1=125W and PL2=188W for LGA1700 boards (equivalent to the Gigabyte numbers he shows in the video). So all his benches are technically unlocked from out of the box settings expected after May 31.
I thought about that too.

Between the two, Gigabyte board and Asus board, Steve guessed wrong, tested on Asus, at 253W, rather than Gigabyte 188W.
 

coercitiv

Diamond Member
Jan 24, 2014
6,257
12,196
136
Some context based on the table above: if CEP is enabled by default, even if they allow mobo makers to adjust AC Load Line on their own, they are no longer over rewarded for undervolting the CPU. In fact, the opposite is now true, aggressive undervolting with CEP enabled will lead to lower performance. Add the enforced power and current limits, and most new CPUs should do just fine.
 

DrMrLordX

Lifer
Apr 27, 2000
21,709
10,983
136
The main mechanic is to lower frequency & voltage in order to meet a max temperature target.


TVB is not just about higher clocks, it enables lower voltage for a given clock target based on temperature. It also lowers max clocks based on temperature by limiting max turbo ratios for a given temperature threshold.

Quotes from the links I already provided you with:

Yeah I read all that, and it's not the same as what AMD is doing.

If you take a Zen2 or Zen3 (and presumably a Zen4) running a ST workload, it's going to hit a particular maximum boost clock (which is only approximated by advertised ST boost clocks) without hitting max local temp (95C) and will not boost beyond that so long as temperature remains fixed (that is, fixed at a temp below 95C). If you lower temp by around 10C (by, say, turning up the AC or dunking a rad in icewater), it should buy you ~50mhz in extra clocks without you having to change any settings. The voltage will barely budge because it doesn't have to. The boost algo can adapt upwards or downwards, depending on the current temperature it has reached by maintaining a particular clockspeed and voltage. If things start to heat up, it'll back off clocks (and possibly voltage but mostly just clocks) well before it hits 95C. Note I'm using ST here for a reason, in that it's very easy to hit the 95C absolute temp limit on a Zen4 in particular in such a way that the boost behavior can be obscured. Even when you saturate the cooling solution and reach max temp, overall you're going to get a better v/f curve with a cooling solution that can handle more heat flux.

if you have a recent Intel k CPU running an ST workload, it'll either a). run straight to the TVB clock and stay there or b). hit the ST boost clock, depending on your board and cooling. In case a), if you lower temps by 10C, you get no improvement, because there is a TVB limit that it won't go beyond under any circumstance. The boost algo can't adapt upwards AND downwards, it can only adjust downwards thanks to the TCC, which is really just another way to throttle the CPU. In case b), assuming there was some kind of thermal limit preventing the CPU from holding TVB clocks, lowering temps by 10C or more will just convert your case from b) to a), that is, it'll rise to TVB clocks and then stop.

To bring things back around to the main topic, I don't think that Intel's TCC or anything similar necessarily contributes to the crashes. I do think it's indicative of their overall approach to power limits and boost behavior that result in a chip that isn't flexible enough to remain stable when so many of the power and current limits have been removed by default on Z790 motherboards. I do think that people who manage to keep their 13900k and 14900k/s CPUs cooler will probably get less crashing, while those who rely on the CPU's ability to throttle while using an HSF or other weaker cooling solution may be more prone to crashing. That's kind of a "duh" but when you compare it to the competition, the difference starts to stand out.
 
Jul 27, 2020
16,820
10,767
106
In fact, the opposite is now true, aggressive undervolting with CEP enabled will lead to lower performance. Add the enforced power and current limits, and most new CPUs should do just fine.
So someone wanting to undervolt to reduce temps and slightly increase ST boost needs to override the limits and disable CEP?
 
Jul 27, 2020
16,820
10,767
106
Is ASROCK/MSI (DDR5/DDR4 mobos I have) gonna help me stay in compliance with the above table or do I need to arrange a consultation session with an overclocking expert or freelance mobo repair/power delivery technician?

Whatever happened to plug and play???

Seriously need to consider dumping my 12700K and Z690D4/Z790D5 mobo on some unsuspecting Intel groupie...
 

krumme

Diamond Member
Oct 9, 2009
5,955
1,595
136
The reason we don't see reviewers hammering Intel non stop for this disaster is they are part of it. They let Intel had their cake and eat it; Market lower tdp CPU while they were consuming absolutely crazy amounts of power and giving nice benchmarks. Now it's all starting to crash in slow motion and imo reviewers need to see more critical of their own role in this dirty game.
 

Ranulf

Platinum Member
Jul 18, 2001
2,388
1,270
136
The reason we don't see reviewers hammering Intel non stop for this disaster is they are part of it. They let Intel had their cake and eat it; Market lower tdp CPU while they were consuming absolutely crazy amounts of power and giving nice benchmarks. Now it's all starting to crash in slow motion and imo reviewers need to see more critical of their own role in this dirty game.

Yeah, good luck with that.
 

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
28,663
21,170
146
In case anyone missed the conclusion of Schilling's article at HardwareLUXX -

So what does this mean for the customer?​

Basically everything stays the same. The specifications for the K models remain as they were before and as we used them in the tests. This also applies to the KS models with PL1 = PL2.

Basically, the question of the instability of the processors remains unanswered. A baseline profile wasn't the solution. As long as you stick to the "Intel Default Settings" in your motherboard settings, there shouldn't be any problems, according to Intel. Whether the mainboard manufacturers have implemented all of these in case of doubt is of course another matter.
r/whatcouldpossiblygowrong
 

MrTeal

Diamond Member
Dec 7, 2003
3,578
1,725
136
Even if they had come up with a very straightforward solution, it would be many months (or never) before motherboards with pre-update BIOSes clear the channel. For all the existing and near-future users, it's still going to rely on them doing a BIOS update to fix their instability, which is not something most users love doing.
 

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
28,663
21,170
146
Hold on to your trousers, they may have actually guessed right!
Gigabyte already pulled the bios. HardwareLUXX said it was using AC/DC LL of 1.7 mOhm instead of the recommended 1.1 mOhm.

This is a press release so I am not convinced this is Intel's final answer. Probably just sweating bullets Steve or another reviewer will use that 188W PL2 in their next video. 💀 Making it damage control with reviewers until they sort things out entirely. If Intel sticks to this as the official statement, or this IS the official statement? Then LOLZ!

EDIT: It's starting to feel a lot like -

 
Mar 8, 2024
37
110
66
The reason we don't see reviewers hammering Intel non stop for this disaster is they are part of it. They let Intel had their cake and eat it; Market lower tdp CPU while they were consuming absolutely crazy amounts of power and giving nice benchmarks. Now it's all starting to crash in slow motion and imo reviewers need to see more critical of their own role in this dirty game.

The upper echelon of tech-tubers have been harping on this for years, at least. Gamer's Nexus and HUB have been particularly excoriating with the latest developments, too.
 
Reactions: spursindonesia

Panino Manino

Senior member
Jan 28, 2017
830
1,036
136
Gigabyte already pulled the bios. HardwareLUXX said it was using AC/DC LL of 1.7 mOhm instead of the recommended 1.1 mOhm.

This is a press release so I am not convinced this is Intel's final answer. Probably just sweating bullets Steve or another reviewer will use that 188W PL2 in their next video. 💀 Making it damage control with reviewers until they sort things out entirely. If Intel sticks to this as the official statement, or this IS the official statement? Then LOLZ!

EDIT: It's starting to feel a lot like -



Like I commented before, I'm impressed by the lack of (more) reaction from communities.
Intel is getting confident that they can get away with no consequences.
 

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
28,663
21,170
146
Like I commented before, I'm impressed by the lack of (more) reaction from communities.
Intel is getting confident that they can get away with no consequences.
How much more do we need? 😵‍💫 Every PC reviewer, website, and forum and their social media accounts all have content about it. GN has not jumped on it feet first like Bruce Lee, only because they have been investigating the EK disaster. I am certain with this still being a circus they will have their own input soon. It is too big a SNAFU not to.
 

Thunder 57

Platinum Member
Aug 19, 2007
2,722
3,911
136
How much more do we need? 😵‍💫 Every PC reviewer, website, and forum and their social media accounts all have content about it. GN has not jumped on it feet first like Bruce Lee, only because they have been investigating the EK disaster. I am certain with this still being a circus they will have their own input soon. It is too big a SNAFU not to.

What unfortunate timing for GN. They spent all that time on EK and are ready to cover that and then BAM! big Intel story that they are missing out on.
 

Mopetar

Diamond Member
Jan 31, 2011
7,936
6,239
136
I would imagine he wants to do his own testing for any full video that he releases about this. Otherwise it'll probably get mentioned in a news roundup video.
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
25,639
14,630
136
Just saw this:
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |