ru24.pro
Game24.pro
Август
2024

Intel CPU crashes: what you need to know—microcode to blame but fix incoming this month, alongside two-year extended warranty

0
  • Some Intel 13th and 14th Gen CPUs (especially high-end ones) have been crashing, or even failing
  • The issue is caused by "elevated operating voltage"
  • It affects 13th and 14th Gen CPUs of 65 W Processor Base Power and higher
  • A mid-August microcode patch is planned to fix the issue for CPUs that have not yet been affected
  • If your CPU is affected by elevated operating voltage and you're experiencing instability, contact Intel customer support

Many PC gamers are understandably concerned about stability issues facing Intel 13th and 14th Gen CPUs, codename Raptor Lake, and especially the Core i9 13900K and Core i9 14900K. Over the past few months there have been increasing reports of stability issues and game crashes, and even some reports of chip failure.

Although issues were noted in 2023, in 2024 things only got more confusing as Intel, motherboard partners, game devs, tech journalists, and even Nvidia speculated about underlying causes or possible solutions. Intel's now officially investigating the issue, which it says is because of a microcode fault affecting voltages, and is planning a fix for August.

Status update: What's the latest

August 2, 2024:  We're waiting to hear more from Intel about the "elevated operating voltage" that it says is causing instability issues in 13th and 14th Gen CPUs that have a Processor Base Power of 65W or higher. That includes all K-series desktop processors.

We're also waiting for a microcode patch that Intel says should fix the issue for CPUs not already affected. Intel's targeting mid-August for rolling this patch out.

For users already affected, Intel recommends contacting its customer support. The company also just announced "two-year extended warranty support for our boxed Intel Core 13th and 14th Gen desktop processors."

What's the issue, exactly?

(Image credit: Future)

From mid-2023 onwards, we started to hear increasing reports from game devs and gamers that high-end Intel 13th and 14th Gen CPUs (primarily the Core i9 13900K and Core i9 14900K) were crashing in Unreal Engine games. By mid-July 2024 it became obvious that these Raptor Lake CPUs have stability issues. Both CPUs in servers and end-user systems have been noted to have a high prevalence of crashes and even chip failure.

These crashes seem to occur during shader compression/decompression/compilation in games. The crash errors often reference a problem with video memory, which might hint at a GPU issue but, in fact, are caused by the CPU. The errors might also reference data corruption. The stability issues also seem to manifest as general crashes, hang-ups, or unpredictable behaviour when the CPU is under heavy loads.

Over the past few months, the solutions offered up have been only temporary band-aids, such as underclocking or undervolting the CPU, which game devs and Nvidia recommended. In what now seems like another band-aid, back in April, Intel recommended motherboard manufacturers implement BIOS files that adhere to Intel's recommended power profiles. Manufacturers started to do so, but unfortunately the updated BIOS files didn't follow Intel's latest recommendations.

Now, Intel has admitted that there's actually a problem with the CPU microcode causing "elevated operating voltage" in affected processors and is planning on rolling out a mid-August microcode patch to fix the issue. While Intel has said it won't recall chips already affected, the company encourages anyone facing issues to contact customer support

When did we know about the issue?

(Image credit: Future)

The first hints that something might be wrong with Intel 13th and 14th Gen CPUs came during mid- to late-2023 when some game developers identified heightened crashing risks for them in games. 

In its August 2023 Remnant 2 update notes (via TechSpot), Gearbox stated: "We have identified an issue on some Intel 13th generation CPUs where upon startup the game will display a message about being out of video memory or the crash reporter will pop up referencing an issue with decompressing a shader. If you experience this problem, you will likely also see it in other DX12 games." 

Gearbox recommended underclocking affected CPUs using Intel's Extreme Tuning Utility (XTU).

At the end of 2023, Vermintide 2 developer Fatshark said, "It has been noted that players with the Intel i9 13900k/14900k and Intel i7 13700k/14700k CPUs are prone to these crashes. Players have been able to work around this by underclocking the 'Performance Core' speed using Intel XTU, from x55 to x53."

Similar reports continued into 2024. In February, RAD, a division of Epic Games, pointed the finger at Intel for Unreal Engine game crashes, stating, "We believe that this is a hardware problem which affects primarily Intel 13900K and 14900K processors, less likely 13700, 14700 and other related processors as well."

(Image credit: Future)

More and more Unreal Engine games seemed to be having problems with the current- and previous-gen Intel processors, particularly the high-end Core i9 13900K and Core i9 14900K, primarily during shader compilation. Around this time, all kinds of potential causes and solutions were bandied about, including undervolting the CPU. 

In seeming agreement with Fatshark and others, Nvidia reportedly recommended (via Tom's Hardware) affected users underclock their CPUs (and remember that clock frequency scales with voltage, so an underclock is equivalent to an indirect undervolt).

Intel eventually started investigating the issue, telling ZDNet Korea that it "is aware of problems that occur when executing certain tasks on 13th and 14th generation core processors for desktop PCs, and is analyzing them with major affiliates." It was also leaked that Intel recommended partnered "system and motherboard manufacturers to provide end users with a default BIOS profile that matches Intel recommended settings."

Top CPU coolers

(Image credit: Future)

Best AIO cooler for CPUs: Keep your chip chill.
Best air cooler for CPUs: Classic, quiet cooling.

Motherboard manufacturers started to implement "recommended" Intel Baseline Profiles to BIOS files. Asus added a Baseline profile, but it didn't do quite what Intel recommended. Other motherboard vendors followed suit, releasing BIOS updates that, according to Intel, didn't fully adhere to Intel's recommendations. 

Intel said (via Hardwareluxx), "Several motherboard manufacturers have released BIOS profiles labeled 'Intel Baseline Profile'. However, these BIOS profiles are not the same as the 'Intel Default Settings' recommendations that Intel has recently shared with its partners regarding the instability issues reported on 13th and 14th gen K SKU processors." 

The company then presented a table to make clear its actual recommendations and pressured motherboard manufacturers to implement proper default settings.

Then, in June, Intel found and patched a CPU microcode bug that wasn't the root cause of the problems but could have "potentially contributed" to 13th and 14th Gen instability. But given that the bug in question was to do with a thermal boost feature on the i9 chips, this only raised further questions about deeper thermal and voltage issues.

The stability issues were then brought back into the full light of day when, in mid-July, game developer Alderon Games said it was switching all its servers to AMD, claiming it's "only a matter of time before affected CPUs fail."

Further intrigue was sparked when YouTuber Level1Tech investigated Intel stability issues by looking through data some game devs gave him, finding that 90% of errors occurred for Intel 13th and 14th Gen systems. Level1Tech also spoke to a datacentre service provider that confirmed "unusually high" support incidents for Intel 13th and 14th Gen configurations.

The fact that servers seemed to be facing issues with 13th and 14th Gen chips made people think it might not be caused by high-end CPUs being pushed to the limits, because server CPUs are often lower power for stability.

Nevertheless, there was still reasonable speculation that the issues could be because of elevated or incorrect voltages. For instance, on 14 July, Buildzoid pointed out that the stability issues seem to be worse on higher voltages, which is why the Core i5 14600K and lower have fewer crash reports, because they mostly operate below 1.4V.

There's also been speculation over the issue being degradation caused by oxidation, but Intel has denied that oxidation is related to the current stability issues. It has, however, confirmed that Oxidation issues were "identified in late 2022" and that the company was "able to confirm full removal of impacted processors in our supply chain by early 2024," but that "on-shelf inventory may have persisted into early 2024 as a result." Whatever the case, Intel claims this is unrelated to the stability issues in question.

Intel then confirmed that the stability issues are, as others had speculated, to do with incorrect voltages, telling us that "elevated operating voltage is causing instability issues in some 13th/14th Gen desktop" and this "is stemming from a microcode algorithm resulting in incorrect voltage requests to the processor." It also said it is "delivering a microcode patch which addresses the root cause of exposure to elevated voltages," and that it's targeting mid-August for the patch release.

(Image credit: Future)

Since then, it's mostly been a waiting game. But Intel has told The Verge that it won't recall chips already affected, stating that "customers experiencing instability on their 13th or 14th Generation desktop processor-based systems should contact Intel customer support for further assistance." 

The company also confirmed that 13th and 14th Gen processors with a 65W Processor Base Power or higher could be affected (though of course not all will be). This means not just the i9s, but potentially all the way down to the Core i5 13400 and Core i5 14400, too.

That's where we're at right now, at least officially: We're waiting for Intel to push out its mid-August patch. Unofficially, though, the speculation and investigation continues. For instance, Buildzoid has been looking at Intel CPU voltages through oscilloscope, and, according to one forum summary, shows the problem looks to be caused by non-standard transient voltage spikes (in addition to the more normal ones). These spikes won't be picked up by normal voltage monitoring programs because they occur too quickly. This, so the theory goes, might be causing the processors to degrade prematurely.

That's unofficial, though, so for now we await further word from Intel. It might already be too late for public opinion and reputation, however, as a law firm has already started investigating whether to launch a class-action lawsuit against Intel over the instability issues. 

If you have an affected CPU, you can contact Intel customer support here. Intel's just announced "two-year extended warranty support for our boxed Intel Core 13th and 14th Gen desktop processors," too.

Here's what I think

(Image credit: Future)

I can't reliably recommend purchasing an Intel 13th or 14th Gen CPU right now, at least not without some major warnings in place. 

This is because Intel has been less than forthcoming about the prevalence of affected CPUs, the kinds of problems they're likely to face, and how 13th and 14th Gen users can tell whether their CPU has already been affected. We also don't know which CPU batches were affected by the separate issue of oxidation.

Intel's taken too long, in my opinion, to investigate and resolve these problems. These issues were noted as far back as mid-2023, and they were noted with some prevalence early in 2024. The back-and-forth between Intel and motherboard partners earlier in the year only seemed to obfuscate the issue for ordinary PC gamers.

If the upcoming microcode patch fixes the problem and we can be sure all newly purchased 13th and 14th Gen CPUs will be fine, I can happily recommend them again. But until we know that's the case, I advise caution. The fixes others have suggested (such as underclocking) are band-aids to a deeper underlying problem and, at any rate, decrease performance, so the official microcode patch is what we're waiting for.