Are these sporadic failures linked to the kernel's position on physical devices?
Are these sporadic failures linked to the kernel's position on physical devices?
I've constructed my Windows 11 PC setup some time ago, and only recently started facing unpredictable crash issues.
When the crashes occur, both monitors turn black and the fans spin at maximum speed. If I don’t intervene, this situation can last for several hours. I have to manually initiate a shutdown using the I/O button before a proper restart occurs. The system continues its background operations; I can still hear the audio from the YouTube video I was watching prior to the crash, but the displays stop responding entirely.
The most recent crash happened today. At that time, the CPU and GPU weren’t operating at full capacity (I was just browsing YouTube), and temperatures likely wouldn’t have reached such high levels since the PC had been running for about 10-15 minutes. With many fans and a large AIO cooler for the CPU, the situation seems manageable.
Reviewing the Windows Event Viewer revealed critical or warning events just before the crash:
- Error (Event ID 6008): Unexpected system shutdown at 10:53:11 PM on 9/9/2024.
- Critical (Event ID 41): Kernel Power – The reboot didn’t occur cleanly. This might result from unresponsiveness, a crash, or power loss.
- Warning (Event ID 219): Kernel-PnP – Driver WUDFRd failed to load for the device path specified.
These logs suggest possible issues with system stability or driver problems.
I also encounter this error occasionally, but it doesn’t seem directly responsible for this crash. It appears sporadically over several hours.
Could you provide more details about the event? For instance, was there any fan control software malfunction or a GPU/PSU cable issue?
Here are my system details:
- Motherboard: Gigabyte B550 Vision D (Firmware 17, AGESA updates installed)
- CPU: AMD 5950X
- RAM: G-Skill Trident Z NEO 3600MHz (4x16GB)
- GPU: MSI 4090 Suprim X (latest NVidia driver update)
- PSU: Seasonic Prime TX-1000
- Case: Corsair 5000T (13 x SP120 fans)
- OS: Latest Windows 11
- Fan Control: Corsair iCue (latest update)
Any additional information would help clarify what might be causing these interruptions.
you receive a secure boot alert since tpm is turned on, disabling tpm in bios will resolve it (or activate secure boot as per your preference)
this issue isn't related to pnp errors, which typically cause a black screen and are linked to windows hello for biometric authentication
a black screen is usually due to the gpu, so verify that power cables are properly connected
by the way, a reboot via the case button doesn't seem to work?
AFAIK, I have to enable TPM to register Windows 11 or is there actually no reason to leave TPM enabled?
I'm using a straight 12VHPWR from cablemod that is compatible with my TX 1000 PSU and, so far, it has served me rather well. I've unplugged that power cord and saw no melted spots or anything similar, so I can only assume that the power deliver should be intact. I know that checking it with a Multimeter would probably be the better method to ascertain full functionality...
I noticed something strange when I was running some pytorch machine learning scripts. The training stopped after about 30 minutes or so with the notifications that there was a CUDA error, the screens didn't black out this time though. Could this just be a damaged GPU that is starting to act erratically? Or maybe it's just overburdened by the heat it had to put up with during this rather hot end of summer season?
Is there any way for me to check if all the CUDA cores are doing their jobs correctly? Can CUDA cores cause black screens as well?
That CUDA core error appeared before I unplugged and plugged in the power cord back again into the GPU. Maybe this simple procedure helped, the weather also got quite a bit colder so maybe that's helping as well.
I do have access to both a functional reset and shutdown button on my case, but I usually just press the shutdown button, leave the PC be for a minute or two and start it up from a clean slate.