Question About Unusual Crash/Reboot Occurring Only When System Is Left Unattended
Question About Unusual Crash/Reboot Occurring Only When System Is Left Unattended
Reliability Monitor details are available here: https://imgur.com/a/31P6gBu
Image reference: https://imgur.com/a/31P6gBu
Observation notes from the bottom of the picture show fans pulling into a case and exhaust exiting from the top and right side. The image orientation should have been flipped 90 degrees counter-clockwise. The front of my case appears at the bottom. Please accept this as straightforward feedback.
I’ve kept the system off during research sessions. This was the first instance I experienced a complete freeze while typing. Regarding the reliability monitor, what I notice is that Windows wasn’t properly shut down in most cases (the reliability history is saved as an XML file). The events I see in the event viewer are mainly from the OP and occasionally include this link: https://imgur.com/a/l6wa4kG
View: https://imgur.com/a/l6wa4kG
It seems today’s issue was likely due to a Windows Update.
Fans: my general thought being that the fans slow/stop when idle and something gets hot or perhaps a cable moves slightly (from a change in air flows) and looses connectivity or causes a short.
= = = =
In Reliability History take another look at the dates showing all three type of errors.
Problems started on 2/16 and stopped on 2/26 for 4 days (inclusive). Then reoccured today 3/1.
Any thing common to those days/dates: backups, updates, certain apps being run, etc.?
Check Task Scheduler for anything that may be triggering other processes at those times.
Are the errors or error patterns identical?
Make note of the error codes.
Any known installs or apps that may have been run on those dates?
Run the built in Windows Troubleshooters - just as a "do over".
Then run "dism" and "sdc /scannow" again.
= = = =
Many of the errors are in Misc. Failures - correct?
If there is an overall pattern of increasing numbers of varying errors then the PSU may be suspect.
I've reset the system completely and installed only Chrome and HWINFO64. I performed a full BIOS reset, wiped the OS drive, and completed the installation. The machine feels less stable now compared to before. Right now I'm using HWINFO logging, but I'm unsure if it will capture any issues. I'm planning to hand in the device and go to a repair shop because it's essential for my school work—especially since I'm just 1.5 terms away from graduation. Thanks for the details!
The system was turned off during periods with no activity. The timing doesn't show any regular pattern. In the Admin events, I see errors related to WHEA-Logger APIC IDs 0 or 10, accompanied by strange PCI errors that lack details. There are occasional warnings about something being dropped during transport. I haven't installed anything in more than a week. All start-up applications except OneDrive were disabled. Even after a fresh reset, only the Nvidia control panel and OneDrive appear in the system tray. I ran sfc scannow already—it detected corruption on a brand new installation. After that, I performed DISM and sfc again, and everything is clean. I also noticed this in HWINFO while typing: https://imgur.com/a/ARgreNN (View: https://imgur.com/a/ARgreNN). I'm rebooting now after the DISM process and using logging from HWINFO64 as well.
It’s an iBuyPower system I developed that has worked well until recently. I haven’t changed the thermal paste. I just turned off the monitors to monitor it and HWINFO is recording properly. Interestingly, it hasn’t crashed in more than an hour. I can still upload files, but nothing stands out at the moment.
Finally produced reliability rating LOL
View: https://imgur.com/a/4apnDcO
Here is what HWINFO has captured:
https://1drv.ms/f/s!AgTMZS8A00Rj4RoUq38a...s?e=smzErf
I have reviewed your original complaint again and realized the situation occurs when you are not using the computer. Has anything changed? What settings do you have for Windows updates? I adjusted my settings so that updates don’t install automatically, allowing me to be present and decide which update to install if needed.
Monitor: some displays operate with their own drivers. What are the specific models of these monitors?
The issue isn't always about the system being "active"; a power interruption can still lead to complications.
The repeated messages about the monitor stopping and Windows not shutting down properly point toward a possible loose connection or a failing power supply unit.
Check the details and examine the error codes; they might not always be useful.
When the machine is idle, background tasks such as backups or updates might still run.
Review the Task Scheduler for any processes that could be active during inactivity.
You can also use Process Explorer (free from Microsoft) to inspect all running processes.
https://learn.microsoft.com/en-us/sysint...s-explorer
I prefer the @Fix_that_Glitch suggestion—stopping updates could help simplify operations and improve control.
It might be better to consult others about certification-related errors, etc.
= = = =
Make sure all critical data is safely backed up at least twice, in separate locations away from the system itself.
Also confirm that the data can be recovered and read easily.
Unfortunately this situation has shifted. It is currently not automatically fixing itself and now behaves like a freeze—everything remains frozen as before, with no interaction and dark screens. I had to manually force a shutdown and restart. While checking the reliability history, I saw an AMD driver failed to install, so I used AMD’s chipset installer and updated the remaining components. I left it running overnight with the monitors turned off.
I paused the Windows update process and ran the update troubleshooter, which detected the issue and reported a fix. A new HWINFO log file was created and is now available.