Feeling stuck, need help identifying the issue. Let's troubleshoot together!
So on Friday our work PC was running fine. Then it started having problems after some recent updates. I logged in quickly, installed a free monitoring tool, and saw the CPU temperature jump to 85°C before the machine restarted. We replaced the CPU fan (the sticker had mistakenly been left on) and temperatures came down, but the restarts persisted. I suspected an NVIDIA driver problem because programs displayed today's date, but even with the overheating fixed, the PC still didn't work properly. I tried a system restore to an earlier point, but the power issues remained. I swapped the power supply, and with the new cooler and no driver changes it stayed stable for about 15 minutes. Then it crashed again. Now I'm unsure whether the problem is the RAM, the SSD, or the RTX 3060 itself. I'm not familiar with the event logs and find them hard to read. What should I do to diagnose this? The restarts seem to happen at random. Ideally, I'd just move to a new PC until I can investigate further. Unfortunately, this machine runs software in a server configuration, apparently a network setup for a dental design program. If the dongle and database were reachable over the network from the login screen, the team could keep using them. But the PC is unusable, and the connected scanner is down. It's not a dedicated server; it's an Asus ROG Strix GT15. Any advice?
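Since reading the event logs is the sticking point, here is a minimal Python sketch of one way to narrow them down. It assumes the System log has been exported from Event Viewer as CSV and that the export has "Date and Time", "Source", and "Event ID" columns; those column names, the function names, and the sample rows are illustrative assumptions, not taken from this machine. It only picks out three event IDs that Windows commonly logs around unexpected restarts (41 Kernel-Power, 6008 EventLog, 1001 BugCheck), and the hint it prints is a rough rule of thumb, not a diagnosis.

```python
import csv
import io

# Event IDs Windows commonly logs around an unexpected restart.
RESTART_EVENT_IDS = {
    41: "Kernel-Power: system rebooted without shutting down cleanly",
    6008: "EventLog: the previous shutdown was unexpected",
    1001: "BugCheck: system restarted after a blue screen",
}


def find_restart_events(csv_text):
    """Return (timestamp, event_id, source) for each restart-related row.

    Assumes header names "Date and Time", "Source", "Event ID"
    (an assumption about the export format; adjust if yours differ).
    """
    hits = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        try:
            event_id = int(row["Event ID"])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows without a usable event ID
        if event_id in RESTART_EVENT_IDS:
            hits.append((row.get("Date and Time", ""), event_id, row.get("Source", "")))
    return hits


def rough_hint(hits):
    """Very rough rule of thumb based on which event IDs are present."""
    ids = {event_id for _, event_id, _ in hits}
    if 1001 in ids:
        return "Bugcheck recorded: note the stop code; a driver or RAM fault is plausible."
    if 41 in ids or 6008 in ids:
        return "No bugcheck recorded: abrupt power loss or a hard reset is more likely."
    return "No unexpected-restart events found in this export."


# Made-up sample rows, only to show the expected shape of the input.
SAMPLE = (
    "Level,Date and Time,Source,Event ID,Task Category\n"
    "Critical,01/01/2024 10:15:00,Microsoft-Windows-Kernel-Power,41,(63)\n"
    "Information,01/01/2024 10:14:00,Service Control Manager,7036,None\n"
    "Error,01/01/2024 10:15:05,EventLog,6008,None\n"
)

for when, event_id, source in find_restart_events(SAMPLE):
    print(when, event_id, source, "-", RESTART_EVENT_IDS[event_id])
print(rough_hint(find_restart_events(SAMPLE)))
```

To try it on a real export, read the file into a string (for example with `open(path, encoding="utf-8-sig").read()`, where `path` is wherever you saved the CSV) and pass that to `find_restart_events`. If only 41 and 6008 show up with no bugcheck, that fits the hardware-side suspects in this thread (PSU, board, power switch) better than a driver crash, though it does not rule one out.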
This might or might not help. I had random restarts and shutdowns, sometimes occasionally and sometimes several times a day. I changed the PSU and graphics card, moved components between PCIe slots, and tried various RAM configurations. After swapping the PSU the issue seemed resolved, but it returned after a few weeks. From what I saw, the likely culprit is the power switch or its header cable. Removing it from the motherboard didn't fix it for me, but it is still worth checking whether your system runs with it unplugged.
High temps normally don't force a restart; the system would just throttle and slow down. 85°C isn't that extreme. I'd carefully re-seat all components. Use TestMem5 or Memtest86 to check your RAM. If your CPU is not an F-series model (those have no integrated graphics), you could also try running without the GPU.
85°C isn't too bad, you're correct, but an overheating CPU might force a restart, which is why I replaced the cooler. I'll run a memory test to see whether it can keep running, and I'll try safe mode first to see what happens.
Disconnected the power button header; it stayed up for 8 minutes before restarting. Memtest is probably next.
Disconnect everything non-essential: fans and fan controllers (keep the CPU fan or pump), any USB devices other than the boot disk, and all front panel connectors. Then check whether the issue persists. I've seen USB disks, fan controllers, RAM, motherboards, power supplies, and more trigger erratic reboots. CPUs also tend to throttle above 90°C rather than crash outright. If you have a Samsung boot drive, install Samsung Magician and check for firmware updates; the firmware a drive ships with can be unstable.