How To Diagnose Random Crashes?
How To Diagnose Random Crashes?
I operate a PC with these specifications: a mid-range processor, 8GB RAM, and a NAS setup. The hardware includes an Emulex NIC and an LSI IBM controller with firmware updated. My main concerns are reliability issues—system crashes after roughly 16 hours, random shutdowns, and inability to access the web interface or SSH. The CPU lacks integrated graphics, so no console output appears even with a GPU. I've tried several fixes: swapping RAM, removing the NIC, reinstalling the OS, using TrueNAS instead of UnRAID, changing storage media, and mirroring logs. Despite these attempts, logs remain limited and don't reveal the crash cause. The system becomes unresponsive from web access and doesn’t shut down smoothly; a hard reset requires pressing the power button for five seconds. I want to confirm this isn’t related to UnRAID, as I’ve faced similar problems with TrueNAS in identical configurations. Right now, I’m testing without the HBA plugged in to see if that triggers the issues. Any advice or alternative troubleshooting steps would be greatly appreciated.
Testing was performed at default settings. Clear CMOS and initial checks were completed. If using XMP, set the frequency to 2667MHz (maximum supported) and verify. Gradually raise the DRAM speed and test again (for OC). Active cooling of memory can also be beneficial.
The process finished successfully but took roughly the same duration. I also received updated information after using a GPU, which showed the screen fully disappears in this failed state. Anyone have suggestions or possible causes? It seems like a specific part might be responsible. I’m considering replacing my main board.