Issue with random and persistent system crashes or freezes during initial configuration
Issue with random and persistent system crashes or freezes during initial configuration
Hi all,
This is a new build from June that worked perfectly throughout the month until Tuesday of last week. Since then I I have had daily crashes under low/no load and attempted multiple fixes and forums. Hoping you all can shed some light.
Issues encountered:
-
Screen freeze followed by blacking out and powering off.
- Screen freeze and required forced shutdown
- Screen goes to sleep (while all power off and sleep settings are "never") and then comes back on
- When rebooting from a crash, wont get past the MB screen.
- System freezing while browsing under very low load / Freezing when opening start menu / Freezing and crash when just opening excel.
- When installing AMD bluetooth driver update I lost ethernet connection.
- While installing windows 11 from USB, system refusing to restart and screen flickered green.
- Extreme lag in windows log followed by desktop (usually ended up in system crashing)
System Specs:
Case:
Lian Li A4 H20
Board
: B650i Aorus Ultra
BIOS
: BIOS Version/Date American Megatrends International, LLC. F30, 22/05/2024
VGA
: MSI Ventus 3x OC 4070TI Super 16gb
Riser Cable
PCIE Gen 4 - included in case
PSU
: Corsair SF750
CPU
: AMD Ryzen 7800x3d
MEM
: G.SKILL Flare X5 Series (AMD EXPO) DDR5 RAM 32GB (2x16GB) 6000MT/s CL30-38-38-96 1.35V (F5-6000J3038F16GX2-FX5)
HDD/SSD
: 1x Ssd Samsung 980 Pro 1tb M.2 Pci-e Gen4 Nvme - 7000mbs
COOLER
: Water Cooler Cooler Master Masterliquid 240 Atmos Argb 240mm
Keyboard
: At the moment a Logitech G pro TKL
Mouse
: Logitech Pro Wireless
OC
: None
OS
: Windows 11 64bit Home version 23H2
Display
: ASUS TUF 31.5", 144Hz, 2K QHD, 1ms, DisplayPort e HDMI, FreeSync, HDR
-
How display is connected to the GPU
?: DP
What I've tried and did not help:
-
Made sure I am on latest BIOS (there is 1 version recently from July I still need to test)
- 2x windows installs, followed by AMD chipset updates and clean nvidia drivers. Done from a newly created bootable USB install.
-
DDU and clean install of latest nvidia drivers
- DDU and install of 2 previous nvidia drivers
- Changed power outlet
- Reset GPU in case and repositioned cables. Checked connection in PSU and all power connections in MB. Checked PCIE connections to 12vhr split and repositioned slightly in case.
- Currently doing a memtest86 on each RAM. Card 1 is 90% done and has zero errors.
What I've tried and seems to work
- My previous system had a GTX 1080TI, when installing in my current build the system seems to work. I had 1 crash only a few days ago but it was not clear why. I have had this system running for a while and had no further issues..
- Installing 4070ti super in my older system - So far zero crashes at all. In this build the GPU sites on the MB.
Sharing a link with the most recent crashes on the system I described. Before the crashes I ran 3d mark tests that worked fine only for the system to freeze and crash randomnly later in the morning.
Link:
New System with 4070
I have had the 4070 in my old system all day today with zero errors. Similarly, had the 1080ti in my new system with no issues for most of the day, I asked a question on this in the microsoft forums and the
answer was this was a driver issue followed by a likely bad card
(but the card works fine in a different setup).
On MSI the folks pointed to a possible issue with the Riser cable, but the pc was working fine for 1 month before any issues at all and the 1080ti is on the same riser.
I am hoping you all can help me before I give up and have to take the pc to repair shop.
Thank you!
Welcome to the forums, newcomer!
BIOS
: BIOS Version/Date American Megatrends International, LLC. F30, 22/05/2024
There is one more BIOS update waiting.
Two windows installs, then AMD chipset updates and fresh nvidia drivers. Completed using a freshly made bootable USB install.
Have you installed the OS in offline mode? Did you manually add all necessary drivers from their support sites?
PSU
: Corsair SF750
Is this device brand new?
I posted a question on the microsoft forums and received a response indicating it was a driver problem, possibly linked to a faulty card (though it functions properly elsewhere).
You can eliminate the GPU as the main cause by testing it in a system with more power from the PSU.
On MSI, they mentioned a potential Riser cable issue, but the PC operated normally for a month before any problems appeared, and the 1080ti was connected to the same riser.
To test the riser cable, you can remove it and see if the problem continues.
Today I completed the memtest86 on both RAM modules and found no problems. After inserting them, the PC began to boot in an unstable way. Sometimes I accessed BIOS, other times I went straight to Windows; upon restart from Windows, the PC would fail to restart and the keyboard light would flash while the screen would flicker in and out until a forced shutdown was required.
Getting into the BIOS proved difficult, especially updating to the latest version which caused unexpected behavior. After restarting the process, the screen remained unstable, continuously cycling on and off. I attempted to switch from HDMI to DP but never returned to the BIOS update interface. Eventually, I managed to boot into Windows directly, and via msinfo32 I confirmed the updated version was present. I ran a 3DMark test and the scores were still below previous levels after RAM replacement or BIOS updates.
The power supply unit is brand new, and all components are original. I installed Windows 11 and required an internet connection. Once inside Windows, I allowed the update to complete, followed by installing the latest NVIDIA drivers. The AMD chipset was handled directly from the Gigabyte website for my MB version rev.1.
I suspect the GPU issue has been resolved since I’ve been using it continuously for two days without any problems. The startup inconsistencies and intermittent power cycles seem to be specific to this new machine.
The riser cable is present, but I observed signs of damage (uncertain). I plan to take the system in for repairs as I’ve decided against further troubleshooting.
I would highly suspect the riser cable. This is because the 1080 ti is a PCIe 3.0 card and the 4070 ti super is a PCIe 4.0 card. A lot of risers can have problems with PCIe 4.0 cards and its much higher data throughput. A bad PCIe 4.0 riser cable can play nice with lower bandwidth cards like a 1080 ti and then cause all sorts of issues like this with a full bandwidth card. This is a known issue with lower quality controlled riser cables.
It depends on the specific issue with the riser cable. If you observed any damage, it might be due to the cable being bent or compressed, which can harm the internal copper traces. This could lead to problems like signal loss or interference.
I saw an image earlier: https://photos.app.goo.gl/bToDS3DYTaXy3t4r7
It looks like some copper is exposed, but I'm not sure if it's actual damage or if it was present when I bought the case. After building this in early June, I hadn't opened the case until a few days after the first driver error from Nvidia. That's what's causing me frustration—I don't understand what caused this.