mrimo 2024 年 9 月 11 日 下午 2:26
Possibly faulty rtx 4090? Games constantly crashing
Hi All,

I have recently bought a MSI GeForce RTX 4090 GAMING X SLIM 24G and installed it into my PC.

OS: Windows 11 Version 23H2 Build 22631.4037

Motherboard: ASRock Z690 Phantom Gaming 4, Intel Z690, S 1700, DDR5, SATA3, PCIe 5.0,
CPU: INTEL CORE I9-12900K s1700
RAM: 128GB Corsair DDR5 DOMINATOR Platinum RGB Black, PC5-44800
Disk: 3x NVMe, OS / Data / Games
PSU: Corsair HX1500i Fully Modular Ultra-Low Noise ATX Power Supply - ATX 3.0 & PCIe 5.0 Compliant

I have installed multiple versions of the Geforce driver, currently installed: 561.09

Now, every time I play a game, I've tested several from, RDR2, GTA5, Sniper 5, Stranded Deep, Kingdom Come Deliverance

The game crashes after about 1-3 minutes of game play.

--
Example crash logs:
-- Example 01 ------------
Faulting application name: sniper5_dx12.exe, version: 2023.8.10.35004, time stamp: 0x64d544bd
Faulting module name: sniper5_dx12.exe, version: 2023.8.10.35004, time stamp: 0x64d544bd
Exception code: 0xc0000005
Fault offset: 0x0000000000188f64
Faulting process id: 0x0x3170
Faulting application start time: 0x0x1DB0488854CDC24
Faulting application path: C:\Program Files\WindowsApps\Rebellion.52602B995C00E_1.2.621.0_x64__2vbwqmt31j4mr\bin\sniper5_dx12.exe
Faulting module path: C:\Program Files\WindowsApps\Rebellion.52602B995C00E_1.2.621.0_x64__2vbwqmt31j4mr\bin\sniper5_dx12.exe
Report Id: 2d4fc8ba-48ba-47c1-84ec-d4e89e480a20
Faulting package full name: Rebellion.52602B995C00E_1.2.621.0_x64__2vbwqmt31j4mr
Faulting package-relative application ID: Game

-- Example 02 ------------

Faulting application name: KingdomCome.exe, version: 1.9.6.0, time stamp: 0x63aa8b6b
Faulting module name: nvgpucomp64.dll, version: 32.0.15.6094, time stamp: 0x66bc3617
Exception code: 0xc0000005
Fault offset: 0x0000000000b9069a
Faulting process id: 0x0x35EC
Faulting application start time: 0x0x1DB04860C99B19E
Faulting application path: E:\SteamLibrary\steamapps\common\KingdomComeDeliverance\Bin\Win64\KingdomCome.exe
Faulting module path: C:\Windows\System32\DriverStore\FileRepository\nvmdi.inf_amd64_33559cc6c2fd215a\nvgpucomp64.dll
Report Id: 1719e1fe-953b-46b7-9598-56bcaf3ec2cd
Faulting package full name:
Faulting package-relative application ID:

It seems when ever the GPU is used, my PC crashes - it even crashes when I run GPU benchmarking tools.

The troubleshooting steps taken:
Replaced my PSU from the Corsair 850w to the corsair 1500w
Tried older drivers - and the most latest
re-inserting the GPU into the PCi4 slot
Placing it in a PCIe5
Replacing the power cables from my PSU
replacing the adapter from the GPU to the PSU.
The adapter that inserts into the GPU splits into 3, and each one of those have a pcie powercable directly into the PSU - None of them are piggybacked.
I have re-installed Windows
Ive tried a sfc scan - no issues found


At this point, I thought I must have a faulty GPU, so I returned it back to Amazon, and bought another.

Only to get the same thing happening again. Surely I cant be unlucky enough to get two faulty cards?

Its also worth mentioning, the prior to purchasing the GPU, the PC was running Ubuntu / KVM hosting around 10-15 virtual machines, always under high load - and never crashed once.
Its also worth mentioning, if I remove the GPU and go back to the intel integrated GPU, again my PC never crashes.

Neither my CPU or GPU have been over or under clocked - these are base installs without any custom configs.

I'd really appreciate any help! - before I return a second GPU

Cheers!
< >
目前顯示第 16-21 則留言,共 21
Shaggin'Wagon 2024 年 9 月 16 日 下午 2:47 
引用自 Julien, cut it out.
You're wrong.
The PCIe lanes of the CPU are not shared on LGA1700 (Z690[i.pcmag.com]).

https://pg.asrock.com/mb/Intel/Z690%20Phantom%20Gaming%204/index.de.asp#Specification
They are, review the Z690 block diagram here: https://images.anandtech.com/doci/16970/Z690_Block_Diagram.png

The primary sockets connected to the CPU directly are either 1x16 or 2x8. If you populate the second x8 slot then it will drop the first 16x slot to 8x. In Z690 all components connected to the chipset all share bandwidth as well.
最後修改者:Shaggin'Wagon; 2024 年 9 月 16 日 下午 3:16
mrimo 2024 年 9 月 18 日 上午 11:28 
Hey, the cpu does have a iGPU but it was disabled in the bios

Now I rebuild the pc, using original components and the CPU and Ram warning lights came on.

I've since bought a msi mag z690 unify and rebuilt. And a new case - neo qube 2

Annoyingly the case can't be closed with the 4090 because the power cables pertrude past where the side of the case would be by about 10 mls, too much to bend too.

So I need to either return the case for a bigger one.

Or vertical mount the gpu which I've heard horror stories on.

I seriously regret the 4090 purchase and wish I kept the pc as a kvm host.

I don't get much time to troubleshooting due to small children but tonight's task is to look at either larger cases, or 'reliable' gpu risers and vertical mounts
Χάρης 2024 年 9 月 18 日 上午 11:53 
Had this issue with some of my games too until I updated my BIOS version.
AmaiAmai 2024 年 9 月 18 日 下午 1:59 
I highly doubt the issue had anything to do with your GPU.

It is just that playing DX12 games or ones that use compute tend to bring out issues sooner rather than later.

For future reference when you first get crashing like that, first remove all overclocks including factory ones. Remove the default overclock on the MB that many board manufacturers apply by default.

That is always step 0. It doesn't matter if the PC ran well a few days ago or whatever, because changes in Windows can make stability worsen on an overclock. Changes in BIOS firmware can do it too (that might be updated by Windows Update without your permission by the way).

Even changes in useless software like SSD monitoring software or something stupid like Corsair's stupid RGB lighting software (know to be a piece of garbage software), can cause crashes that will always appear as hardware issues.

It's funny because people pay a lot for those watercoolers with screens that show temperature or whatever nonsense and it makes their PC an unstable piece of plastic because the software's code is simply bad or terribly written. The hardware itself is sometimes incompatible with motherboards and this also can cause the boards to fail to boot, and lights to light up.

Again, if you have that kind of stuff, disconnect it and test. Delete that software and test.

Only you know what is on your PC and what is not. A CPU light or RAM one does not always mean a CPU or RAM issue, it can also mean that during the "check" for whatever reason, the motherboard software reported an issue (or the BIOS crashed). That can be linked to ANY piece of hardware, just more likely to be CPU or RAM.

Also, the other poster that mentioned the NVME that goes to the CPU had a point. Maybe try taking it out and seeing if the issue persists. Since that device is connected to the CPU,guess what? It has to be initialised at the SAME time when the CPU is checking itself. It will check those devices connected directly to it, check the voltage on said device and pins, etc. and then if its good it will move past it's init phase. If not, uh oh CPU light. If it cannot read the device type it will likely also throw an error and halt.

Now maybe that might be different on Intel, I don't know since I am only versed in ??? firmware. Still, I would check it anyway since it takes next to no time to verify. Certainly not as much time as having your PC down.
mrimo 2024 年 9 月 20 日 上午 9:57 
Well thank you for all of your comments
I have rebuilt my PC - (new motherboard) - and installed Windows on it today.

I will also look at the PCI lanes piece with me having three NVMes aswell as my GPU.

I dont really want to lose the three NVMes as I use them to host virtual servers - for work related tasks - and three NVMes adds more IOPs - maybe at the expense of GPU performance.

I will look into this.

I have also turned off the Intel Turbo Boost 3 - as I dont think my CPU officially supports it.
and I have turned off the Intel Turbo boost too - i think my CPU is now locked at a max 3.2GHz - - need to look into this further.

Also, I have turn down the settings for the fan and water cooling smart monitoring - so they kick in earlier then they previously did.

Hopefully tonight I can test some games and see how it performs.

Thanks again
76561199716878165 2024 年 9 月 20 日 下午 4:26 
as a pc tech can you try a different card in it and verify the the rest of the pc is fine? if it works you might just have a defective 40 series card,, lot of them I heard were breaking

I'm on a 1650.... since I know the human eye can only see 45 to 60 fps.... anything past a 3070 these days shows no extra performance to the human eye,, waste of money :-( I filled the ram instead...... 32 gigs, now I load stuff twice as quick almost

ps water cooling is the absolute worst thing you can think of as a tech...one tiny leak could potentially take EVERYTHING out,,, being water is conductive.....

have seen people use linseed oil..... least its not conductive,,, lol but what a mess to clean up
最後修改者:Cancer; 2024 年 9 月 20 日 下午 4:30
< >
目前顯示第 16-21 則留言,共 21
每頁顯示: 1530 50

張貼日期: 2024 年 9 月 11 日 下午 2:26
回覆: 21