Установить Steam
войти
|
язык
简体中文 (упрощенный китайский)
繁體中文 (традиционный китайский)
日本語 (японский)
한국어 (корейский)
ไทย (тайский)
Български (болгарский)
Čeština (чешский)
Dansk (датский)
Deutsch (немецкий)
English (английский)
Español - España (испанский)
Español - Latinoamérica (латиноам. испанский)
Ελληνικά (греческий)
Français (французский)
Italiano (итальянский)
Bahasa Indonesia (индонезийский)
Magyar (венгерский)
Nederlands (нидерландский)
Norsk (норвежский)
Polski (польский)
Português (португальский)
Português-Brasil (бразильский португальский)
Română (румынский)
Suomi (финский)
Svenska (шведский)
Türkçe (турецкий)
Tiếng Việt (вьетнамский)
Українська (украинский)
Сообщить о проблеме с переводом
then set bios to defaults, and disconnect all drives
and boot from the usb stick to test
then enable xmp/docp to see if thats the cause
One thing I forgot to mention is that I tune my GPU settings a bit: https://imgur.com/UVGoK6V
Normally, the minimum frequency default is set to only something like 600 MHz, which it is recommended to raise about 100 MHz less than the max frequency. I also tend to lower the voltage from 1150 mV to 1100 mV, however, I'm leaving that unchanged currently in case that was a cause for the issue (sudden loss of power?)
For VRAM tuning: Memory timing I set to fast and Max Frequency I set to 2180 MHz from 2000 MHz.
I don't think these should cause the issue, because its barely an overclock, bur rather just tuning. When my computer crashed like it does, it reverts the tuning controls to default, which is why it gave me suspicion of this, as well.
Try setting Adrenalin back to stock. My 7800 XT is new to me so I'm not familiar with what are common sweet spot values for tuning these RDNA2/3 chips but one of the first rules of troubleshooting when having issues is putting things back to stock/default to see if it's a variable. That's where you're at here. You need to reduce variables to remove guess work as to what's causing it. RAM is included in this set it to stock part so I agree with the above advice to test it and perhaps with XMP disabled.
Set Adrenalin to default. You could even play with uninstalling it, running DDU in safe mode, and then reinstalling the drivers/Adrenalin. Wouldn't be a bad idea just to rule that out while you're at it.
Test RAM with MeTest86.
Maybe disable XMP.
These would be your first steps. You can also do further stuff like running OCCT tests or try some combination of the OCCT CPU test or Prime 95 (one or the other for the CPU) along with Furmark (for the GPU).
Hmm okay. Should I bother upgrading to windows 11 or nah? I've kept it on 10 the past few years, but I can always just switch over for free.
I had someone suggest clearing CMOS in BIOS but I'd have to remove the CMOS battery from the motherboard which sounds like a pain.
I'm pretty confidend my RAM is good, but I can always run memtestx86 to be sure. Maybe I can do some sort of stress test that draws a lot of power from the PSU? Just need a way to replicate the crash, since once every 2 weeks or so is not good enough for diagnosis of issue.
I think of all the things mentioned, I think setting Adrenalin to default, and doing tests with the RAM (and trying XMP off if need be) are you most efficient one and two steps.
It's a process of elimination.
There's methods to replicate demanding conditions. They aren't exact but they might speed it up. That's why I mentioned OCCT (tests for a lot of parts), prime 95 (CPU loading) and Furmark (GPU loading).
Right. Because a failing GPU I'd imagine would be a bit more obvious or at least more impactful on my experience. I had an old GPU years ago and things like visual artifacts/visual screen cracks with bright pink/black textures would occur as well as crashing of course.
Is there a way I can monitor the PSU? I heard I can download a program somewhere to test for the voltage fluctuations and whatnot.
Again, swapping out parts or doing anything major like that may not be worth it at the moment, since the crash happens rarely. Maybe I can monitor some things in the mean time.
EDIT: Also, I wanted to ask - could PBO be a possible cause? Like I said, it's enabled by default since it's the 5600X.
Online I've read people suggest that it could be how the voltages are set in BIOS for your CPU, etc.
Also, I have some old 2TB harddisk drive attached to my mobo. I really only use it for old games that don't benefit from an SSD... could this cause the issue too? I just heard somewhere that jank HDDs can cause problems.
I changed to a 7800 XT and it was pretty good, except after a few days my display went Black, sound continued for a few seconds, then went stop and go, and then the PC was back at the POST screen restarting. Sometimes it would work fine for days, others it would crash a few times a day. When it worked, there was zero issues. No artifacts, temperatures were low, absolutely nothing looked wrong.
I even doubted it was the graphics card because of that, and because playing with CPU or RAM things made it more or less severe (but never made it go away). But removing the RX 7800 XT made it go away, and the system was stable otherwise.
Did an RMA and so far this new one hasn't done it once.
This isn't to say yours is the graphics card. This is merely saying don't entirely rule it out because it seems to work fine when it does work.
To reiterate myself from earlier, if the issue persists with things at stock, then I think the PSU and graphics card are the first two I'd blind guess at. RAM and then motherboard after that.
PBO can be involved, yes. That affects the voltage/frequency curve of the CPU. Something I forgot to mention earlier when asking if you had Event ID 18 was to also ask for you to post a few of them. If it's reporting the cache hierarchy error as yours is, then see if the SAME core (APIC ID) is listed. If it's the same one, it heavily does suggest you have a particularly unstable CPU core. If the APIC ID is always different, however, then it's probably NOT the CPU (but still can be) and its just different cores catching the machine check exception condition. That Event ID means "this CPU core caught an uncorrectable hardware error (MCE)".
Disabling PBO and XMP will help rule out things like CPU, Infinity Fabric, or just bad RAM. Again, all to stock.
You can use HwInfo64 or GPU-Z to monitor voltages in the background while gaming. I suggested that in an earlier reply. I did this when troubleshooting my own issue and the voltages stayed consistent which further led me back to suspecting the GPU. The only thing is the voltages might drop too fast to be caught before the CPU does its thing and triggers the restart but it's still worth doing because if it does report they drop, it's a good sign the PSU is a likelier suspect.
Right. In your case, it sounds like the crashes at least happened more frequently. It has obviously not crashed again since making this post and I've yet to turn everything to stock yet. If it happened more often, it may be easier to narrow down.
There were 2 event 18 ID when the crash occurred the other day, both having different Processor APIC IDs - one is 0 the other is 6. I can't go back further than 7 days to fetch the last crash that happened a few weeks ago, but I remember checking Processor APIC ID in the past and it was always different.
Also, you mentioned using hwinfo and GPU-Z to monitor voltages - is there a way I can have it save as logs? That way, I can go into hwinfo or GPU-Z and check what happened before the crash -- similar to how event viewer works. I'm also not sure if its possible to monitor a PSU output.
Not saying it can't be the GPU, yeah. If anything, it's less of a pain in the ass replacing a GPU rather than mobo or CPU...
One thing I will mention: Before I upgraded to this PC over a year ago, it was a pre-built PC. It came with an ASRock B450M/ac mobo, 8GB ADATA DDR4 (which I expanded), RX 580 GPU, Ryzen 5 3600 CPU, 240GB SATA SSD (which I still use, but is not my boot drive).
Over time, I replaced ALL of the components in the case, EXCEPT the PSU, Case and the remaining harddrives I use just for extra storage - boot drive is a new m.2 NVMe. Keeping the same case and PSU made building the PC and new mobo super easy. The mobo was the very last part I installed.
However, I actually had an issue where I would still bluescreen occasionally (literal BOSD error message, not like this black screen I'm getting), but it would happen infrequently like it does now. It happened with the very pre-built from the beginning, up until I replaced everything but the mobo. Replacing the mobo made me think in my head "okay, that was probably the issue." So, in a way, I narrowed it down to the PSU... BUT the crash happens in a different way and I cannot remember what those event ID errors were back then, 3-4 years ago. It just seems quite unlikely since it's a good brand (EVGA) and I made sure it is a real 80+ Gold unit.
I wouldn't conflate the earlier BSOD with the seemingly MCE issue you're having now. They seem to be different things and saying "I had a different issue once upon a time and it could be related to one I'm having now" doesn't help. Believe me, I know that all too well. Just focus on the issue you're having now. Even if there's a link, it's better to focus on addressing the here and now issue as it is currently presenting itself.