Note: Please read the whole question. I have gathered a lot of data and included every detail I could.
I think I am going insane. I've been dealing with this problem for 1.5 months. I could send my PC in for repairs, but I really do not want to wait, so I am trying to solve it myself first. My PC black screens at random times, and the GPU fans go full blast when it happens. It happens 0 to 2 times a day and is extremely inconsistent: it happens while gaming, while watching videos, or while just sitting on the desktop, so it does not seem to be related to load.
What exactly happens:
- Random black screen.
- Sound from a game or video continues during the black screen.
- I can start/stop music with keyboard shortcuts during the black screen.
- GPU fans go to 100% during the black screen (the GPU was at 40C before the screen went black, so it does not look like a heat issue).
- I have to restart manually or wait about 10 minutes until the PC restarts on its own.
What event viewer says:
Kernel Power 41 (63)
Bugcheck 1001 - The bugcheck was: 0x00000116 (this was the first time I saw this code; it was normally 0x00000133)
Kernel EventTracing - Error setting traits on Provider {8444a4fb-d8d3-4f38-84f8-89960a1ef12f}. Error: 0xC0000001 (this shows up 10-20 times a day; I don't think it is related to the issue, but I wanted to mention it just in case)
What reliability history says:
Windows Hardware Error
Problem Event Name: LiveKernelEvent
Code: 141
Parameter 1: ffffdf82ae936460
Parameter 2: fffff8077ed43720
Parameter 3: 0
Parameter 4: 1b90
OS version: 10_0_19045
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.19045.2.0.0.768.101
Locale ID: 1033
Windows stopped working
The computer has rebooted from a bugcheck. The bugcheck was: 0x00000116 (0xffffdf82b4d46460, 0xfffff8077ed43ab0, 0xffffffffc000009a, 0x0000000000000004)
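To make the codes easier to compare, here is a minimal sketch of how the bugcheck line above could be split into its code and four parameters. The symbolic names in the table are my assumption based on Microsoft's bug check code reference; they do not appear in my logs. The function name `parse_bugcheck` is just a hypothetical helper, and treating the low 32 bits of parameter 3 as an NTSTATUS value is also an assumption on my part.

```python
import re

# Assumption: names looked up in Microsoft's bug check code reference,
# not taken from the logs themselves.
BUGCHECK_NAMES = {
    0x116: "VIDEO_TDR_FAILURE",
    0x133: "DPC_WATCHDOG_VIOLATION",
    0x141: "VIDEO_ENGINE_TIMEOUT_DETECTED",
    0x1A1: "WIN32K_CALLOUT_WATCHDOG_LIVEDUMP",
}

# Matches "The bugcheck was: 0x... (p1, p2, p3, p4)" as shown in Event Viewer.
PATTERN = re.compile(r"The bugcheck was:\s*(0x[0-9a-fA-F]+)\s*\(([^)]*)\)")


def parse_bugcheck(message):
    """Return the bugcheck code, its assumed name, and its parameters, or None."""
    match = PATTERN.search(message)
    if match is None:
        return None
    code = int(match.group(1), 16)
    params = [int(part.strip(), 16) for part in match.group(2).split(",")]
    return {
        "code": code,
        "name": BUGCHECK_NAMES.get(code, "unknown"),
        "params": params,
    }


message = (
    "The computer has rebooted from a bugcheck. The bugcheck was: 0x00000116 "
    "(0xffffdf82b4d46460, 0xfffff8077ed43ab0, 0xffffffffc000009a, "
    "0x0000000000000004)"
)

info = parse_bugcheck(message)
print(hex(info["code"]), info["name"])
# Assumption: parameter 3 is sign-extended, so only the low 32 bits are shown.
print("param 3 (low 32 bits):", hex(info["params"][2] & 0xFFFFFFFF))
```

If the names above are right, both the 0x116 bugcheck and the LiveKernelEvent 141 point at the display driver failing to recover from a timeout, which would match the symptoms, but I cannot confirm that from the logs alone.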
What I have tried so far:
- Windows reinstallation
- GPU drivers updated
- Motherboard drivers updated
- BIOS is already up to date
Now here is the weird part. The logs for this black screen + GPU fans full blast issue are inconsistent: it usually produces similar errors, but the last 2 times were different (including today's). I know few details about today's error because I was out, and the PC was sitting on the desktop when I left. When I came home it was on the password screen, which made me suspicious, so I checked the reliability history to see if the problem had happened again. I'm guessing it did.
Examples of inconsistent error logs:
1.
Today's bugcheck was 0x00000116.
However, it had always been 0x00000133 until today.
2.
Sometimes the LiveKernelEvent looks like this:
Problem Event Name: LiveKernelEvent
Code: 1a1
Parameter 1: ffffd10fe7330040
Parameter 2: 0
Parameter 3: 0
Parameter 4: 0
OS version: 10_0_19045
Service Pack: 0_0
Product: 768_1
OS Version: 10.0.19045.2.0.0.768.101
Locale ID: 1033
At other times it looks like today's error, with code 141.
3.
On one occasion I saw a huge number of nvlddmkm errors in Event Viewer. This only happened once, but the error seemed to be repeated about 50-60 times within a second.
Error from nvlddmkm:
The description for Event ID 0 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.
If the event originated on another computer, the display information had to be saved with the event.
The following information was included with the event:
\Device\000000f0
Error occurred on GPUID: 100
The message resource is present but the message was not found in the message table
4.
There are usually EventLog events like this:
The previous system shutdown at 7:56:59 PM on 5/3/2023 was unexpected.
The weird part is that the reported time seems to be wrong every single time: it is 10-40 minutes earlier than the actual black screen.
Lastly, what I can provide:
I have a reliability history dump and a full Event Viewer dump from before the Windows 10 reinstallation, plus some memory dumps of black screen crashes from the same period. I also have the dumps from the latest crash. I don't know for certain that it was the black screen issue again, but I assume so, since the Hardware Error was logged once more.