Random Bluescreen Crashes On Server 2019/16 With No Consistent Reason

Elliot 1 Reputation point
2021-07-12T23:24:40.317+00:00

Hey

I've been struggling with these rather odd and entirely random Blue Screen crashes on my server. I had a hardware upgrade last month and ever sicne then there has been an endless torrent of issues. I have managed to remove a lot of them, 2 errors were due to 2 dead HDD's and another down to a broken Raid card but i still have a Blue screen which is caused by a system "Bug Check", every time a bug check runs it seems to kill the machine and each time for a different reason, from 0x01a to 0x04b and a few others. I have a selection of memory dumps from these which i am trying to understand but memory dumps are a new thing to me and I am fairly sure that i am missing most of the constructive information here.

What I have done before this is double check every running drive is free of errors , I've had memtest and the windows memoery tester run on the machine all seemingly to no avail.

The dumps can be found here - Mem Dumps

Windows Server 2019
Windows Server 2019
A Microsoft server operating system that supports enterprise-level management updated to data storage.
3,458 questions
Windows Server 2016
Windows Server 2016
A Microsoft server operating system that supports enterprise-level management updated to data storage.
2,371 questions
Windows Server
Windows Server
A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.
12,127 questions
0 comments No comments
{count} votes

5 answers

Sort by: Most helpful
  1. Jenny Feng 14,076 Reputation points
    2021-07-13T02:44:52.737+00:00

    @Elliot
    Hi,
    According to your description, Multiple different Bug Check is usually a hardware problem - even more likely on a recent build or major upgrade.

    Bug Check Code Reference
    https://learn.microsoft.com/en-us/windows-hardware/drivers/debugger/bug-check-code-reference2

    Due to security policy, community support cannot download and save user’s personal data such as dump or log files, please install WinDbg from Microsoft website and analyze crash dump files.
    https://learn.microsoft.com/en-us/windows-hardware/drivers/debugger/debugger-download-tools

    If you really need crash dump files analysis support, open a support ticket with Microsoft.

    If it is determined that it is a hardware problem, I am afraid you still need to seek help from the manufacturer.

    Hope above information can help you.

    ============================================

    If the Answer is helpful, please click "Accept Answer" and upvote it.
    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    1 person found this answer helpful.
    0 comments No comments

  2. Docs 15,141 Reputation points
    2021-07-13T03:58:48.627+00:00

    Please perform the following steps:

    1) Upgrade the BIOS: F50 > F61C
    https://www.gigabyte.com/Motherboard/B450M-DS3H-rev-1x/support#support-dl-bios

    2) Indicate when each of the RAM modules were installed

    3) Run this script using administrative Powershell > post an image or share link

    [Cmdletbinding()] 
    Param( 
        [string]$Computername = "localhost" 
    ) 
    cls 
    $PysicalMemory = Get-WmiObject -class "win32_physicalmemory" -namespace "root\CIMV2" -ComputerName $Computername 
    
    Write-Host "Memore Modules:" -ForegroundColor Green 
    $PysicalMemory | Format-Table Tag,BankLabel,@{n="Capacity(GB)";e={$_.Capacity/1GB}},Manufacturer,PartNumber,Speed -AutoSize 
    
    Write-Host "Total Memory:" -ForegroundColor Green 
    Write-Host "$((($PysicalMemory).Capacity | Measure-Object -Sum).Sum/1GB)GB" 
    
    $TotalSlots = ((Get-WmiObject -Class "win32_PhysicalMemoryArray" -namespace "root\CIMV2" -ComputerName $Computername).MemoryDevices | Measure-Object -Sum).Sum 
    Write-Host "`nTotal Memory Slots:" -ForegroundColor Green 
    Write-Host $TotalSlots 
    
    $UsedSlots = (($PysicalMemory) | Measure-Object).Count  
    Write-Host "`nUsed Memory Slots:" -ForegroundColor Green 
    Write-Host $UsedSlots 
    
    If($UsedSlots -eq $TotalSlots) 
    { 
        Write-Host "All memory slots are filled up, none is empty!" -ForegroundColor Yellow 
    } 
    

    4) Post a share link for msinfo32 saved as NFO

    .
    .
    .
    .
    .

    Please remember to vote and to mark the replies as answers if they help.

    On the bottom of each post there is:

    Propose as answer = answered the question

    On the left side of each post: Vote = a helpful post
    .
    .
    .
    .
    .

    1 person found this answer helpful.

  3. Dave Patrick 426.1K Reputation points MVP
    2021-07-12T23:26:47.13+00:00

    The server manufacturer will be your best resource to debug problems with the hardware.

    --please don't forget to upvote and Accept as answer if the reply is helpful--


  4. Dave Patrick 426.1K Reputation points MVP
    2021-07-12T23:57:26.417+00:00

    Sounds good. This doesn't mean it can't or won't work but it does leave the burden of vetting all the components and drivers to the end user to do.

    --please don't forget to upvote and Accept as answer if the reply is helpful--

    0 comments No comments

  5. Docs 15,141 Reputation points
    2021-07-14T20:19:02.447+00:00

    All of the memory dumps were downloaded and debugged.

    There were no misbehaving drivers.

    The problem is likely related to RAM, DIMM, or motherboard.

    The most likely problem is incompatible, mismatched, or malfunctioning RAM.

    Open the motherboard manual.

    Look for information about optimal DIMM when installing only one and only two RAM modules.

    View the applicable Qualified Vendor List (QVL).

    Remove all RAM modules.

    Test RAM on the QVL (evaluate computer stability / instability).

    When purchasing RAM:

    a) Select RAM modules displayed on the QVL

    b) Purchase in pairs with matching SKU

    c) Consider RAM manufacturers that guarantee compatibility

    d) Consider RAM manufacturers that have lifetime warranties

    .
    .
    .
    .
    .

    Please remember to vote and to mark the replies as answers if they help.

    On the bottom of each post there is:

    Propose as answer = answered the question

    On the left side of each post: Vote = a helpful post
    .
    .
    .
    .
    .

    0 comments No comments