Share via

BSOD BugCheckCode 278 - on third video card

Anonymous
2013-10-05T20:58:27+00:00

My entire pc has been crashing often. It started off quite slow (every few days) and increased to 7+ times per day. It originally only happened while playing World of Warcraft and increased to happening during other games.

I used to run 2 video cards with SLI - GTX 570s. I have RMA'd both cards. Currently only have 1 card in (haven't received second new one yet). Whenever I remove 1 card, I switch my nvidia control panel to turn SLI off. Basically, I'm on my THIRD video card. I installed this current one yesterday and within 4 hours of install, crashed again.

When it crashes, sometimes the screen is a bright green, sometimes a salmon colour, sometimes it's been grey, bright pink, sometimes that rainbow/snow look.

I have memory dumps turned on but they don't save. My main drive that holds windows is a Solid State Drive. I have all my video games on a second drive which is not an SSD and at some point I tried having memory dump save to the second drive and despite being set to my second drive, it wouldn't save. Since then, it won't save even to it's default location.

Since it won't save memory dumps, I can't use the WhoCrashed program, however, in the past before memory dumps stopped saving, it always came up as Nvidia drivers that crashed.

All drivers are up to date. I even tried letting things update beta drivers, that didn't make any difference. As far as I know, power supply is strong enough to handle 2 cards, never mind just 1.

Specs:

OS: Windows 7, 64-Bit

Ram: 16 GB Corsair Vengeance

PSU: Corsair AX 1200 - ATX Power Supply

Mother Board: P8Z68-V Pro Gen 3

CPU: i5-2500k Sandy Bridge

Error:

EventData


BugcheckCode 278

BugcheckParameter1 0xfffffa80118641a0

BugcheckParameter2 0xfffff88005f83f50

BugcheckParameter3 0xffffffffc000009a

BugcheckParameter4 0x4

SleepInProgress false

PowerButtonTimestamp 0

At this point I'm wondering what else can cause this issue. It's always the 278 code. Any research I did told me that it was a video card which was the reason I RMA'd both cards. EVGA seemed to agree. Could it be my motherboard? Could it be cables?

Really at a loss here.  I originally posted this a day or 2 ago on other tech support forums and haven't even seen a single response yet, so I'm trying here.  At this point, it's crashing often again.

Thanks, hoping someone can help!

Note:  DxDiag is too big even for it's own post :/

Windows for home | Previous Windows versions | Devices and drivers

Locked Question. This question was migrated from the Microsoft Support Community. You can vote on whether it's helpful, but you can't add comments or replies or follow the question.

0 comments No comments

Answer accepted by question author

  1. Anonymous
    2013-10-11T02:43:45+00:00

    Hi Sidney,

    Very glad to hear, good work! If and when you are comfortable and feel your issue has been solved, I'd recommend marking the post of mine that answered your question so this thread no longer shows up as requiring an answer.

    Regards,

    Patrick

    Was this answer helpful?

    0 comments No comments

12 additional answers

Sort by: Most helpful
  1. Anonymous
    2013-10-07T19:28:07+00:00

    Ok well if it's just generic drivers, then it still is crashing without nvidia drivers.  How do I see what drivers are causing the crash?  The text is too big/long to fit on my monitor (27in monitor).  It doesn't display enough information for me to see which one it is.  I don't know how to gather the information that Driver Verifier grabs.

    Was this answer helpful?

    0 comments No comments
  2. Anonymous
    2013-10-07T19:13:05+00:00

    Hi,

    In regards to nVidia, that's Windows installing the generic drivers. Navigate to nVidia's website and update them for there.

    Regards,

    Patrick

    Was this answer helpful?

    0 comments No comments
  3. Anonymous
    2013-10-07T16:26:02+00:00

    Okay, I let memtest run 8 passes and it took about 20 hours, no errors showed up.

    I don't know if I have the latest firmware for my SSD since I don't know how to check/update that, but I will google it and figure it out.

    In the mean time, moving on to testing with Seagate tools.  Thanks for the reply, it will take me a few days to work through all this I think!

    Update:

    • SSD Firmware - I am unable to update my firmware on the SSD because crucial has not bothered to create the UEFI firmware they said they would back in April.
    • SeaTools - I avoided running the tests you specified, so I have run S.M.A.R.T Check, Short Drive Self Test, Short Generic.  All these passed.  Are they supposed to be done in a matter of seconds, though?  They finished incredibly fast, checked out in green text and said "Pass".  Just seems quite fast.
    • System File Checker - No integrity violations.
    • Drive Verifier - It BSOD's constantly now.  I think it's the nvidia drivers.  I uninstalled all nvidia related drivers and it keeps auto installing the drivers every time the computer restarts.  I ran driver cleaner pro to make sure it uninstalled all the bits and pieces but it doesn't matter since Plug n Play won't give me the option to disable it.  So every time the pc restarts, drivers are back.  Any thoughts?

    Was this answer helpful?

    0 comments No comments
  4. Anonymous
    2013-10-05T21:18:04+00:00

    Hi,

    Have you updated the firmware to the latest version for your SSD?

    Bugcheck 278 = 0x116 - VIDEO_TDR_ERROR

    Ensure you have the latest video card drivers. If you are already on the latest video card drivers, uninstall and install a version or a few versions behind the latest to ensure it's not a latest driver only issue. If you have already experimented with the latest video card driver and many previous versions, please give the beta driver for your card a try.

    The basic definition of a 0x116 bugcheck is:

    There may be a bug in the video driver or video hardware.

    So, let me now explain what VIDEO_TDR_ERROR means. First off, TDR is an acronym for 'Timeout Detection and Recovery'. Timeout Detection and Recovery was introduced in Vista and carried over to Windows 7. Rather than putting exactly what Timeout Detection and Recovery does exactly, I'll just directly quote the MSDN article!

    Timeout detection:

    The GPU scheduler, which is part of the DirectX graphics kernel subsystem (Dxgkrnl.sys), detects that the GPU is taking more than the permitted amount of time to execute a particular task. The GPU scheduler then tries to preempt this particular task. The preempt operation has a "wait" timeout, which is the actual TDR timeout. This step is thus the timeout detection phase of the process. The default timeout period in Windows Vista and later operating systems is 2 seconds. If the GPU cannot complete or preempt the current task within the TDR timeout period, the operating system diagnoses that the GPU is frozen.

    To prevent timeout detection from occurring, hardware vendors should ensure that graphics operations (that is, DMA buffer completion) take no more than 2 seconds in end-user scenarios such as productivity and game play.

    Preparation for recovery:

    The operating system's GPU scheduler calls the display miniport driver's**DxgkDdiResetFromTimeout** function to inform the driver that the operating system detected a timeout. The driver must then reinitialize itself and reset the GPU. In addition, the driver must stop accessing memory and should not access hardware. The operating system and the driver collect hardware and other state information that could be useful for post-mortem diagnosis.

    Desktop recovery:

    The operating system resets the appropriate state of the graphics stack. The video memory manager, which is also part ofDxgkrnl.sys, purges all allocations from video memory. The display miniport driver resets the GPU hardware state. The graphics stack takes the final actions and restores the desktop to the responsive state. As previously mentioned, some legacy DirectX applications might render just black at the end of this recovery, which requires the end user to restart these applications. Well-written DirectX 9Ex and DirectX 10 and later applications that handle Device Remove technology continue to work correctly. An application must release and then recreate its Direct3D device and all of the device's objects. For more information about how DirectX applications recover, see the Windows SDK.

     Article here.

    With this being said, if Timeout Detection and Recovery fails to recover the display driver, it will then shoot the 0x116 bugcheck. There are many different things that can cause a 0x116, which I will explain below:

    The following hardware issues can cause a TDR event:

    1. Unstable overclock (CPU, GPU, etc). Revert all and any overclocks to stock settings.

    2. Bad sector in memory resulting in corrupt data being communicated between the GPU and the system (video memory otherwise known as VRAM or physical memory otherwise known as RAM).

    GPU testing: Furmark, run for ~15 minutes and watch temperatures to ensure there's no overheating and watch for artifacts.

    RAM testing: Memtest - Refer to the below:

    Memtest:

    Memtest86+:

    Download Memtest86+ here:

    http://www.memtest.org/

    Which should I download?

    You can either download the pre-compiled ISO that you would burn to a CD and then boot from the CD, or you can download the auto-installer for the USB key. What this will do is format your USB drive, make it a bootable device, and then install the necessary files. Both do the same job, it's just up to you which you choose, or which you have available (whether it's CD or USB).

    How Memtest works:

    Memtest86 writes a series of test patterns to most memory addresses, reads back the data written, and compares it for errors.

    The default pass does 9 different tests, varying in access patterns and test data. A tenth test, bit fade, is selectable from the menu. It writes all memory with zeroes, then sleeps for 90 minutes before checking to see if bits have changed (perhaps because of refresh problems). This is repeated with all ones for a total time of 3 hours per pass.

    Many chipsets can report RAM speeds and timings via SPD (Serial Presence Detect) or EPP (Enhanced Performance Profiles), and some even support changing the expected memory speed. If the expected memory speed is overclocked, Memtest86 can test that memory performance is error-free with these faster settings.

    Some hardware is able to report the "PAT status" (PAT: enabled or PAT: disabled). This is a reference to Intel Performance acceleration technology; there may be BIOS settings which affect this aspect of memory timing.

    This information, if available to the program, can be displayed via a menu option.

    Any other questions, they can most likely be answered by reading this great guide here:

    http://forum.canardpc.com/threads/28864-FAQ-please-read-before-posting

    3. Corrupt hard drive or Windows install / OS install resulting in corruption to the registry or page file.

    HDD diagnostics: Seatools - Refer to the below:

    http://www.seagate.com/support/downloads/seatools/

    You can run it via Windows or DOS. Do note that the only difference is simply the environment you're running it in. In Windows, if you are having what you believe to be device driver related issues that may cause conflicts or false positive, it may be a wise decision to choose the most minimal testing environment (DOS).

    Run all tests EXCEPT: Fix All, Long Generic, and anything Advanced.

    To reset your page file, follow the instructions below:

    a ) Go to Start...Run...and type in "sysdm.cpl" (without the quotes) and press Enter.

    • Then click on the Advanced tab,
    • Then on the Performance Settings Button,
    • Then on the next Advanced tab,
    • Then on the Virtual Memory Change button.

    b ) In this window, note down the current settings for your pagefile (so you can restore them later on).

    -Then click on the "No paging file" radio button, and

    • then on the "Set" button. Be sure, if you have multiple hard drives, that you ensure that the paging file is set to 0 on all of them.

    -Click OK to exit the dialogs.

    c ) Reboot (this will remove the pagefile from your system)

    d ) Then go back in following the directions in step a ) and re-enter the settings that you wrote down in step

    b ). Follow the steps all the way through (and including) the reboot.

    e ) Once you've rebooted this second time, go back in and check to make sure that the settings are as they're supposed to be.

    Run System File Checker:

    SFC.EXE /SCANNOW

    Go to Start and type in "cmd.exe" (without the quotes)

    At the top of the search box, right click on the cmd.exe and select "Run as adminstrator"

    In the black window that opens, type "SFC.EXE /SCANNOW" (without the quotes) and press Enter.

    Let the program run and post back what it says when it's done. 

    • Overheating of the CPU or GPU and or other components can cause 0x116 bugchecks. Monitor your temperatures and ensure the system is cooled adequately.
    • GPU failure. Whether it's heat, power issue (PSU issue), failing VRAM, etc.

    The following software issues can cause a TDR event:

    • Incompatible drivers of any sort
    • Messy / corrupt registry
    • Corrupt Direct X - http://support.microsoft.com/kb/179113
    • Corrupt system files (run System File Checker as advised above)
    • Buggy and or corrupt 3rd party drivers. If you suspect a 3rd party driver being the issue, enable Driver Verifier:

    Driver Verifier:

    What is Driver Verifier?

    Driver Verifier is included in Windows 8, 7, Windows Server 2008 R2, Windows Vista, Windows Server 2008, Windows 2000, Windows XP, and Windows Server 2003 to promote stability and reliability; you can use this tool to troubleshoot driver issues. Windows kernel-mode components can cause system corruption or system failures as a result of an improperly written driver, such as an earlier version of a Windows Driver Model (WDM) driver.

    Essentially, if there's a 3rd party driver believed to be at issue, enabling Driver Verifier will help flush out the rogue driver if it detects a violation.

    Before enabling Driver Verifier, it is recommended to create a System Restore Point:

    Vista - START | type rstrui - create a restore point

    Windows 7 - START | type create | select "Create a Restore Point"

    Windows 8 - http://www.eightforums.com/tutorials/4690-restore-point-create-windows-8-a.html

    How to enable Driver Verifier:

    Start > type "verifier" without the quotes > Select the following options -

    1. Select - "Create custom settings (for code developers)"
    2. Select - "Select individual settings from a full list"
    3. Check the following boxes -
    • Special Pool
    • Pool Tracking
    • Force IRQL Checking
    • Deadlock Detection
    • Security Checks (Windows 7)
    • Concurrentcy Stress Test (Windows 8)
    • DDI compliance checking (Windows 8)
    • Miscellaneous Checks
    1. Select  - "Select driver names from a list"
    2. Click on the "Provider" tab. This will sort all of the drivers by the provider.
    3. Check EVERY box that is [B]NOT[/B] provided by Microsoft / Microsoft Corporation.
    4. Click on Finish.

     8.    Restart.

    Important information regarding Driver Verifier:

    • If Driver Verifier finds a violation, the system will BSOD.
    • After enabling Driver Verifier and restarting the system, depending on the culprit, if for example the driver is on start-up, you may not be able to get back into normal Windows because Driver Verifier will flag it, and as stated above, that will cause / force a BSOD.

    If this happens, do not panic, do the following:

    • Boot into Safe Mode by repeatedly tapping the F8 key during boot-up.
    • Once in Safe Mode - Start > type "system restore" without the quotes.
    • Choose the restore point you created earlier.

    If you did not set up a restore point, do not worry, you can still disable Driver Verifier to get back into normal Windows:

    • Start > Search > type "cmd" without the quotes.
    • To turn off Driver Verifier, type in cmd "verifier /reset" without the quotes.

    ・    Restart and boot into normal Windows.

    How long should I keep Driver Verifier enabled for?

    It varies, many experts and analysts have different recommendations. Personally, I recommend keeping it enabled for at least 24 hours. If you don't BSOD by then, disable Driver Verifier.

    My system BSOD'd, where can I find the crash dumps?

    They will be located in %systemroot%\Minidump

    Any other questions can most likely be answered by this article:

    http://support.microsoft.com/kb/244617

    Regards,

    Patrick

    Was this answer helpful?

    0 comments No comments