Occasional BSOD - Windows 8.1 x64

Will

Senior Administrator
Staff member
Joined
Mar 4, 2012
Posts
8,226
Location
%tmp%
My 4/5 year old laptop is starting to BSOD. This may or may not be an issue (it's only happened a few times), but I'm a bit concerned that it might be a piece of hardware failing which I won't be able to replace any time soon. I haven't taken any steps yet other than running the jcgriff2 app, but I'll check for driver updates.

Occasional BSOD when coming out of hibernation - the video driver tends to be mentioned when that happens. Today it also BSOD'ed whilst I was using the PC, something I've never seen it do before. No driver was listed.

· OS - Windows 8.1, 8, 7, Vista ?
Windows 8.1 Pro with Media Center

· x86 (32-bit) or x64 ?
x64

· What was original installed OS on system?
Windows 7

· Is the OS an OEM version (came pre-installed on system) or full retail version (YOU purchased it from retailer)?
Windows 8 upgrade bought direct from Microsoft.

· Age of system (hardware)
About 4/5 years at this point, maybe slightly newer. It must be at least 4 years old, but I don't know exactly. The hard drive is newer and has been replaced before.

· Age of OS installation - have you re-installed the OS?
Windows 8 upgraded when it came out - I haven't reinstalled since.

· CPU
Intel i5 M520 @2.40GHz

· Video Card
ATI Mobility Radeon HD 5850

· MotherBoard
Alienware M15x x- BIOS AO8. Not sure if there is a more specific model number.

· System Manufacturer
Dell/Alienware

· Exact model number (if laptop, check label on bottom)
Alienware M15x

· Laptop or Desktop?
Laptop.


----------------------

Perfmon failed to run. Gives me this error:
The operator or administrator has refused the request.

Perfmon was run using my normal administrator account, with "Run As Administrator" selected.

Collection app files attached.
 

Attachments

Hi Will,

Among all of the crash dumps, we have two bug checks:

ATTEMPTED_EXECUTE_OF_NOEXECUTE_MEMORY (fc)

This indicates that an attempt was made to execute non-executable memory.

-- PROCESS_NAME: 2433172.exe

^^ Bit of a strange image name, is this a familiar process? Google shows it's in relation to Mini Golf.

BugCheck FC, {fffff80006a67008, 80000002262d2863, ffffd000244a2700, 2}

Code:
3: kd> ln fffff80006a67008
Unable to load image \SystemRoot\system32\DRIVERS\[COLOR=#ff0000]75802976.sys[/COLOR], Win32 error 0n2
*** WARNING: Unable to verify timestamp for [COLOR=#ff0000]75802976.sys[/COLOR]
*** ERROR: Module load completed but symbols could not be loaded for [COLOR=#ff0000]75802976.sys[/COLOR]

^^ The driver that attempted to execute non-executable memory is 75802976.sys. No idea what this driver is in relation to/part of, and with an image name like this, possible malware. If your .exe above is not in fact in relation to Mini Golf, it may be a part of this driver.

VIDEO_TDR_FAILURE (116)

This indicates that an attempt to reset the display driver and recover from a timeout failed.


So, let me now explain what VIDEO_TDR_ERROR means. First off, TDR is an acronym for 'Timeout Detection and Recovery'. Timeout Detection and Recovery was introduced in Vista and carried over to Windows 7. Rather than putting exactly what Timeout Detection and Recovery does exactly, I'll just directly quote the MSDN article!

Timeout detection:
The GPU scheduler, which is part of the DirectX graphics kernel subsystem (Dxgkrnl.sys), detects that the GPU is taking more than the permitted amount of time to execute a particular task. The GPU scheduler then tries to preempt this particular task. The preempt operation has a "wait" timeout, which is the actual TDR timeout. This step is thus the timeout detection phase of the process. The default timeout period in Windows Vista and later operating systems is 2 seconds. If the GPU cannot complete or preempt the current task within the TDR timeout period, the operating system diagnoses that the GPU is frozen.
To prevent timeout detection from occurring, hardware vendors should ensure that graphics operations (that is, DMA buffer completion) take no more than 2 seconds in end-user scenarios such as productivity and game play.
Preparation for recovery:
The operating system's GPU scheduler calls the display miniport driver's DxgkDdiResetFromTimeout function to inform the driver that the operating system detected a timeout. The driver must then reinitialize itself and reset the GPU. In addition, the driver must stop accessing memory and should not access hardware. The operating system and the driver collect hardware and other state information that could be useful for post-mortem diagnosis.
Desktop recovery:
The operating system resets the appropriate state of the graphics stack. The video memory manager, which is also part of Dxgkrnl.sys, purges all allocations from video memory. The display miniport driver resets the GPU hardware state. The graphics stack takes the final actions and restores the desktop to the responsive state. As previously mentioned, some legacy DirectX applications might render just black at the end of this recovery, which requires the end user to restart these applications. Well-written DirectX 9Ex and DirectX 10 and later applications that handle Device Remove technology continue to work correctly. An application must release and then recreate its Direct3D device and all of the device's objects. For more information about how DirectX applications recover, see the Windows SDK.

With this being said, if Timeout Detection and Recovery fails to recover the display driver, it will then shoot the 0x116 bugcheck. There are many different things that can cause a 0x116, which I will explain below:

1. Ensure you have the latest video card drivers. If you are already on the latest video card drivers, uninstall and install a version or a few versions behind the latest to ensure it's not a latest driver only issue. If you have already experimented with the latest video card driver and many previous versions, please give the beta driver for your card a try.

2. In your loaded drivers list, dtsoftbus01.sys is listed which is the Daemon Tools driver. Daemon Tools is a very popular cause of BSOD's in 7/8 based systems. Please uninstall Daemon Tools. Alternative imaging programs are: MagicISO, Power ISO, etc.

3.

Code:
3: kd> lmvm rixdpx64
start             end                 module name
fffff800`03380000 fffff800`033d7000   rixdpx64   (deferred)             
    Image path: \SystemRoot\System32\drivers\rixdpx64.sys
    Image name: rixdpx64.sys
    Timestamp:        Fri Nov 17 23:07:46 [COLOR=#ff0000][U][I][B]2006[/B][/I][/U][/COLOR]

^^ Ricoh Memory Card Reader driver, dated from 2006. Way too old of a device driver to function with W8/8.1, so please update ASAP if possible or uninstall the driver and disable the device.

The following hardware issues can cause a TDR event:

1. Unstable overclock (CPU, GPU, etc). Revert all and any overclocks to stock settings.

2. Bad sector in memory resulting in corrupt data being communicated between the GPU and the system (video memory otherwise known as vRAM or physical memory otherwise known as RAM).

GPU testing: Furmark, run for ~15 minutes and watch temperatures to ensure there's no overheating and watch for artifacts.

RAM testing: Memtest (RUN FOR NO LESS THAN ~8 PASSES) - Refer to the below:

Memtest:

Memtest86+:

Download Memtest86+ here:

Memtest86+ - Advanced Memory Diagnostic Tool

Which should I download?

You can either download the pre-compiled ISO that you would burn to a CD and then boot from the CD, or you can download the auto-installer for the USB key. What this will do is format your USB drive, make it a bootable device, and then install the necessary files. Both do the same job, it's just up to you which you choose, or which you have available (whether it's CD or USB).

Do note that some older generation motherboards do not support USB-based booting, therefore your only option is CD (or Floppy if you really wanted to).

How Memtest works:

Memtest86 writes a series of test patterns to most memory addresses, reads back the data written, and compares it for errors.

The default pass does 9 different tests, varying in access patterns and test data. A tenth test, bit fade, is selectable from the menu. It writes all memory with zeroes, then sleeps for 90 minutes before checking to see if bits have changed (perhaps because of refresh problems). This is repeated with all ones for a total time of 3 hours per pass.

Many chipsets can report RAM speeds and timings via SPD (Serial Presence Detect) or EPP (Enhanced Performance Profiles), and some even support changing the expected memory speed. If the expected memory speed is overclocked, Memtest86 can test that memory performance is error-free with these faster settings.

Some hardware is able to report the "PAT status" (PAT: enabled or PAT: disabled). This is a reference to Intel Performance acceleration technology; there may be BIOS settings which affect this aspect of memory timing.

This information, if available to the program, can be displayed via a menu option.

Any other questions, they can most likely be answered by reading this great guide here:

FAQ : please read before posting
3. Corrupt hard drive or Windows install / OS install resulting in corruption to the registry or page file.

HDD diagnostics: Seatools - Refer to the below:

SeaTools | Seagate

You can run it via Windows or DOS. Do note that the only difference is simply the environment you're running it in. In Windows, if you are having what you believe to be device driver related issues that may cause conflicts or false positive, it may be a wise decision to choose the most minimal testing environment (DOS).

Run all tests EXCEPT: Fix All, Long Generic, and anything Advanced.

To reset your page file, follow the instructions below:

a ) Go to Start...Run...and type in "sysdm.cpl" (without the quotes) and press Enter.

- Then click on the Advanced tab,
- Then on the Performance Settings Button,
- Then on the next Advanced tab,
- Then on the Virtual Memory Change button.

b ) In this window, note down the current settings for your pagefile (so you can restore them later on).

-Then click on the "No paging file" radio button, and

- then on the "Set" button. Be sure, if you have multiple hard drives, that you ensure that the paging file is set to 0 on all of them.

-Click OK to exit the dialogs.

c ) Reboot (this will remove the pagefile from your system)

d ) Then go back in following the directions in step a ) and re-enter the settings that you wrote down in step

b ). Follow the steps all the way through (and including) the reboot.

e ) Once you've rebooted this second time, go back in and check to make sure that the settings are as they're supposed to be.

Run System File Checker:

SFC.EXE /SCANNOW

Go to Start and type in "cmd.exe" (without the quotes)

At the top of the search box, right click on the cmd.exe and select "Run as adminstrator"

In the black window that opens, type "SFC.EXE /SCANNOW" (without the quotes) and press Enter.

Let the program run and post back what it says when it's done.

- Overheating of the CPU or GPU and or other components can cause 0x116 bugchecks. Monitor your temperatures and ensure the system is cooled adequately.

- GPU failure- Heat, power issue (PSU issue), faulty vRAM, etc.

The following software issues can cause a TDR event:

- Incompatible drivers of any sort

- Messy / corrupt registry

- Corrupt Direct X - How to install the latest version of DirectX

- Corrupt system files (run System File Checker as advised above)

- Buggy and or corrupt 3rd party drivers. If you suspect a 3rd party driver being the issue, enable Driver Verifier:

Driver Verifier:

What is Driver Verifier?

Driver Verifier is included in Windows 8/8.1, 7, Windows Server 2008 R2, Windows Vista, Windows Server 2008, Windows 2000, Windows XP, and Windows Server 2003 to promote stability and reliability; you can use this tool to troubleshoot driver issues. Windows kernel-mode components can cause system corruption or system failures as a result of an improperly written driver, such as an earlier version of a Windows Driver Model (WDM) driver.

Essentially, if there's a 3rd party driver believed to be at issue, enabling Driver Verifier will help flush out the rogue driver if it detects a violation.

Before enabling Driver Verifier, it is recommended to create a System Restore Point:

Vista - START | type rstrui - create a restore point
Windows 7 - START | type create | select "Create a Restore Point"
Windows 8 - Restore Point - Create in Windows 8

How to enable Driver Verifier:

Start > type "verifier" without the quotes > Select the following options -

1. Select - "Create custom settings (for code developers)"
2. Select - "Select individual settings from a full list"
3. Check the following boxes -
- Special Pool
- Pool Tracking
- Force IRQL Checking
- Deadlock Detection
- Security Checks (Windows 7 & 8)
- DDI compliance checking (Windows 8)
- Miscellaneous Checks
4. Select - "Select driver names from a list"
5. Click on the "Provider" tab. This will sort all of the drivers by the provider.
6. Check EVERY box that is NOT provided by Microsoft / Microsoft Corporation.
7. Click on Finish.
8. Restart.

Important information regarding Driver Verifier:

- If Driver Verifier finds a violation, the system will BSOD. To expand on this a bit more for the interested, specifically what Driver Verifier actually does is it looks for any driver making illegal function calls. When and/if this happens, system corruption occurs if allowed to continue. When Driver Verifier is enabled, it is monitoring all 3rd party drivers (as we have it set that way) and when it catches a driver attempting to do this, it will quickly flag that driver as being a troublemaker, and bring down the system safely before any corruption can occur.

- After enabling Driver Verifier and restarting the system, depending on the culprit, if for example the driver is on start-up, you may not be able to get back into normal Windows because Driver Verifier will detect it in violation almost straight away, and as stated above, that will cause / force a BSOD.

If this happens, do not panic, do the following:

- Boot into Safe Mode by repeatedly tapping the F8 key during boot-up.

- Once in Safe Mode - Start > Search > type "cmd" without the quotes.

- To turn off Driver Verifier, type in cmd "verifier /reset" without the quotes.
・ Restart and boot into normal Windows.

If your OS became corrupt or you cannot boot into Windows after disabling verifier via Safe Mode:

- Boot into Safe Mode by repeatedly tapping the F8 key during boot-up.

- Once in Safe Mode - Start > type "system restore" without the quotes.

- Choose the restore point you created earlier.

-- Note that Safe Mode for Windows 8 is a bit different, and you may need to try different methods: 5 Ways to Boot into Safe Mode in Windows 8 & Windows 8.1

How long should I keep Driver Verifier enabled for?

I recommend keeping it enabled for at least 24 hours. If you don't BSOD by then, disable Driver Verifier. I will usually say whether or not I'd like for you to keep it enabled any longer.

My system BSOD'd with Driver Verifier enabled, where can I find the crash dumps?

They will be located in %systemroot%\Minidump

Any other questions can most likely be answered by this article:
Using Driver Verifier to identify issues with Windows drivers for advanced users

Regards,

Patrick
 
Hi Patrick,

75802976.sys is a Kaspersky Lab driver. I didn't realise I'd run the program I was testing out at the same time as the BSOD - if that was the cause of that one there is no issue there.

The latest driver update for my graphics card was in 2010, so I have the latest. The memory card reader functions fine.

I'll run Furmark and Memtest - there might not be any real issue here, especially if the non-executable memory BSOD was caused by a Kaspersky driver.
 
That definitely makes sense then, regarding the Kaspersky driver.

In any case, keep me updated.

Regards,

Patrick
 
Hi Will,

I would also recommend uninstalling Daemon Tools/Alcohol 120 for the duration -- until the BSOD epidemic ceases.

You had 1 BSOD on 12 Feb 2014 -- bugcheck 0x116 (video TDR timeout) naming ATI video as the probable cause.

Then all quiet until 1 March -
  • 01 March 2014 - 0x116; ATI video
  • 05 March 2014 - 0x116; ATI video
  • 10 March 2014 - 0x116; ATI video
  • 17 March 2014 - 0xfc (attempt made to execute non-executable memory); 75802976.sys - Kaspersky

Curiosity, please - how many dump files are there in C:\Windows\LiveKernelReports\WATCHDOG?

^^^^^^ These are usually 0x116 "Live Kernel Reports" with bugcheck 0x117 - which is very similar to your 0x116 BSODs.

  • 0x117 - Video TDR Timeout; video driver was unable to reset in the time allotted (usually 30000 ms = 30 secs), but was ultimately able to recover
  • 0x116 - same as 0x117, but was unable to ultimately recover

Check in with the Reliability Monitor to see what installed prior to 12 Feb and 1 March.

Bring up an Admin CMD prompt; type or paste -
Code:
perfmon /rel

Kind Regards. . .

John
 

Has Sysnative Forums helped you? Please consider donating to help us support the site!

Back
Top