| Author |
Message |
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
|
My main cruncher recently started rebooting itself.
When I disabled auto-restart, I saw a BSoD with the REFERENCE_BY_POINTER bugcheck message.
AFAIK no system-wide updates were installed immediately before these reboots began.
Has anyone encountered this type of BSoD, and what was the cause? (I read the MS article about a ref/deref mismatch, but I don't know which driver causes it.)
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
|
|
|
Only one question, Raistmer: do you use an NDAS?
____________
|
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
Only one question, Raistmer: do you use an NDAS?
If NDAS is Network Direct Attached Storage, then no, I don't use one.
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
|
|
|
If you have dump files:
This is a set of tools for analysing what happened.
Look at "Who Crashed".
http://www.resplendence.com/downloads
Hope this helps. |
|
|
|
|
|
First of all, make sure only minidumps are created; a full memory dump is not necessary unless a third party needs it to create a fix.
I've used BlueScreenView for many years, as it gets straight to the point and quickly shows which drivers were loaded.
Some more URIs on the topic here on MSDN.
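For anyone who prefers to set this without clicking through the System Properties dialog, here is a minimal sketch that only builds the text of a .reg file. It assumes the usual CrashControl registry key and the CrashDumpEnabled / AutoReboot values as Microsoft documents them; the function name `build_reg_file` is made up for illustration. Review the output before importing it, and note that a reboot is normally needed for the change to take effect.

```python
# Sketch: build the text of a .reg file that selects small memory dumps
# (minidumps) and turns off automatic restart after a bugcheck.
# Nothing here touches the registry; it only produces text to review.
CRASH_CONTROL_KEY = (
    r"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\CrashControl"
)

# CrashDumpEnabled values (assumed from Microsoft's documentation).
DUMP_TYPES = {"none": 0, "complete": 1, "kernel": 2, "small": 3}


def build_reg_file(dump_type="small", auto_reboot=False):
    """Return .reg file text for the chosen crash dump settings."""
    values = {
        "CrashDumpEnabled": DUMP_TYPES[dump_type],
        "AutoReboot": 1 if auto_reboot else 0,
    }
    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        f"[{CRASH_CONTROL_KEY}]",
    ]
    for name, value in values.items():
        # DWORD values are written as 8 hex digits in .reg syntax.
        lines.append(f'"{name}"=dword:{value:08x}')
    return "\r\n".join(lines) + "\r\n"


if __name__ == "__main__":
    print(build_reg_file())
```

With auto-restart off, the bugcheck screen stays visible, and the minidump is small enough to attach to a driver support case.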
____________
Morten Ross
|
|
|
|
|
|
If the malfunctioning computer is the one with the AMD Radeon HD 6900 series card, it may be the same problem I have had for a long time.
It started with Catalyst version 12.4, which made my computer BSOD occasionally with the error message REFERENCE_BY_POINTER.
But since I recently upgraded to version 12.10, the problem seems to have disappeared.
I have now been using that driver for over a week without a BSOD.
|
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
|
Thanks all for the suggestions.
I will try to implement them.
Regarding the HD6950: yes, it's that host. But the driver was installed a long time ago (Cat 12.6). Some time ago there were frequent reboots; when I reduced the number of simultaneously running tasks to 1, they went away. Recently I set it to 2 simultaneous tasks again, and that config worked for a week or even more. But now the restarts (BSoDs, now that auto-restart is disabled) happen too often to do anything useful on that host.
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
|
|
|
When that happened to me this summer, it was because I had run the CPU at 1.47 VDC for about 3 years at 4.2 GHz, and eventually fried the CPU. I originally thought it was a Windows update, then the hard drive, then the RAM, then the motherboard. I knew it wasn't the PSU, so all that was left was the CPU. I replaced the i7 980 with an old i7 950, and it worked just fine. I really do miss that unlocked multiplier though....
Steve
____________
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website |
|
|
|
|
|
Try upgrading your drivers.
You are using OpenCL 1.2 AMD-APP (938.1), which causes problems in my case.
I'm using OpenCL 1.2 AMD-APP (1016.4) right now without problems. |
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
When that happened to me this summer, it was because I had run the CPU at 1.47 VDC for about 3 years at 4.2 GHz, and eventually fried the CPU. I originally thought it was a Windows update, then the hard drive, then the RAM, then the motherboard. I knew it wasn't the PSU, so all that was left was the CPU. I replaced the i7 980 with an old i7 950, and it worked just fine. I really do miss that unlocked multiplier though....
Steve
It's a Q35M motherboard without any overclocking features... And the stock fan was replaced with a Titan long ago. CPU temp is quite low, and moreover the host has had no CPU work for the last few days... and it still hangs. So it's probably something else in my case.
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
Try upgrading your drivers.
You are using OpenCL 1.2 AMD-APP (938.1), which causes problems in my case.
I'm using OpenCL 1.2 AMD-APP (1016.4) right now without problems.
I will leave it as is overnight with minidumps enabled. Maybe I will catch the faulty AMD driver and can raise the case with their support once again...
Then I will do the update.
[Not sure whether it should be an update or a rollback, though. I just discovered that the app built with the old APP SDK 2.5 is faster than the one built with APP SDK 2.6 ... And it will support many more hosts ...]
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
Mike Volunteer tester
Joined: 17 Feb 01 Posts: 19467 Credit: 21,083,566 RAC: 27,095

|
Try upgrading your drivers.
You are using OpenCL 1.2 AMD-APP (938.1), which causes problems in my case.
I'm using OpenCL 1.2 AMD-APP (1016.4) right now without problems.
I will leave it as is overnight with minidumps enabled. Maybe I will catch the faulty AMD driver and can raise the case with their support once again...
Then I will do the update.
[Not sure whether it should be an update or a rollback, though. I just discovered that the app built with the old APP SDK 2.5 is faster than the one built with APP SDK 2.6 ... And it will support many more hosts ...]
And it will cause much more trouble because of all those buggy drivers.
____________
|
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
|
It looks like the motherboard for that host is completely dead now.
After another BSoD it refused to reboot.
It starts and stops the CPU and system fans in a loop.
I have already removed everything but the CPU itself... I even replaced the PSU with an old one. Same behavior.
Now, when the PSU is switched ON (by its own hard switch), it sometimes spins up the system and CPU fans a few times, and sometimes doesn't start at all. There is no reaction to pressing the "Power" button (on the front of the case).
Before this, the motherboard would briefly spin up the fans and HDD when PSU power was switched ON, then turn off and wait for the power button press.
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|
Raistmer Volunteer developer Volunteer tester
Joined: 16 Jun 01 Posts: 2541 Credit: 25,569,662 RAC: 41,288

|
|
After replacing the motherboard, the same BSoD happened again.
But this time it was caught!
WhoCrashed gave the following info:
This was probably caused by the following module: atikmdag.sys (atikmdag+0x14CF5)
Bugcheck code: 0x18 (0xFFFFFFFF84F7ADB0, 0xFFFFFFFF878621D8, 0x1, 0x1)
Error: REFERENCE_BY_POINTER
file path: C:\Windows\system32\drivers\atikmdag.sys
product: ATI Radeon Family
company: Advanced Micro Devices, Inc.
description: ATI Radeon Kernel Mode Driver
So, it's AMD's crappy driver again...
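For attaching these numbers to a driver support case, a small sketch that splits the "Bugcheck code" line into the code and its four parameters. Some assumptions: the regular expression only matches the line format shown above, and the helper names `parse_bugcheck` and `low32` are made up for illustration. As I read Microsoft's page for bug check 0x18, parameter 1 is the object type and parameter 2 is the object whose reference count is being lowered. The leading FFFFFFFF on the first two values looks like a 32-bit kernel address sign-extended to 64 bits, but that is a guess, not something the report says.

```python
import re

# Matches lines of the form shown in the report above, e.g.
# "Bugcheck code: 0x18 (0x..., 0x..., 0x1, 0x1)". Assumed format only.
LINE_RE = re.compile(r"Bugcheck code:\s*(0x[0-9A-Fa-f]+)\s*\(([^)]*)\)")

# Only the code seen in this thread; extend as needed.
BUGCHECK_NAMES = {0x18: "REFERENCE_BY_POINTER"}


def parse_bugcheck(line):
    """Return (code, [params]) as integers from a bugcheck report line."""
    match = LINE_RE.search(line)
    if match is None:
        raise ValueError("not a bugcheck line: %r" % line)
    code = int(match.group(1), 16)
    params = [int(p.strip(), 16) for p in match.group(2).split(",")]
    return code, params


def low32(value):
    """Drop the upper 32 bits (assumed to be sign extension)."""
    return value & 0xFFFFFFFF


if __name__ == "__main__":
    line = ("Bugcheck code: 0x18 (0xFFFFFFFF84F7ADB0, "
            "0xFFFFFFFF878621D8, 0x1, 0x1)")
    code, params = parse_bugcheck(line)
    print(BUGCHECK_NAMES.get(code, "unknown"), hex(code))
    print("object type :", hex(low32(params[0])))
    print("object      :", hex(low32(params[1])))
```

The two addresses are only meaningful inside the matching dump file, so the minidump itself should go to support along with them.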
____________
News about SETI opt app releases: https://twitter.com/Raistmer |
|
|