SETI@home v7 v7.03 (opencl_ati_cat132) and Catalyst 13.4

Stick Project Donor
Volunteer tester

Joined: 26 Feb 00
Posts: 100
Credit: 5,283,449
RAC: 5
United States
Message 1405024 - Posted: 19 Aug 2013, 13:36:00 UTC

I recently upgraded from Catalyst 13.1 to 13.4 on my laptop in an attempt to fix a crashing problem with a new app at Collatz. (It worked.) After the upgrade, I started getting SETI@home v7 v7.03 (opencl_ati_cat132) tasks here. Most of the time, these tasks have finished up OK and, as their wingman results come in, they validate. But I've had two results that did not. One crashed at around the 90% complete mark and the other finished up OK but was invalid. My laptop is pretty reliable and is not prone to problems like this. Therefore, I suspect a glitch in software. Any thoughts?
ID: 1405024
BilBg
Volunteer tester
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1405036 - Posted: 19 Aug 2013, 14:10:22 UTC - in response to Message 1405024.  


Do you monitor temperatures of CPU and GPU?
(the one that crashed may have been caused by high temperature)

The 'marked as invalid' may be a server glitch, as the other task of the WU is still 'validation inconclusive':
http://setiathome.berkeley.edu/workunit.php?wuid=1302356833


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1405036
Stick Project Donor
Volunteer tester

Joined: 26 Feb 00
Posts: 100
Credit: 5,283,449
RAC: 5
United States
Message 1405043 - Posted: 19 Aug 2013, 15:15:32 UTC - in response to Message 1405036.  

Do you monitor temperatures of CPU and GPU?
(the one that crashed may have been caused by high temperature)

No, I don't monitor temps. But I don't overclock either. And, while it's certainly possible that this one was caused by a hardware hiccup, I'm just not seeing any other indications that point to hardware.

The 'marked as invalid' may be a server glitch, as the other task of the WU is still 'validation inconclusive':
http://setiathome.berkeley.edu/workunit.php?wuid=1302356833

Hmm! I saw that too, but didn't consider the server glitch possibility. Is that an issue here? If so, I've missed it.
ID: 1405043
Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1405055 - Posted: 19 Aug 2013, 15:43:39 UTC - in response to Message 1405036.  
Last modified: 19 Aug 2013, 15:46:25 UTC


Do you monitor temperatures of CPU and GPU?
(the one that crashed may have been caused by high temperature)

The 'marked as invalid' may be a server glitch, as the other task of the WU is still 'validation inconclusive':
http://setiathome.berkeley.edu/workunit.php?wuid=1302356833

And now it has validated: 2 NVidia GPUs (1 cuda50, 1 Anonymous Platform) got a -9 overflow (28,0,0,0,2). Yours must have been different, but the Task Details page doesn't give the signal count for yours. I didn't look to see if the others were consistently throwing -9s. It might also be that the result file got damaged in transit, which can cause an automatic invalid...

It may be just "one of those things". Worry only if it happens on a regular basis.
Donald
Infernal Optimist / Submariner, retired
ID: 1405055
Mike Special Project $75 donor
Volunteer tester
Joined: 17 Feb 01
Posts: 34253
Credit: 79,922,639
RAC: 80
Germany
Message 1405076 - Posted: 19 Aug 2013, 16:09:17 UTC

Please delete all files ending with .bin, .bin_V7, and .wisdom from your project folder.
The binaries should then be recreated by the new drivers.
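
For anyone who would rather script the cleanup than hunt for the files by hand, here is a minimal sketch in Python. It only covers the three suffixes named above. The function names are made up for illustration, the case-insensitive match is an assumption, and it assumes BOINC is stopped or suspended while it runs. It defaults to a dry run that lists the files without deleting anything.

```python
from pathlib import Path

# Suffixes named in the post; matching case-insensitively is an assumption.
CACHE_SUFFIXES = (".bin", ".bin_v7", ".wisdom")


def find_cached_kernels(project_dir):
    """Return files directly inside project_dir whose names end with a cache suffix."""
    root = Path(project_dir)
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.name.lower().endswith(CACHE_SUFFIXES)
    )


def delete_cached_kernels(project_dir, dry_run=True):
    """Return the matching files; delete them only when dry_run is False."""
    targets = find_cached_kernels(project_dir)
    for path in targets:
        if not dry_run:
            path.unlink()
    return targets
```

Call `delete_cached_kernels(folder)` first to review the list, then repeat with `dry_run=False`. The project folder location depends on your BOINC data directory, so pass whatever path your own install uses.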



With each crime and every kindness we birth our future.
ID: 1405076
BilBg
Volunteer tester
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1405082 - Posted: 19 Aug 2013, 16:17:51 UTC - in response to Message 1405043.  


Monitor temperatures:
SIV - System Information Viewer (you need only siv.zip; no install, just unpack and run SIV32X.exe or SIV64X.exe. It should also work on your Pentium computers, and it reads motherboard sensors as well, not only the CPU's internal sensors.)
http://rh-software.com/

Control temperatures:
TThrottle (set a max temperature for CPU and GPU; BOINC processes will be throttled if it is reached)
http://efmer.eu/boinc/


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1405082
cov_route
Joined: 13 Sep 12
Posts: 342
Credit: 10,270,618
RAC: 0
Canada
Message 1405163 - Posted: 19 Aug 2013, 18:47:02 UTC - in response to Message 1405076.  

Please delete all files ending with .bin, .bin_V7, and .wisdom from your project folder.
The binaries should then be recreated by the new drivers.

+1. Very important and not documented AFAIK except in the forums.
ID: 1405163
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1405173 - Posted: 19 Aug 2013, 19:10:42 UTC - in response to Message 1405163.  

Please delete all files ending with .bin, .bin_V7, and .wisdom from your project folder.
The binaries should then be recreated by the new drivers.

+1. Very important and not documented AFAIK except in the forums.

In the ReadMe_MultiBeam_OpenCL_ATI.txt Readme:

Known issues:

- Catalyst 12.11 beta and 13.1 have broken OpenCL compiler that will result in driver restarts or invalid results.
But these drivers can be used still if the kernels binaries are precompiled under an older Catalyst driver.
That is, delete all *.bin* files from SETI project directory, revert to Catalyst 12.8 or 12.10, or upgrade to Catalyst 13.2 or later, process at least one task
(check that those *.bin* files were generated again) and (if needed) update to Catalyst 13.1.
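
The "check that those *.bin* files were generated again" step can be done by eye, but a rough sketch of an automated check is below. It assumes you noted the time (as a Unix timestamp) just before deleting the old files, and that file modification time is a fair proxy for "regenerated". The function name is hypothetical; only the `*.bin*` pattern comes from the readme.

```python
from pathlib import Path


def regenerated_since(project_dir, since_epoch):
    """Return names of *.bin* files modified at or after since_epoch (seconds)."""
    root = Path(project_dir)
    return sorted(
        p.name for p in root.glob("*.bin*")
        if p.is_file() and p.stat().st_mtime >= since_epoch
    )
```

If the returned list is still empty after at least one GPU task has completed, the kernels were probably not rebuilt, and it would be worth rechecking before switching drivers again.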


Claggy
ID: 1405173
cov_route
Joined: 13 Sep 12
Posts: 342
Credit: 10,270,618
RAC: 0
Canada
Message 1405203 - Posted: 19 Aug 2013, 19:40:01 UTC - in response to Message 1405173.  

I'm supposed to read the readme files? Why hasn't anyone explained this to me before!!!
ID: 1405203
Stick Project Donor
Volunteer tester

Joined: 26 Feb 00
Posts: 100
Credit: 5,283,449
RAC: 5
United States
Message 1405208 - Posted: 19 Aug 2013, 19:44:33 UTC - in response to Message 1405163.  
Last modified: 19 Aug 2013, 19:48:02 UTC

Please delete all files ending with .bin, .bin_V7, and .wisdom from your project folder.
The binaries should then be recreated by the new drivers.

+1. Very important and not documented AFAIK except in the forums.

Mike, Claggy and cov_route,

Thank you for this suggestion! I deleted 47 such files (with varying dates of creation) from my project folder and 20+ have now been recreated. My gut feeling was that the issues I reported had something to do with upgrading Catalyst, and at least this suggestion deals with that possibility. Hopefully, the problems will go away and we'll be able to say it worked. Unfortunately, it's one of those things we won't be able to prove.

Stick
ID: 1405208
Raistmer
Volunteer developer
Volunteer tester
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1405212 - Posted: 19 Aug 2013, 19:50:36 UTC - in response to Message 1405203.  

I'm supposed to read the readme files? Why hasn't anyone explained this to me before!!!

LoL, RTFM ;D

SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1405212
