27 Computation Errors?

Message boards : Number crunching : 27 Computation Errors?
Message board moderation

To post messages, you must log in.

AuthorMessage
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1948957 - Posted: 11 Aug 2018, 7:27:35 UTC

My computer just downloaded a bunch of new WUs that were immediately given the status "Computation error (0.0205 CPUS +1AMD/ATI GPU".

What is that all about?

The BOINC version installed is 1.12.1.

These are all SETI@Home v8.8.22.

In addition, there are 29 WUs of v8.8.05 ready to start and one running.

Machine is also concurrently running Asteroids.

Any help?

Thanks.
Steve Gaber
Oldsmar, FL
ID: 1948957 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1948960 - Posted: 11 Aug 2018, 7:50:10 UTC - in response to Message 1948957.  

Update: My account now shows 93 computational errors. -- SAG

My computer just downloaded a bunch of new WUs that were immediately given the status "Computation error (0.0205 CPUS +1AMD/ATI GPU".

What is that all about?

The BOINC version installed is 1.12.1.

These are all SETI@Home v8.8.22.

In addition, there are 29 WUs of v8.8.05 ready to start and one running.

Machine is also concurrently running Asteroids.

Any help?

Thanks.
Steve Gaber
Oldsmar, FL
ID: 1948960 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34754
Credit: 261,360,520
RAC: 489
Australia
Message 1948961 - Posted: 11 Aug 2018, 7:50:37 UTC

Have you restarted the computer?

Cheers.
ID: 1948961 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1949023 - Posted: 11 Aug 2018, 17:43:08 UTC - in response to Message 1948961.  

Have you restarted the computer?

Cheers.


Wiggo--
Thank for the reply.

Yes, I did. And the computer immediately downloaded 29 new v8.8.22 w WUs, one of which is now running. There are also 32 v8.8.95 WUs , two of which are running. I suspended Asteroids during this time, just for clarity.

However, my account now shows 212 errors while computing.

??????

Steve Gaber
Oldsmar, FL
ID: 1949023 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1949026 - Posted: 11 Aug 2018, 18:33:15 UTC - in response to Message 1949023.  
Last modified: 11 Aug 2018, 18:34:11 UTC

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1949026 · Report as offensive
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1949084 - Posted: 11 Aug 2018, 22:36:18 UTC - in response to Message 1949026.  

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


Heat issue. My guess too.
+1
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1949084 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1949102 - Posted: 12 Aug 2018, 0:06:04 UTC - in response to Message 1949026.  

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


No, not overclocking. That might be beyond my technical competence.

I will open up the computer and vacuum out any dust or cat hairs that might block fans and cooling surfaces to relieve possible heat retention.

Report of results to follow.

Thanks.

Steve Gaber
Oldsmar, FL
ID: 1949102 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1949158 - Posted: 12 Aug 2018, 10:28:37 UTC - in response to Message 1949102.  

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


No, not overclocking. That might be beyond my technical competence.

I will open up the computer and vacuum out any dust or cat hairs that might block fans and cooling surfaces to relieve possible heat retention.

Report of results to follow.

Thanks.

Steve Gaber
Oldsmar, FL


And if it were an issue related to overheating, why would BOINC allow my computer to download 212 WUs over a limited time frame then immediately write them off as errors in computation?

Steve Gaber
Oldsmar, FL
ID: 1949158 · Report as offensive
JLDun
Volunteer tester
Avatar

Send message
Joined: 21 Apr 06
Posts: 573
Credit: 196,101
RAC: 0
United States
Message 1949277 - Posted: 13 Aug 2018, 1:31:17 UTC - in response to Message 1949158.  
Last modified: 13 Aug 2018, 1:32:16 UTC

why would BOINC allow my computer to download 212 WUs over a limited time frame

Short answer: because it can.

then immediately write them off as errors in computation?

Well, WHATEVER is going on is hitting at the beginning of the WU's, which may include "bad math" caused by "excess heat energy" flipping random bits....
ID: 1949277 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1949285 - Posted: 13 Aug 2018, 3:18:51 UTC - in response to Message 1949102.  

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


No, not overclocking. That might be beyond my technical competence.

I will open up the computer and vacuum out any dust or cat hairs that might block fans and cooling surfaces to relieve possible heat retention.

Report of results to follow.

Thanks.

Steve Gaber
Oldsmar, FL


Sounds like you are not running TThortle from the Seti Project -> Addons page? This will both track and slow your system down to take care of heat above the range you want.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1949285 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1949287 - Posted: 13 Aug 2018, 3:37:04 UTC

You appear to have an AMD A6-6400K APU with Radeon(tm) HD Graphics. That has 2 cores.

I would try two things.
1) Download the latest drivers and do a "clean" install of all your Radeon drivers.

If that doesn't clear it up.
2) Go into Seti "local config" in the Boinc Manager and set the "number of cpus" you use to 75% or even 50%.

I am presuming you have installed the gpu command line parameters in the hidden \ProgramData\Boinc\projects\setiathome etc directory?

Something like: -sbs 192 -spike_fft_thresh 2048 -tune 1 2 1 16 -period_iterations_num 20

If your video driver is crashing try a higher # in the period_iterations parm.

And I am assuming you have TThortle installed. Seti website -> Project -> Addons -> TThortle

If your system is running too hot, it "might" be an issue.

I am also assuming your are not trying to OverClock your video card?

HTH,
Tom
A proud member of the OFA (Old Farts Association).
ID: 1949287 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1949290 - Posted: 13 Aug 2018, 3:44:57 UTC - in response to Message 1949158.  

Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


No, not overclocking. That might be beyond my technical competence.

I will open up the computer and vacuum out any dust or cat hairs that might block fans and cooling surfaces to relieve possible heat retention.

Report of results to follow.

Thanks.

Steve Gaber
Oldsmar, FL


And if it were an issue related to overheating, why would BOINC allow my computer to download 212 WUs over a limited time frame then immediately write them off as errors in computation?

Steve Gaber
Oldsmar, FL

Your Nvidia ComputeCache is corrupted. When this happens you will clear out your cache of gpu work in about 20 seconds as it tries to run each task for 1.04 seconds, errors it and moves on to the next in your cache until all work is gone. Then you get to wait 24 hours for more gpu work because BOINC penalizes you for returning errored work.

Stop BOINC

Delete all the folders inside the Compute Cache at C:\Users\yourname\AppData\Roaming\NVIDIA\ComputeCache folder.

This is the folder that contains the compiled compute kernels for both CUDA and OpenCL. The drivers will recreate a new set of compiled kernels upon restart of BOINC gpu crunching.

Fix the overheating problem or back off the overclocks.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1949290 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1949294 - Posted: 13 Aug 2018, 3:56:44 UTC - in response to Message 1949287.  

You appear to have an AMD A6-6400K APU with Radeon(tm) HD Graphics. That has 2 cores.

I would try two things.
1) Download the latest drivers and do a "clean" install of all your Radeon drivers.

If that doesn't clear it up.
2) Go into Seti "local config" in the Boinc Manager and set the "number of cpus" you use to 75% or even 50%.

I am presuming you have installed the gpu command line parameters in the hidden \ProgramData\Boinc\projects\setiathome etc directory?

Something like: -sbs 192 -spike_fft_thresh 2048 -tune 1 2 1 16 -period_iterations_num 20

If your video driver is crashing try a higher # in the period_iterations parm.

And I am assuming you have TThortle installed. Seti website -> Project -> Addons -> TThortle

If your system is running too hot, it "might" be an issue.

I am also assuming your are not trying to OverClock your video card?

HTH,
Tom


Thanks for all your responses.

Nope, not overclocking.

I have not tried TThortle or recently updated the Radeon drivers. I do believe I installed the GPU commands about 6 months ago.

It must be an intermittent thing, because there have been a lot of v8.8.22 WUs successfully completed and reported along with bunches of v8.8.05 ones.

I assembled this machine from parts purchased from Tiger Direct. It has been running S@H and Asteroids 24/7/365 for almost four years.
I also use this computer for email, Facebook, word processing and, occasionally astrophotography. Every now and then it shuts itself off, which I attribute to overheating.

My account now shows only 150 errors while computing.

I will endeavor to pursue this further.
Thanks again.

Steve Gaber
Oldsmar, FL
ID: 1949294 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1949312 - Posted: 13 Aug 2018, 6:57:08 UTC

TThortle from the Seti Project page


It's TThrottle, as in Throttle:

"a device controlling the flow of fuel or power to an engine"
ID: 1949312 · Report as offensive
Steven Gaber

Send message
Joined: 19 Jan 13
Posts: 111
Credit: 2,834,186
RAC: 11
United States
Message 1952237 - Posted: 26 Aug 2018, 19:54:26 UTC - in response to Message 1949294.  

After amassing a total of 212 errors while computing, the number continued to dwindle. After two weeks, there are now five errors, four dating to August 11 and o ne to August 9. All are v.8 8.22;

So I never really learned the origin or conclusion of this episode and didn't really carry out any of the suggested remedies. The computer and/or BOINC seemed to fix themselves. (I hope.)

Steve Gabe
Oldsmar, FL

You appear to have an AMD A6-6400K APU with Radeon(tm) HD Graphics. That has 2 cores.

I would try two things.
1) Download the latest drivers and do a "clean" install of all your Radeon drivers.

If that doesn't clear it up.
2) Go into Seti "local config" in the Boinc Manager and set the "number of cpus" you use to 75% or even 50%.

I am presuming you have installed the gpu command line parameters in the hidden \ProgramData\Boinc\projects\setiathome etc directory?

Something like: -sbs 192 -spike_fft_thresh 2048 -tune 1 2 1 16 -period_iterations_num 20

If your video driver is crashing try a higher # in the period_iterations parm.

And I am assuming you have TThortle installed. Seti website -> Project -> Addons -> TThortle

If your system is running too hot, it "might" be an issue.

I am also assuming your are not trying to OverClock your video card?

HTH,
Tom


Thanks for all your responses.

Nope, not overclocking.

I have not tried TThortle or recently updated the Radeon drivers. I do believe I installed the GPU commands about 6 months ago.

It must be an intermittent thing, because there have been a lot of v8.8.22 WUs successfully completed and reported along with bunches of v8.8.05 ones.

I assembled this machine from parts purchased from Tiger Direct. It has been running S@H and Asteroids 24/7/365 for almost four years.
I also use this computer for email, Facebook, word processing and, occasionally astrophotography. Every now and then it shuts itself off, which I attribute to overheating.

My account now shows only 150 errors while computing.

I will endeavor to pursue this further.
Thanks again.

Steve Gaber
Oldsmar, FL
ID: 1952237 · Report as offensive

Message boards : Number crunching : 27 Computation Errors?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.