Error while computing

Message boards : Number crunching : Error while computing
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1485369 - Posted: 6 Mar 2014, 17:25:07 UTC
Last modified: 6 Mar 2014, 17:25:52 UTC

I have 3 of the new AP work units fail with this error. All 3 AP work units were issued in this latest batch that started going out after the AP shutdown.

All my previous errors I can account for but I have no idea on these 3 work units.

Nothing changed with my setup!
Boinc....Boinc....Boinc....Boinc....
ID: 1485369 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1485375 - Posted: 6 Mar 2014, 17:31:09 UTC

Don´t know if could be related but i get this error: http://setiathome.berkeley.edu/result.php?resultid=3422583824
if you look the the WU produces error aparently when crunched by a CUDA host and no error in normal SETI.http://setiathome.berkeley.edu/workunit.php?wuid=1442833289
ID: 1485375 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1485380 - Posted: 6 Mar 2014, 17:52:58 UTC

now up to 5 errors................
Boinc....Boinc....Boinc....Boinc....
ID: 1485380 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1485396 - Posted: 6 Mar 2014, 18:07:20 UTC - in response to Message 1485380.  

So....when did you 'Upgrade' to 7.2.42?

Your errors are related to the AP & MB alleged BOINC cleanup error mentioned in two very long threads.
ID: 1485396 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1485399 - Posted: 6 Mar 2014, 18:13:28 UTC - in response to Message 1485369.  

I have 3 of the new AP work units fail with this error. All 3 AP work units were issued in this latest batch that started going out after the AP shutdown.

All my previous errors I can account for but I have no idea on these 3 work units.

Nothing changed with my setup!

Geek@Play, you can't just run any old app on the project, Since you're an Alpha tester, and have access to the Alpha testing threads, you're got to Bench test them first, we have no idea if they work or not, you're running r2163:

http://setiathome.berkeley.edu/result.php?resultid=3423670283
CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX
AstroPulse v6 Windows x86 rev 2163, V6 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

r2163 has been withdrawn because it doesn't work correctly:

[quote author=Mike link=topic=1625.msg55087#msg55087 date=1393955754]
Warning

r_2163 for GPU needs to be removed it has the termination issue.
I dont know when you uploaded this rev but it has never been tested.


Claggy
ID: 1485399 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1485405 - Posted: 6 Mar 2014, 18:18:46 UTC - in response to Message 1485375.  

Don´t know if could be related but i get this error: http://setiathome.berkeley.edu/result.php?resultid=3422583824
if you look the the WU produces error aparently when crunched by a CUDA host and no error in normal SETI.http://setiathome.berkeley.edu/workunit.php?wuid=1442833289

Error

Error on call (cudaMemcpy(TripletResults, dev_TripletResults, 2 * grid.x * block.x * grid.y * block.y * sizeof(*dev_TripletResults), cudaMemcpyDeviceToHost)), file c:/[Projects]/__Sources/sah_v7_opt/Xbranch/client/cuda/cudaAcc_pulsefind.cu, line 318: invalid argument

Has been reported to both Eric and Jason already - we spotted it first with WU 1441160462

Good to see a normal outcome with the standard app in live running - that confirms bench testing carried out offline. The characteristic of these tasks is that they run for a very short time (2 seconds, in the case I watched) and then go into 'temporary exit' and 'waiting to run': and they repeat the cycle again, and again, and again.

The WU names are interesting, too:

07jl13aa.22781.38725.438086664207.12.0, WU true angle range is : 279.183095
23my13ab.29476.393355.438086664207.12.0, WU true angle range is : 252.972605

Seems to be some sort of pattern there.....

(but they're a completely different pattern from Geek's - sorry for hijacking the thread, Geek)
ID: 1485405 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1485431 - Posted: 6 Mar 2014, 19:17:25 UTC

Thanks Claggy,

Some where along the time of the AP outage I must have made some changes. I have reverted to a clean Lunatics 0.41 install. Many work units were re-issued back to me. Looks like I lost some though.

I will run 0.41 for a while and let things work themselves out.
Boinc....Boinc....Boinc....Boinc....
ID: 1485431 · Report as offensive

Message boards : Number crunching : Error while computing


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.