This one is gonna get stuck

Message boards : Number crunching : This one is gonna get stuck
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1601899 - Posted: 18 Nov 2014, 1:18:38 UTC

validation Pending
Task
click for details Computer Sent Time reported
or deadline
explain Status Run time
(sec) CPU time
(sec) Credit Application
3755585266 7208674 27 Sep 2014, 0:28:50 UTC 27 Sep 2014, 4:51:37 UTC Completed, waiting for validation 4,438.75 2,775.57 pending AstroPulse v6 v6.04 (opencl_nvidia_100)
3755585267 7376332 27 Sep 2014, 0:29:04 UTC 17 Nov 2014, 22:24:46 UTC Completed, waiting for validation 375,900.61 331,743.50 pending AstroPulse v6 v6.01
3795511907 6731176 22 Oct 2014, 4:02:18 UTC 16 Nov 2014, 4:02:18 UTC Timed out - no response 0.00 0.00 --- AstroPulse v6 v6.02
3837266083 5940036 16 Nov 2014, 4:02:56 UTC 11 Dec 2014, 4:02:56 UTC In progress --- --- --- AstroPulse v6
Anonymous platform (NVIDIA GPU)
Should I abort it or let it get stuck?
ID: 1601899 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1601906 - Posted: 18 Nov 2014, 1:33:07 UTC - in response to Message 1601899.  

validation Pending
Task
click for details Computer Sent Time reported
or deadline
explain Status Run time
(sec) CPU time
(sec) Credit Application
3755585266 7208674 27 Sep 2014, 0:28:50 UTC 27 Sep 2014, 4:51:37 UTC Completed, waiting for validation 4,438.75 2,775.57 pending AstroPulse v6 v6.04 (opencl_nvidia_100)
3755585267 7376332 27 Sep 2014, 0:29:04 UTC 17 Nov 2014, 22:24:46 UTC Completed, waiting for validation 375,900.61 331,743.50 pending AstroPulse v6 v6.01
3795511907 6731176 22 Oct 2014, 4:02:18 UTC 16 Nov 2014, 4:02:18 UTC Timed out - no response 0.00 0.00 --- AstroPulse v6 v6.02
3837266083 5940036 16 Nov 2014, 4:02:56 UTC 11 Dec 2014, 4:02:56 UTC In progress --- --- --- AstroPulse v6
Anonymous platform (NVIDIA GPU)
Should I abort it or let it get stuck?

Those first two will sit there waiting until the validators come back on line. I'm not sure why it went out the 3rd and 4th times, though.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1601906 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1601912 - Posted: 18 Nov 2014, 1:44:50 UTC
Last modified: 18 Nov 2014, 1:52:43 UTC

It got sent out because _1 missed the deadline, so it got sent to _2. _2 did not report by the deadline, either, so it got sent to _3. The day after _3 got it, _1 finally reported.

They should be glad the validators are down, or they would have missed that one by a lot. (edit: they should be very glad the validators are down.. it took them 51 days to crunch and report that one.)

So, because the validators are down, hurry up and crunch it and you won't get stuck in limbo. (edit3: and if you do get stuck in limbo, then that sheds some new details on this very interesting thing about WUs getting stuck. I've been trying to figure out what part of the process is the problem to try to narrow down the search area in the code.. if this one gets limbo'ed with the validators down, then there's definitely a problem in the section where the validator checks to see if there are any results that are still expected to be returned.)

(edit2: Also.. it is wu 1602787424 for those interested.)
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1601912 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1601937 - Posted: 18 Nov 2014, 2:25:25 UTC
Last modified: 18 Nov 2014, 2:33:42 UTC

Well I suspended apx 40 GPU tasks one at a time, what a pain and it is now crunching. It should be done in a couple of hours or so. I hope I remember to restart the paused ones so work continues.
I really don't like micro managing data.
ID: 1601937 · Report as offensive
Wedge009
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 451
Credit: 431,396,357
RAC: 553
Australia
Message 1601955 - Posted: 18 Nov 2014, 3:08:04 UTC

You should be able to select and suspend/resume multiple tasks...
Soli Deo Gloria
ID: 1601955 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1601960 - Posted: 18 Nov 2014, 3:32:08 UTC - in response to Message 1601937.  

Well I suspended apx 40 GPU tasks one at a time, what a pain and it is now crunching. It should be done in a couple of hours or so. I hope I remember to restart the paused ones so work continues.
I really don't like micro managing data.


Here is a little .bat script I use to suspend my AP tasks. I have 4 .bat files setup to suspend or resume AP & MB separately. You just have to change bmc2 & tsk_stng to suit your needs.
@ECHO OFF
pushd %~dp0
set bcmd1=--task
set bcmd2=suspend
set max_task_rep=5
set tsk_stng=ap_*.*
set project_addy=http://setiathome.berkeley.edu/
set project_path=%~dp0\projects\setiathome.berkeley.edu\
set lcnt=0
:loopy
if NOT exist %project_path%\%tsk_stng% @ECHO No tasks found & goto end
@ECHO Starting AP_%lcnt% tasks set to %bcmd2% %time%
for /f %%a IN ('dir "%project_path%\%tsk_stng%" /b /OS') do boinccmd.exe %bcmd1% %project_addy% %%a_%lcnt% %bcmd2%
@ECHO Finished AP_%lcnt% tasks set to %bcmd2% %time%
@ECHO.
if %lcnt%==%max_task_rep% goto end
set /a lcnt=%lcnt%+1 & goto loopy
:end
if "%1"=="auto" goto ext
timeout 5
:ext
exit

SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1601960 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1601988 - Posted: 18 Nov 2014, 4:39:01 UTC


Task
click for details Computer Sent Time reported
or deadline
explain Status Run time
(sec) CPU time
(sec) Credit Application
3496456876 7204507 19 Apr 2014, 6:03:43 UTC 20 Apr 2014, 8:32:35 UTC Completed and validated 5,611.84 5,548.11 683.24 AstroPulse v6 v6.04 (opencl_nvidia_100)
3496456877 6574317 19 Apr 2014, 6:03:47 UTC 19 Apr 2014, 16:03:06 UTC Error while computing 5,809.50 838.27 --- AstroPulse v6 v6.06 (opencl_ati_100)
3497336064 6778463 19 Apr 2014, 21:53:07 UTC 15 May 2014, 18:01:46 UTC Completed and validated 138,691.89 104,752.10 683.24 AstroPulse v6
Anonymous platform (CPU)
3538180933 4164382 14 May 2014, 21:53:12 UTC 14 May 2014, 21:58:24 UTC Error while computing 0.00 0.00 --- AstroPulse v6 v6.01
3538181166 5940036 14 May 2014, 21:58:27 UTC 23 May 2014, 20:59:28 UTC Completed, waiting for validation 15,970.20 6,525.54 pending AstroPulse v6
Anonymous platform (NVIDIA GPU)

The point is moot, now it is stuck with 15 mins to go for me.
ID: 1601988 · Report as offensive

Message boards : Number crunching : This one is gonna get stuck


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.