Help with a Stuck Upload?

Message boards : Number crunching : Help with a Stuck Upload?
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Brent Norman
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1820
Credit: 105,440,696
RAC: 450,132
Canada
Message 1895513 - Posted: 15 Oct 2017, 16:33:42 UTC

I noticed my AMD has a stuck upload from the end of August (shows how often I look at that computer :D) that is nearing the deadline.

I imagine that it is <state>4</state> that has it in "Uploading" state, but what state would it be to change it to ready to report so that it will retry? It won't go any other way with resends, etc.

I don't have this output file either (and no slot for it):
<file_name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</file_name>

This is the client state info about it:
<client_state>
================
<file>
<name>15no08ae.30420.7025.3.30.103.vlar</name>
<nbytes>365992.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>ddae92dd3fd362014979d9c7ff9268f9</md5_cksum>
<status>1</status>
<download_url>http://boinc2.ssl.berkeley.edu/sah/download_fanout/1b4/15no08ae.30420.7025.3.30.103.vlar</download_url>
</file>
<file>
<name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</name>
<nbytes>23389.000000</nbytes>
<max_nbytes>65536.000000</max_nbytes>
<md5_cksum>f4d4a5d890b0a6c2188de503fd42b578</md5_cksum>
<status>0</status>
<upload_url>http://setiboincdata.ssl.berkeley.edu/sah_cgi/file_upload_handler</upload_url>
</file>
===========
<workunit>
<name>15no08ae.30420.7025.3.30.103.vlar</name>
<app_name>setiathome_v8</app_name>
<version_num>800</version_num>
<rsc_fpops_est>183887307748840.000000</rsc_fpops_est>
<rsc_fpops_bound>3677746154976800.000000</rsc_fpops_bound>
<rsc_memory_bound>33554432.000000</rsc_memory_bound>
<rsc_disk_bound>33554432.000000</rsc_disk_bound>
<file_ref>
<file_name>15no08ae.30420.7025.3.30.103.vlar</file_name>
<open_name>work_unit.sah</open_name>
</file_ref>
</workunit>
==========
<result>
<name>15no08ae.30420.7025.3.30.103.vlar_1</name>
<final_cpu_time>31127.080000</final_cpu_time>
<final_elapsed_time>32952.148880</final_elapsed_time>
<exit_status>0</exit_status>
<state>4</state>
<platform>windows_intelx86</platform>
<version_num>800</version_num>
<final_peak_working_set_size>95137792</final_peak_working_set_size>
<final_peak_swap_size>92172288</final_peak_swap_size>
<final_peak_disk_usage>41904</final_peak_disk_usage>
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1libboinc: 7.7.0Work Unit Info:...............WU true angle range is : 0.008002Optimal function choices:-------------------------------------------------------- name timing error-------------------------------------------------------- v_BaseLineSmooth (no other) v_vGetPowerSpectrumUnrolled 0.000780 0.00000 sse3_ChirpData_ak8 0.015591 0.00000 v_vTranspose4x16ntw 0.005796 0.00000 AK SSE folding 0.003139 0.00000 Flopcounter: 44660803580862.398000Spike count: 1Autocorr count: 0Pulse count: 9Triplet count: 3Gaussian count: 003:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>
<wu_name>15no08ae.30420.7025.3.30.103.vlar</wu_name>
<report_deadline>1508830131.000000</report_deadline>
<received_time>1504232949.372196</received_time>
<file_ref>
<file_name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</file_name>
<open_name>result.sah</open_name>
</file_ref>
</result>

================================
ID: 1895513 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 8876
Credit: 114,923,647
RAC: 69,614
Australia
Message 1895634 - Posted: 16 Oct 2017, 7:36:37 UTC

I take it rebooting hasn't helped?
Grant
Darwin NT
ID: 1895634 · Report as offensive     Reply Quote
Profile Brent Norman
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1820
Credit: 105,440,696
RAC: 450,132
Canada
Message 1895635 - Posted: 16 Oct 2017, 8:10:12 UTC

Nahh, that does nothing to help.
ID: 1895635 · Report as offensive     Reply Quote
Profile Ageless
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 14240
Credit: 3,525,914
RAC: 767
Netherlands
Message 1895643 - Posted: 16 Oct 2017, 10:19:43 UTC
Last modified: 16 Oct 2017, 10:21:23 UTC

If this task isn't visible in the Transfer tab, you can try the following.
Exit BOINC completely.
Edit client_state.xml with a simple text editor (Notepad, Notepad++) and go to that entry.

Go to the end of this part of the entry:
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1libboinc: 7.7.0Work Unit Info:...............WU true angle range is : 0.008002Optimal function choices:-------------------------------------------------------- name timing error-------------------------------------------------------- v_BaseLineSmooth (no other) v_vGetPowerSpectrumUnrolled 0.000780 0.00000 sse3_ChirpData_ak8 0.015591 0.00000 v_vTranspose4x16ntw 0.005796 0.00000 AK SSE folding 0.003139 0.00000 Flopcounter: 44660803580862.398000Spike count: 1Autocorr count: 0Pulse count: 9Triplet count: 3Gaussian count: 003:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>

and immediately after this put <ready_to_report/>

So this part becomes
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1libboinc: 7.7.0Work Unit Info:...............WU true angle range is : 0.008002Optimal function choices:-------------------------------------------------------- name timing error-------------------------------------------------------- v_BaseLineSmooth (no other) v_vGetPowerSpectrumUnrolled 0.000780 0.00000 sse3_ChirpData_ak8 0.015591 0.00000 v_vTranspose4x16ntw 0.005796 0.00000 AK SSE folding 0.003139 0.00000 Flopcounter: 44660803580862.398000Spike count: 1Autocorr count: 0Pulse count: 9Triplet count: 3Gaussian count: 003:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>
<ready_to_report/>

Then restart BOINC.

This will force BOINC to report this task, which will fail to validate because it will not have the result lines. But that at least moves it out of the client and out of your tasks list. The problem happens when BOINC uploads the task but doesn't get an ACK (from acknowledged, the thing is in) from the server. Apparently BOINC then doesn't retry the upload and the task is in this stuck state until it times out.

If this task is still visible in the Transfer tab, you can do a Retry now on it and see if it uploads.
If it then doesn't, make sure task_debug is checked in Options->Event log options, do a retry on the upload and post the corresponding part of the event log here.
But this only helps if the task is still visible in the Transfer tab, else you have to force report it.
Jord

Ancient Astronaut Theorists suggest that in many ways, you can be considered an alien conspiracy!
ID: 1895643 · Report as offensive     Reply Quote
Profile Brent Norman
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1820
Credit: 105,440,696
RAC: 450,132
Canada
Message 1895645 - Posted: 16 Oct 2017, 10:42:06 UTC - in response to Message 1895643.  

Thanks Jord, there was was some results in the client file as well.

Task sent and validated :) https://setiathome.berkeley.edu/workunit.php?wuid=2660401415

Much appeciated,
Brent
ID: 1895645 · Report as offensive     Reply Quote
Profile Ageless
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 14240
Credit: 3,525,914
RAC: 767
Netherlands
Message 1895649 - Posted: 16 Oct 2017, 12:35:23 UTC - in response to Message 1895645.  

Huzzah! :-)
ID: 1895649 · Report as offensive     Reply Quote

Message boards : Number crunching : Help with a Stuck Upload?


 
©2017 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.