Help with a Stuck Upload?

Message boards : Number crunching : Help with a Stuck Upload?
Brent Norman
Volunteer tester

Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1895513 - Posted: 15 Oct 2017, 16:33:42 UTC

I noticed my AMD has a stuck upload from the end of August (shows how often I look at that computer :D) that is nearing the deadline.

I imagine it is <state>4</state> that keeps it in the "Uploading" state, but what value should I change it to so that the task becomes ready to report and the upload is retried? It won't go any other way (resends, etc.).

I don't have this output file either (and no slot for it):
<file_name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</file_name>

This is the client state info about it:
<client_state>
================
<file>
<name>15no08ae.30420.7025.3.30.103.vlar</name>
<nbytes>365992.000000</nbytes>
<max_nbytes>0.000000</max_nbytes>
<md5_cksum>ddae92dd3fd362014979d9c7ff9268f9</md5_cksum>
<status>1</status>
<download_url>http://boinc2.ssl.berkeley.edu/sah/download_fanout/1b4/15no08ae.30420.7025.3.30.103.vlar</download_url>
</file>
<file>
<name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</name>
<nbytes>23389.000000</nbytes>
<max_nbytes>65536.000000</max_nbytes>
<md5_cksum>f4d4a5d890b0a6c2188de503fd42b578</md5_cksum>
<status>0</status>
<upload_url>http://setiboincdata.ssl.berkeley.edu/sah_cgi/file_upload_handler</upload_url>
</file>
===========
<workunit>
<name>15no08ae.30420.7025.3.30.103.vlar</name>
<app_name>setiathome_v8</app_name>
<version_num>800</version_num>
<rsc_fpops_est>183887307748840.000000</rsc_fpops_est>
<rsc_fpops_bound>3677746154976800.000000</rsc_fpops_bound>
<rsc_memory_bound>33554432.000000</rsc_memory_bound>
<rsc_disk_bound>33554432.000000</rsc_disk_bound>
<file_ref>
<file_name>15no08ae.30420.7025.3.30.103.vlar</file_name>
<open_name>work_unit.sah</open_name>
</file_ref>
</workunit>
==========
<result>
<name>15no08ae.30420.7025.3.30.103.vlar_1</name>
<final_cpu_time>31127.080000</final_cpu_time>
<final_elapsed_time>32952.148880</final_elapsed_time>
<exit_status>0</exit_status>
<state>4</state>
<platform>windows_intelx86</platform>
<version_num>800</version_num>
<final_peak_working_set_size>95137792</final_peak_working_set_size>
<final_peak_swap_size>92172288</final_peak_swap_size>
<final_peak_disk_usage>41904</final_peak_disk_usage>
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1
libboinc: 7.7.0
Work Unit Info:
...............
WU true angle range is : 0.008002
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_vGetPowerSpectrumUnrolled 0.000780 0.00000
sse3_ChirpData_ak8 0.015591 0.00000
v_vTranspose4x16ntw 0.005796 0.00000
AK SSE folding 0.003139 0.00000
Flopcounter: 44660803580862.398000
Spike count: 1
Autocorr count: 0
Pulse count: 9
Triplet count: 3
Gaussian count: 0
03:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>
<wu_name>15no08ae.30420.7025.3.30.103.vlar</wu_name>
<report_deadline>1508830131.000000</report_deadline>
<received_time>1504232949.372196</received_time>
<file_ref>
<file_name>15no08ae.30420.7025.3.30.103.vlar_1_r1913111574_0</file_name>
<open_name>result.sah</open_name>
</file_ref>
</result>
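
For readers wondering how close "nearing the deadline" is: the sketch below assumes that <received_time> and <report_deadline> in the excerpt above are Unix timestamps (seconds since 1970-01-01 UTC) and converts them to readable dates. The helper name to_utc is made up for this illustration, and the posting time is taken from the message header.

```python
from datetime import datetime, timezone


def to_utc(epoch_seconds: float) -> datetime:
    """Convert seconds since the Unix epoch to a timezone-aware UTC datetime.

    Assumption: the client_state.xml time fields are Unix epoch seconds.
    """
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)


# Values copied from the <result> entry above.
received = to_utc(1504232949.372196)
deadline = to_utc(1508830131.000000)

# Time of the forum post (15 Oct 2017, 16:33:42 UTC), from the message header.
posted = datetime(2017, 10, 15, 16, 33, 42, tzinfo=timezone.utc)

fmt = "%Y-%m-%d %H:%M:%S"
print("received:", received.strftime(fmt))  # 2017-09-01 02:29:09
print("deadline:", deadline.strftime(fmt))  # 2017-10-24 07:28:51
print("whole days left when posting:", (deadline - posted).days)  # 8
```

Under that assumption the task arrived at about 02:29 UTC on 1 Sep 2017 (still the end of August in Canadian time zones) and had a little over eight days left when the question was posted.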

================================
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1895634 - Posted: 16 Oct 2017, 7:36:37 UTC

I take it rebooting hasn't helped?
Grant
Darwin NT
Brent Norman
Volunteer tester

Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1895635 - Posted: 16 Oct 2017, 8:10:12 UTC

Nahh, that does nothing to help.
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1895643 - Posted: 16 Oct 2017, 10:19:43 UTC
Last modified: 16 Oct 2017, 10:21:23 UTC

If this task isn't visible in the Transfer tab, you can try the following.
Exit BOINC completely.
Open client_state.xml in a plain text editor (Notepad, Notepad++) and go to the <result> entry for that task.

Go to the end of this part of the entry:
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1
libboinc: 7.7.0
Work Unit Info:
...............
WU true angle range is : 0.008002
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_vGetPowerSpectrumUnrolled 0.000780 0.00000
sse3_ChirpData_ak8 0.015591 0.00000
v_vTranspose4x16ntw 0.005796 0.00000
AK SSE folding 0.003139 0.00000
Flopcounter: 44660803580862.398000
Spike count: 1
Autocorr count: 0
Pulse count: 9
Triplet count: 3
Gaussian count: 0
03:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>

and immediately after this put <ready_to_report/>

So this part becomes
<stderr_out>
<![CDATA[
<stderr_txt>
setiathome_v8 8.00 DevC++/MinGW/g++ 4.8.1
libboinc: 7.7.0
Work Unit Info:
...............
WU true angle range is : 0.008002
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_vGetPowerSpectrumUnrolled 0.000780 0.00000
sse3_ChirpData_ak8 0.015591 0.00000
v_vTranspose4x16ntw 0.005796 0.00000
AK SSE folding 0.003139 0.00000
Flopcounter: 44660803580862.398000
Spike count: 1
Autocorr count: 0
Pulse count: 9
Triplet count: 3
Gaussian count: 0
03:39:31 (4516): called boinc_finish(0)
</stderr_txt>
]]>
</stderr_out>
<ready_to_report/>

Then restart BOINC.

This will force BOINC to report the task. If the result lines are missing from the client file, the task will fail to validate, but at least it moves out of the client and out of your tasks list. The problem happens when BOINC uploads the result file but doesn't get an ACK (short for "acknowledged", meaning the file has arrived) from the server. Apparently BOINC then doesn't retry the upload, and the task stays in this stuck state until it times out.

If the task is still visible in the Transfer tab, you can do a Retry now on it and see if it uploads.
If it still doesn't upload, make sure task_debug is checked in Options->Event log options, retry the upload, and post the corresponding part of the event log here.
This only helps while the task is still visible in the Transfer tab; otherwise you have to force-report it as described above.
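
The manual edit described above can also be sketched as a small script. This is only an illustration, not a BOINC tool: the function name insert_ready_to_report is made up here, it works on the file contents as plain text, and it assumes each <result> block ends at the first </result> after it. As with the manual edit, BOINC should be fully exited before the real client_state.xml is touched, and keeping a backup copy is a sensible precaution.

```python
def insert_ready_to_report(state_text: str, result_name: str) -> str:
    """Return state_text with <ready_to_report/> added after </stderr_out>
    inside the <result> block whose <name> equals result_name.

    Hypothetical helper for illustration; plain string handling is used so
    the rest of the file is left byte-for-byte unchanged.
    """
    name_tag = f"<name>{result_name}</name>"
    pos = 0
    while True:
        start = state_text.find("<result>", pos)
        if start == -1:
            raise ValueError(f"no <result> entry named {result_name!r}")
        end = state_text.find("</result>", start)
        if end == -1:
            raise ValueError("unterminated <result> block")
        block = state_text[start:end]
        if name_tag in block:
            break
        pos = end + len("</result>")

    if "<ready_to_report/>" in block:
        return state_text  # already flagged, nothing to do

    marker = "</stderr_out>"
    rel = block.rfind(marker)
    if rel == -1:
        raise ValueError("no </stderr_out> in the matching <result> block")
    insert_at = start + rel + len(marker)
    return state_text[:insert_at] + "\n<ready_to_report/>" + state_text[insert_at:]


# Demo on a tiny made-up excerpt, not on a real client_state.xml.
sample = (
    "<result>\n<name>a_0</name>\n<stderr_out>\nx\n</stderr_out>\n</result>\n"
    "<result>\n<name>a_1</name>\n<state>4</state>\n"
    "<stderr_out>\ny\n</stderr_out>\n<wu_name>a</wu_name>\n</result>\n"
)
patched = insert_ready_to_report(sample, "a_1")
print(patched)
```

For the task in this thread the result name would be 15no08ae.30420.7025.3.30.103.vlar_1. Only the matching <result> block is changed, and running the function a second time leaves the text as it is.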
Brent Norman
Volunteer tester

Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1895645 - Posted: 16 Oct 2017, 10:42:06 UTC - in response to Message 1895643.  

Thanks Jord, there were some results in the client file as well.

Task sent and validated :) https://setiathome.berkeley.edu/workunit.php?wuid=2660401415

Much appreciated,
Brent
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1895649 - Posted: 16 Oct 2017, 12:35:23 UTC - in response to Message 1895645.  

Huzzah! :-)


 