Upload problems.

Message boards : Number crunching : Upload problems.
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 552278 - Posted: 24 Apr 2007, 10:16:25 UTC - in response to Message 552258.  

Hi.
For the last couple of days I have been having upload problems on only one of my computers. On this machine I have 4 stuck uploads (WUs dated Apr. 22nd), while downloads and scheduler contacts happen without a single glitch. Unfortunately I don't remember reading anywhere how to "unstick" these uploads (IMO it has to be a problem with a partial file upload).
TIA,

Could you post some lines from the message tab, please, to show what happens when it (re)tries to upload the results?

I tried to do some troubleshooting a while back on messages which looked like:

Error on file upload: Socket Read incomplete: asked for [a lot of bytes], received [fewer bytes]

- which may look like a partial upload, but in fact turned out to be a complete upload of a smaller file: for some reason, BOINC had completed computation of a WU, signalled it was ready to upload, and then re-started computation of the same WU - overwriting the results file in the process.

If that's happened to you, I don't think there's any recovery possible - just abort the uploads in the 'Transfers' tab and move on. But best to get a second opinion on the messages first.
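
The diagnosis above comes down to comparing the size that was announced for the result file with the size of the file now on disk. Here is a minimal Python sketch of that comparison. The function name `compare_to_announced` and the idea of supplying the announced size by hand are assumptions for illustration; nothing here is part of BOINC itself.

```python
import os


def compare_to_announced(path: str, announced_bytes: int) -> str:
    """Compare a result file's current size with the size announced for upload.

    Hypothetical helper: the caller reads `announced_bytes` from the error
    message by hand.
    """
    actual = os.path.getsize(path)
    if actual == announced_bytes:
        return "match"
    if actual < announced_bytes:
        # Consistent with the file having been replaced by a shorter one
        # after the upload was announced.
        return "smaller"
    return "larger"
```

A "smaller" result would be consistent with the overwritten-file case described above, though it does not prove it.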
ID: 552278
Vid Vidmar*
Volunteer tester
Joined: 19 Aug 99
Posts: 136
Credit: 1,830,317
RAC: 0
Slovenia
Message 552307 - Posted: 24 Apr 2007, 11:58:57 UTC - in response to Message 552278.  

Thanks for a swift response.
As soon as I get home I will paste those log messages, but as far as I can remember all they say is "http error". The symptoms are these: if I retry the upload, there is a message "retrying upload of ...", then a pause of a couple of minutes (during which the upload status reaches 100% in the first few seconds, and the transferred bytes even exceed the file size by a few hundred bytes, around 0.5k), and after that an http error, followed by a reference site check (which checks OK) and a message: "contact to reference site ... project may be down". I will also try using a proxy.

BOINC 5.4.11 on a C2D E6600, 2 GB RAM, Windows XP SP1, connected to ADSL 2048/384 through a router.
ID: 552307
Vid Vidmar*
Volunteer tester
Joined: 19 Aug 99
Posts: 136
Credit: 1,830,317
RAC: 0
Slovenia
Message 552419 - Posted: 24 Apr 2007, 21:14:45 UTC

Success.
I set BOINC to use a proxy and all uploads went through.

ID: 552419
W-K 666 Project Donor
Volunteer tester

Joined: 18 May 99
Posts: 19014
Credit: 40,757,560
RAC: 67
United Kingdom
Message 555180 - Posted: 28 Apr 2007, 3:15:08 UTC
Last modified: 28 Apr 2007, 3:26:02 UTC

Error on file upload: no command ??
The server rejected the file, but the result shows a validation error?

That is not logical to me. I can accept the file being rejected, although I wouldn't have a clue why, and issuing another copy to another host seems logical. But going through validation is not logical.
If it was rejected, it could be a communications error, but then the result would not be there for validation.
It could have been partially received, missing the EOF marker, or of incorrect size, and then gone to validation and been designated invalid.

Below are the relevant messages:
27/04/2007 19:09:02|SETI@home|Starting task 27ja04ab.23969.28242.198574.3.50_0 using setiathome_enhanced version 517
27/04/2007 21:20:55|SETI@home|Computation for task 27ja04ab.23969.28242.198574.3.50_0 finished
27/04/2007 21:20:55|SETI@home Beta Test|Resuming task ap_15fe07aa_B0_P0_00586_20070423_09181.wu_2 using astropulse version 414
27/04/2007 21:20:57|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:26:03||Project communication failed: attempting access to reference site
27/04/2007 21:26:03|SETI@home|[file_xfer] Temporarily failed upload of 27ja04ab.23969.28242.198574.3.50_0_0: http error
27/04/2007 21:26:03|SETI@home|Backing off 1 min 0 sec on upload of file 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:26:04||Access to reference site succeeded - project servers may be temporarily down.
27/04/2007 21:27:03|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:35:08||Project communication failed: attempting access to reference site
27/04/2007 21:35:08|SETI@home|[file_xfer] Temporarily failed upload of 27ja04ab.23969.28242.198574.3.50_0_0: http error
27/04/2007 21:35:08|SETI@home|Backing off 1 min 0 sec on upload of file 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:35:09||Access to reference site succeeded - project servers may be temporarily down.
27/04/2007 21:36:08|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:36:10|SETI@home|[error] Error on file upload: no command
27/04/2007 21:36:10|SETI@home|[file_xfer] Permanently failed upload of 27ja04ab.23969.28242.198574.3.50_0_0
27/04/2007 21:36:10|SETI@home|Giving up on upload of 27ja04ab.23969.28242.198574.3.50_0_0: server rejected file
27/04/2007 22:00:53|SETI@home|Sending scheduler request: Requested by user
27/04/2007 22:00:53|SETI@home|Reporting 2 tasks
27/04/2007 22:00:57|SETI@home|Scheduler RPC succeeded [server version 509]
27/04/2007 22:00:57|SETI@home|Deferring communication for 11 sec
27/04/2007 22:00:57|SETI@home|Reason: requested by project
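
The messages above follow a `timestamp|project|message` layout, so the upload attempts can be tallied mechanically. The following is a rough Python sketch that assumes exactly that pipe-delimited layout and the message wording shown in this log. The names `LogLine`, `parse_line`, and `upload_outcomes` are made up for illustration and are not part of BOINC.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class LogLine(NamedTuple):
    timestamp: str
    project: str  # empty for client-wide messages
    message: str


def parse_line(line: str) -> LogLine:
    # Split on the first two pipes only, in case the message itself contains one.
    timestamp, project, message = line.split("|", 2)
    return LogLine(timestamp.strip(), project.strip(), message.strip())


def upload_outcomes(lines: Iterable[str]) -> Counter:
    """Count upload starts and failures, matching the wording seen in the log above."""
    counts: Counter = Counter()
    for raw in lines:
        if not raw.strip():
            continue
        message = parse_line(raw).message
        if "Started upload" in message:
            counts["started"] += 1
        elif "Temporarily failed upload" in message:
            counts["temporary_failure"] += 1
        elif "Permanently failed upload" in message:
            counts["permanent_failure"] += 1
    return counts


# A few lines copied from the log above.
SAMPLE = [
    "27/04/2007 21:20:57|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0",
    "27/04/2007 21:26:03||Project communication failed: attempting access to reference site",
    "27/04/2007 21:26:03|SETI@home|[file_xfer] Temporarily failed upload of 27ja04ab.23969.28242.198574.3.50_0_0: http error",
    "27/04/2007 21:27:03|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0",
    "27/04/2007 21:35:08|SETI@home|[file_xfer] Temporarily failed upload of 27ja04ab.23969.28242.198574.3.50_0_0: http error",
    "27/04/2007 21:36:08|SETI@home|[file_xfer] Started upload of file 27ja04ab.23969.28242.198574.3.50_0_0",
    "27/04/2007 21:36:10|SETI@home|[error] Error on file upload: no command",
    "27/04/2007 21:36:10|SETI@home|[file_xfer] Permanently failed upload of 27ja04ab.23969.28242.198574.3.50_0_0",
]

if __name__ == "__main__":
    print(dict(upload_outcomes(SAMPLE)))
```

Run on the sample lines, the tally separates the two temporary "http error" failures from the single permanent "no command" rejection that ended the third attempt.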

The workunit is 126901614

Note the green line, which is at 65 hours and 24% complete. If this problem happened to a unit like that, I can see some very seriously upset people.

Andy
ID: 555180
Odysseus
Volunteer tester
Joined: 26 Jul 99
Posts: 1808
Credit: 6,701,347
RAC: 6
Canada
Message 555534 - Posted: 28 Apr 2007, 18:46:42 UTC - in response to Message 555180.  
Last modified: 28 Apr 2007, 18:51:09 UTC

Note the green line, which is at 65 hours and 24% complete. If this problem happened to a unit like that, I can see some very seriously upset people.

“You pays your money and you takes your chances” with beta projects—and even with some that aren’t (but maybe should be) … I’ve wasted hundreds of CPU-hours’ worth of effort for one reason or another beyond my control. It’s disappointing when that happens, sure, but “very seriously upset[ting]”? How would those people react to loss of a job, breakup of a relationship, or a death in the family? ;)

ID: 555534


 