**THREAD CLOSED, THREAD CLOSED**

W-K 666 Project Donor
Volunteer tester

Joined: 18 May 99
Posts: 19144
Credit: 40,757,560
RAC: 67
United Kingdom
Message 177475 - Posted: 13 Oct 2005, 10:41:29 UTC

My earlier optimism was misplaced. After uploading all units earlier, I returned and found both now have units waiting in the transfer queue. But I am still downloading new units to crunch.

Andy
ID: 177475 · Report as offensive
2of12
Joined: 12 Jul 04
Posts: 54
Credit: 5,716,632
RAC: 0
United States
Message 177476 - Posted: 13 Oct 2005, 10:47:31 UTC

OK, I'm downloading new units, slowly, but no uploading has gone on for 24 hrs now.
ID: 177476 · Report as offensive
Profile Jason Safoutin
Volunteer tester
Joined: 8 Sep 05
Posts: 1386
Credit: 200,389
RAC: 0
United States
Message 177477 - Posted: 13 Oct 2005, 10:50:12 UTC

That is very weird. My WUs have all uploaded. I am analyzing 1 WU now, with one on standby/ready to run. So far everything is checking out. I will only know if the issue exists when the next WU uploads or tries to.

Again, I am noticing that the "Results waiting to be sent" figure on the Server Status page is growing enormously. I have never seen it this high, and I will be surprised if it goes above 600,000.
"By faith we understand that the universe was formed at God's command, so that what is seen was not made out of what was visible". Hebrews 11.3

ID: 177477 · Report as offensive
Profile Jason Safoutin
Volunteer tester
Joined: 8 Sep 05
Posts: 1386
Credit: 200,389
RAC: 0
United States
Message 177481 - Posted: 13 Oct 2005, 11:00:23 UTC
Last modified: 13 Oct 2005, 11:03:58 UTC

Just a reminder: I DID e-mail Berkeley about this issue at about 6 or 7 pm Eastern time yesterday (Oct. 12, 2005), so I am sure they know about it. I see SOME people are still having the issue. At the moment, as far as I can see, I am not, but that may change. I would imagine that Berkeley knows about the problem. When it will be fixed or what is wrong... I do not know :(
"By faith we understand that the universe was formed at God's command, so that what is seen was not made out of what was visible". Hebrews 11.3

ID: 177481 · Report as offensive
Ingleside
Volunteer developer

Joined: 4 Feb 03
Posts: 1546
Credit: 15,832,022
RAC: 13
Norway
Message 177485 - Posted: 13 Oct 2005, 11:26:59 UTC - in response to Message 177477.  

Again, I am noticing that the "Results waiting to be sent" figure on the Server Status page is growing enormously. I have never seen it this high, and I will be surprised if it goes above 600,000.


"Ready to send" has been growing to 650k for weeks now, then dropping to 500k before starting to grow again.
ID: 177485 · Report as offensive
Profile The Gas Giant
Volunteer tester
Joined: 22 Nov 01
Posts: 1904
Credit: 2,646,654
RAC: 0
Australia
Message 177486 - Posted: 13 Oct 2005, 11:27:08 UTC - in response to Message 177481.  

Just a reminder: I DID e-mail Berkeley about this issue at about 6 or 7 pm Eastern time yesterday (Oct. 12, 2005), so I am sure they know about it. I see SOME people are still having the issue. At the moment, as far as I can see, I am not, but that may change. I would imagine that Berkeley knows about the problem. When it will be fixed or what is wrong... I do not know :(

Still having the problem here. 4 trying to upload and 12 trying to download. A few uploads and 8 downloads have completed. No problems really as BOINC is doing its thing retrying and then backing off if it fails with the odd one getting through.
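
For anyone wondering what "retrying and then backing off" looks like in general, here is a minimal sketch of randomized exponential backoff. It is not taken from the BOINC source; the function name, the 60-second base, and the 4-hour cap are all assumptions picked for illustration.

```python
import random


def backoff_delay(attempt, base=60.0, cap=4 * 3600.0, rng=random.random):
    """Return a randomized wait in seconds before retry number `attempt` (0-based).

    Hypothetical parameters: `base` is the first ceiling, `cap` is the largest
    ceiling allowed. The ceiling doubles with each failed attempt, and the
    actual wait is drawn from the upper half of it so that many clients do not
    all retry at the same instant.
    """
    ceiling = min(cap, base * (2 ** attempt))
    return ceiling * (0.5 + 0.5 * rng())


if __name__ == "__main__":
    for attempt in range(8):
        print(f"attempt {attempt}: wait about {backoff_delay(attempt):.0f} s")
```

The point of the doubling is that a struggling server sees fewer and fewer retries from each client, while the occasional transfer still gets through.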

Live long and crunch.
ID: 177486 · Report as offensive
Daniel Schaalma
Volunteer tester
Joined: 28 May 99
Posts: 297
Credit: 16,953,703
RAC: 0
United States
Message 177489 - Posted: 13 Oct 2005, 11:28:43 UTC

I am also having problems with both upload and download on all 17 of my machines. *Occasionally* I will get an upload or download to complete, but all my machines have a large backlog of results waiting to UL/DL. I've tried using a proxy server to do the transfers, but there is no difference.

In all instances during uploads, the transfer stops after exactly 0.23KB of data is transferred. Then it times out and returns to 0.00KB transferred. Upon successful upload, whatever the size of the file, exactly 0.23KB is added to the file size, and the percentage goes over 100% by whatever percentage 0.23KB is of the total file size. For example, if the upload file size is 12.99KB, then it will show that 13.22KB of 12.99KB was transferred and 101.77% under Progress. If the upload file size is 9.70KB, then it will show that 9.93KB of 9.70KB was transferred and 102.37% under Progress.

I think that there is definitely something wrong at the server end. For the last several weeks, since the recovery from the last weeklong outage, we have had these Wednesday 3-hour outages for maintenance, and the recovery from these outages has been swift and hardly even noticeable. I have to concur with others who have suggested that either some "switch" did not get flipped, or there is possibly some sort of hardware malfunction such as another disk failure in the RAID array.

At any rate, there is nothing we can do at the moment but wait until the staff at UCB arrive at work, go over the server logs, and troubleshoot the issue. I wouldn't expect a resolution for at least another 6 hours or so. As of this post, it is going on 4:30 AM in Sunny California, and I would be surprised if they started work any earlier than 8:00 AM.
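
The over-100% Progress figures quoted above can be reproduced with a few lines of arithmetic. This is only a sketch of the calculation, assuming the display simply divides the reported bytes by the file size; the function name is made up for the example.

```python
def reported_progress(file_kb, extra_kb=0.23):
    """Return (KB shown as transferred, percent shown under Progress).

    Assumes the extra 0.23KB is added on top of the real file size and the
    percentage is reported size divided by file size.
    """
    shown_kb = file_kb + extra_kb
    percent = shown_kb / file_kb * 100
    return round(shown_kb, 2), round(percent, 2)


if __name__ == "__main__":
    for size in (12.99, 9.70):
        shown, pct = reported_progress(size)
        print(f"{shown:.2f}KB of {size:.2f}KB transferred, {pct:.2f}%")
```

Both examples from the post come out the same way: 13.22KB of 12.99KB gives 101.77%, and 9.93KB of 9.70KB gives 102.37%.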

Regards, Daniel.
ID: 177489 · Report as offensive
Profile Thierry Van Driessche
Volunteer tester
Joined: 20 Aug 02
Posts: 3083
Credit: 150,096
RAC: 0
Belgium
Message 177490 - Posted: 13 Oct 2005, 11:50:21 UTC
Last modified: 13 Oct 2005, 11:52:36 UTC

Uploads and downloads are failing from Belgium too; there were no problems at all before the weekly outage.

Just the last few messages:

13/10/2005 12:00:24|SETI@home|Started download of 20oc03ab.15181.14096.47148.65
13/10/2005 12:00:47||Couldn't connect to hostname [setiboincdata.ssl.berkeley.edu]
13/10/2005 12:00:47|SETI@home|Temporarily failed download of 20oc03ab.15181.14096.47148.65: system I/O
13/10/2005 13:12:59|SETI@home|Started download of 20oc03ab.15181.14096.47148.65
13/10/2005 13:13:22||Couldn't connect to hostname [setiboincdata.ssl.berkeley.edu]
13/10/2005 13:13:22|SETI@home|Temporarily failed download of 20oc03ab.15181.14096.47148.65: system I/O
13/10/2005 13:25:53|SETI@home|Started upload of 30ap04aa.22175.14336.203404.19_0_0
13/10/2005 13:26:15||Couldn't connect to hostname [setiboincdata.ssl.berkeley.edu]
13/10/2005 13:26:15|SETI@home|Temporarily failed upload of 30ap04aa.22175.14336.203404.19_0_0: system I/O
13/10/2005 13:43:55|SETI@home|Started download of 20oc03ab.15181.14817.273560.202
13/10/2005 13:44:17||Couldn't connect to hostname [setiboincdata.ssl.berkeley.edu]
13/10/2005 13:44:17|SETI@home|Temporarily failed download of 20oc03ab.15181.14817.273560.202: system I/O

No problems with 3 other projects.
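
If anyone wants to tally their own messages instead of scrolling through them, here is a rough sketch. The `timestamp|project|message` layout is inferred only from the lines pasted above, and the function names are invented for the example; a real log may contain other message types that this ignores.

```python
from collections import Counter


def parse_line(line):
    """Split one message line into (timestamp, project, message).

    The project field is empty for client-level messages such as the
    "Couldn't connect" lines.
    """
    timestamp, project, message = line.rstrip("\n").split("|", 2)
    return timestamp, project, message


def count_events(lines):
    """Count transfer starts, temporary failures, and connection errors."""
    counts = Counter()
    for line in lines:
        _, _, message = parse_line(line)
        if message.startswith("Started"):
            counts["started"] += 1
        elif message.startswith("Temporarily failed"):
            counts["failed"] += 1
        elif message.startswith("Couldn't connect"):
            counts["connect_errors"] += 1
    return counts


SAMPLE = [
    "13/10/2005 12:00:24|SETI@home|Started download of 20oc03ab.15181.14096.47148.65",
    "13/10/2005 12:00:47||Couldn't connect to hostname [setiboincdata.ssl.berkeley.edu]",
    "13/10/2005 12:00:47|SETI@home|Temporarily failed download of 20oc03ab.15181.14096.47148.65: system I/O",
]

if __name__ == "__main__":
    print(dict(count_events(SAMPLE)))
```

The timestamp is kept as plain text on purpose, since its format depends on the regional settings of the machine that wrote the log.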
ID: 177490 · Report as offensive
Profile Darth Dogbytes™
Volunteer tester

Joined: 30 Jul 03
Posts: 7512
Credit: 2,021,148
RAC: 0
United States
Message 177493 - Posted: 13 Oct 2005, 11:59:09 UTC

None of my five hosts on SETI are able to upload or download. Something must have gotten screwed up in the Wednesday maintenance schedule.
Account frozen...
ID: 177493 · Report as offensive
Profile Thierry Van Driessche
Volunteer tester
Joined: 20 Aug 02
Posts: 3083
Credit: 150,096
RAC: 0
Belgium
Message 177494 - Posted: 13 Oct 2005, 12:03:04 UTC
Last modified: 13 Oct 2005, 12:04:12 UTC

There is definitely something wrong, as both the upload and download folders are inaccessible.
ID: 177494 · Report as offensive
Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 177495 - Posted: 13 Oct 2005, 12:04:47 UTC

"Circle the wagons"... I think the Indians did it. The APACHE, to be precise. See the clipping from Rom I found on the Classic BB:

>
> Those additional tasks are running on different machines. Actually the
> upload/download server only ever deals with the NFS file server and doesn't even
> know or care whether the other servers exist.
>
> All network I/O is handled through apache. Apache starts the
> file_upload_handler when a file is uploaded from a client.
ID: 177495 · Report as offensive
Profile Sharlee

Joined: 4 Jun 00
Posts: 8
Credit: 737,012
RAC: 0
United States
Message 177496 - Posted: 13 Oct 2005, 12:07:02 UTC

I am having the same sort of problems: I have units in the queue to upload, but they fail. I am also having a problem with the BOINC scheduler: it doesn't want to give me any SETI work units. It goes 4 days, then when forced gives me 8-10 work units, and then makes no requests for 4 more days. I can finish them all in 10 to 12.5 hours. I have the scheduler set for 60 Climate Prediction / 40 SETI. I have to force it to download WUs by suspending the other project. Something definitely isn't working right.
ID: 177496 · Report as offensive
Profile Wayne
Volunteer tester

Joined: 19 Oct 02
Posts: 13
Credit: 88,859
RAC: 0
Australia
Message 177498 - Posted: 13 Oct 2005, 12:12:21 UTC

I'm having trouble both uploading and downloading. My PC recently requested approx. 16 WUs, of which only 6 have downloaded, 2 have 'Download Failed', and 8 are still on 'Downloading', no doubt about to fail too. I also have 7 WUs waiting to upload, all of which have status 'Retry in xx:xx:xx'.

I have screens upon screens of...
13/10/2005 9:36:47 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.138
13/10/2005 9:36:47 PM|SETI@home|Unrecoverable error for result 20oc03ab.15181.14769.598568.138_0 (WU download error: couldn't get input files:<file_xfer_error> <file_name>20oc03ab.15181.14769.598568.138</file_name> <error_code>-200</error_code> <error_message></error_message></file_xfer_error>)
13/10/2005 9:36:48 PM|SETI@home|Deferring communication with project for 1 days, 23 hours, 23 minutes, and 27 seconds
13/10/2005 9:37:08 PM|SETI@home|Temporarily failed download of 20oc03ab.15181.14769.598568.138: -106
13/10/2005 9:37:08 PM|SETI@home|Backing off 1 minutes and 1 seconds on download of file 20oc03ab.15181.14769.598568.138
13/10/2005 9:37:42 PM|SETI@home|Started upload of 30ap04aa.22175.16386.617302.9_0_0
13/10/2005 9:37:48 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.147
13/10/2005 9:38:10 PM|SETI@home|Temporarily failed download of 20oc03ab.15181.14769.598568.147: -106
13/10/2005 9:38:10 PM|SETI@home|Backing off 1 minutes and 13 seconds on download of file 20oc03ab.15181.14769.598568.147
13/10/2005 9:38:10 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.138
13/10/2005 9:38:29 PM|SETI@home|Temporarily failed upload of 30ap04aa.22175.16386.617302.9_0_0: 500
13/10/2005 9:38:29 PM|SETI@home|Backing off 2 hours, 21 minutes, and 9 seconds on upload of file 30ap04aa.22175.16386.617302.9_0_0
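
To compare how long these waits have become, the durations in the "Backing off" and "Deferring communication" messages can be turned into seconds. This is a small sketch that relies only on the wording seen in the lines above; the names are made up for the example.

```python
import re

# Seconds per unit, keyed by the singular form used in the regex below.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "minute": 60, "second": 1}

# Matches parts such as "2 hours", "21 minutes", "1 seconds".
DURATION_PART = re.compile(r"(\d+) (day|hour|minute|second)s?")


def backoff_seconds(message):
    """Sum every '<number> <unit>' part found in a log message, in seconds."""
    return sum(
        int(amount) * UNIT_SECONDS[unit]
        for amount, unit in DURATION_PART.findall(message)
    )


if __name__ == "__main__":
    line = ("Backing off 2 hours, 21 minutes, and 9 seconds "
            "on upload of file 30ap04aa.22175.16386.617302.9_0_0")
    print(backoff_seconds(line))
```

On the lines above, the download retries wait 61 and 73 seconds, while the upload retry waits 8469 seconds, which is the jump from minutes to hours.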
ID: 177498 · Report as offensive
Ricky@SETI.USA
Joined: 4 Sep 04
Posts: 453
Credit: 1,586,857
RAC: 0
United States
Message 177505 - Posted: 13 Oct 2005, 12:51:34 UTC - in response to Message 177498.  

I'm having trouble both uploading and downloading. My PC recently requested approx. 16 WUs, of which only 6 have downloaded, 2 have 'Download Failed', and 8 are still on 'Downloading', no doubt about to fail too. I also have 7 WUs waiting to upload, all of which have status 'Retry in xx:xx:xx'.

I have screens upon screens of...
13/10/2005 9:36:47 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.138
13/10/2005 9:36:47 PM|SETI@home|Unrecoverable error for result 20oc03ab.15181.14769.598568.138_0 (WU download error: couldn't get input files:<file_xfer_error> <file_name>20oc03ab.15181.14769.598568.138</file_name> <error_code>-200</error_code> <error_message></error_message></file_xfer_error>)
13/10/2005 9:36:48 PM|SETI@home|Deferring communication with project for 1 days, 23 hours, 23 minutes, and 27 seconds
13/10/2005 9:37:08 PM|SETI@home|Temporarily failed download of 20oc03ab.15181.14769.598568.138: -106
13/10/2005 9:37:08 PM|SETI@home|Backing off 1 minutes and 1 seconds on download of file 20oc03ab.15181.14769.598568.138
13/10/2005 9:37:42 PM|SETI@home|Started upload of 30ap04aa.22175.16386.617302.9_0_0
13/10/2005 9:37:48 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.147
13/10/2005 9:38:10 PM|SETI@home|Temporarily failed download of 20oc03ab.15181.14769.598568.147: -106
13/10/2005 9:38:10 PM|SETI@home|Backing off 1 minutes and 13 seconds on download of file 20oc03ab.15181.14769.598568.147
13/10/2005 9:38:10 PM|SETI@home|Started download of 20oc03ab.15181.14769.598568.138
13/10/2005 9:38:29 PM|SETI@home|Temporarily failed upload of 30ap04aa.22175.16386.617302.9_0_0: 500
13/10/2005 9:38:29 PM|SETI@home|Backing off 2 hours, 21 minutes, and 9 seconds on upload of file 30ap04aa.22175.16386.617302.9_0_0


I am getting the same thing here but have not seen the errors that you show. I don't know how to copy and paste from the log to here, but mine is also backing off in hours now, where at one time it was backing off in seconds or minutes. EVERYTHING was fine before the weekly outage thingy.
Ricky


ID: 177505 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13769
Credit: 208,696,464
RAC: 304
Australia
Message 177506 - Posted: 13 Oct 2005, 12:55:29 UTC


Might as well add to the list.

Can contact the scheduler, and the scheduler responds, but no work is available. The 2nd time around it queued up a bunch of Work Units to download.

Unable to return *any* results, most new Work Units not downloading, those that do are only coming down at about 4kB/s (usually around 16kB/s).
Grant
Darwin NT
ID: 177506 · Report as offensive
Profile Karl Roos
Joined: 19 Mar 01
Posts: 36
Credit: 206,258,788
RAC: 0
United States
Message 177509 - Posted: 13 Oct 2005, 12:58:33 UTC

Good thing they worked out the bugs when migrating from Classic.
ID: 177509 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13769
Credit: 208,696,464
RAC: 304
Australia
Message 177511 - Posted: 13 Oct 2005, 12:58:58 UTC


Looking at the graphs, apart from a small spike at around 00:00, since the outage the throughput has been slowly but steadily declining.
Grant
Darwin NT
ID: 177511 · Report as offensive
Profile [B@H] Ray
Volunteer tester
Joined: 1 Sep 00
Posts: 485
Credit: 45,275
RAC: 0
United States
Message 177520 - Posted: 13 Oct 2005, 13:13:19 UTC

It is 6 AM PDT (Berkeley time), and one system still can't upload 3 units; 2 went up from it. I am sure that they will go sometime today. I don't know what the SETI team did during the downtime, but they should read this thread and then find out what is going on with the system.
Ray


Pizza@Home Rays Place Rays place Forums
ID: 177520 · Report as offensive
Profile Steve @ SETI.USA
Joined: 5 Sep 04
Posts: 189
Credit: 1,016,797
RAC: 0
United States
Message 177523 - Posted: 13 Oct 2005, 13:15:26 UTC

Well, at least the server status page shows all is well. I do wonder if/when BOINC will ever stop feeling like a beta test project. After a month of relatively smooth operation, I guess we were overdue for something like this to start happening again.

http://www.setiusa.net
ID: 177523 · Report as offensive
Profile Fuzzy Hollynoodles
Volunteer tester
Joined: 3 Apr 99
Posts: 9659
Credit: 251,998
RAC: 0
Message 177524 - Posted: 13 Oct 2005, 13:22:14 UTC

Here we go again! :-D


"I'm trying to maintain a shred of dignity in this world." - Me

ID: 177524 · Report as offensive
 