Things working ok again.

Ulrich Metzner
Volunteer tester
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 81355 - Posted: 20 Feb 2005, 22:42:38 UTC

From the technical news page:

February 20, 2005 - 18:45 UTC
The data server NAS box may be OK at this point, although we have not subjected it to the full production load yet. We currently have the data server turned off so that the file deleter can at least partially clear a large backlog of workunit and result files that are scheduled for deletion. This backlog is a hold over problem from when our database server was on a slow machine. Validation and assimilation are both currently on.

Aloha, Uli

ID: 81355
Bob Chr. Laryea
Joined: 1 May 02
Posts: 122
Credit: 83,877
RAC: 0
Denmark
Message 81356 - Posted: 20 Feb 2005, 22:47:07 UTC - in response to Message 81353.  

>
> Once again things are green, but still no response from the Scheduler &
> still unable to return results.
>

Same problem here. :-(
Regards
ID: 81356
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13961
Credit: 208,696,464
RAC: 304
Australia
Message 81357 - Posted: 20 Feb 2005, 22:49:49 UTC - in response to Message 81355.  

> From the technical news page:
>
> February 20, 2005 - 18:45 UTC
> The data server NAS box may be OK at this point, although we have not
> subjected it to the full production load yet. We currently have the data
> server turned off so that the file deleter can at least partially clear a
> large backlog of workunit and result files that are scheduled for deletion.
> This backlog is a hold over problem from when our database server was on a
> slow machine. Validation and assimilation are both currently on.


That was at 18:45 UTC - February 20, 2005
As of 20 Feb 2005 22:40:08 UTC all is green other than the splitters.

Maybe they need another status box?
Grant
Darwin NT
ID: 81357
Ulrich Metzner
Volunteer tester
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 81362 - Posted: 20 Feb 2005, 23:10:26 UTC - in response to Message 81357.  
Last modified: 20 Feb 2005, 23:10:51 UTC

> That was at 18:45 UTC - February 20, 2005
> As of 20 Feb 2005 22:40:08 UTC all is green other than the splitters.
>
> Maybe they need another status box?
>

I think the whole system is running normal (hence the green statii) but is disconnected from the net to play 'catch up' with the back log.

Aloha, Uli

ID: 81362
Hans Dorn
Volunteer developer
Volunteer tester
Joined: 3 Apr 99
Posts: 2262
Credit: 26,448,570
RAC: 0
Germany
Message 81366 - Posted: 20 Feb 2005, 23:16:41 UTC - in response to Message 81362.  


> I think the whole system is running normal (hence the green statii) but is
> disconnected from the net to play 'catch up' with the back log.

It looks like the NAS box can't take the heat. They're trying to get rid of completed results to lower the load.

I'm wondering what kind of cr*p is sold as professional hardware nowadays.


Regards Hans

P.S:
Yep I'm getting kinda angry now.


ID: 81366
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13961
Credit: 208,696,464
RAC: 304
Australia
Message 81368 - Posted: 20 Feb 2005, 23:25:32 UTC - in response to Message 81362.  

> I think the whole system is running normal (hence the green statii) but is
> disconnected from the net to play 'catch up' with the back log.

If it's not online, then it should be Red IMHO.
Grant
Darwin NT
ID: 81368
NICK
Joined: 25 Jan 04
Posts: 7
Credit: 15,795
RAC: 0
United States
Message 81380 - Posted: 21 Feb 2005, 0:02:21 UTC

Nuttins workin on seti. Been lookin at same numbers for a week now. Everything says failed messages. No uploads. No downloads. Goin to bar and think this over. Couple of beers never hurt anyone but I think if I drink til this is fixed, I'll be a mess.
ID: 81380
Nuadormrac
Volunteer tester
Joined: 7 Apr 00
Posts: 136
Credit: 1,703,351
RAC: 0
United States
Message 81388 - Posted: 21 Feb 2005, 0:23:00 UTC

The status page says all is OK, and the validation backlog has certainly been going down (which has given me some credit from the pending). But the schedulers haven't responded for me, much the same as everyone else is seeing.

I've pretty much suspended SETI (a 4.20 and 4.22 feature) and resume it once a day to try uploading what I have. I've got enough work from other projects to keep my CPU busy. If some people are getting something uploaded occasionally, I might just let it keep connecting the last day or 2 before deadline to see if those results can get reported before deadline though.

ID: 81388
Prognatus
Joined: 6 Jul 99
Posts: 1600
Credit: 391,546
RAC: 0
Norway
Message 81390 - Posted: 21 Feb 2005, 0:26:01 UTC


[As of 21 Feb 2005 0:20:09 UTC]
Ready to send 202,023
In progress 1,414,052
Waiting for validation 3,866
Waiting to transition 0

The "Ready to send"-number is not changing...
Does this mean they're not sending any WU's out from BOINC SETI?

And what does "In progress" mean? This number is increasing...

ID: 81390
Everette Dobbins
Joined: 13 Jan 00
Posts: 291
Credit: 22,594,655
RAC: 0
United States
Message 81395 - Posted: 21 Feb 2005, 0:33:13 UTC - in response to Message 81366.  

>
> > I think the whole system is running normal (hence the green statii) but is
> > disconnected from the net to play 'catch up' with the back log.
>
> It looks like the NAS box can't take the heat. They're trying to get rid of
> completed results to lower the load.
>
> I'm wondering what kind of cr*p is sold as professional hardware nowadays.
>
>
> Regards Hans
>
> P.S:
> Yep I'm getting kinda angry now.
>
>
>
I don't think any of the hardware they are using is bought; it is all donated, so it's free. Even with vendor support the hardware seems to keep coming up with new issues. I think seti@home 2 is the favored program. If the vendors are donating hardware to see how it will hold up under extreme conditions, they picked the right program. Hope they fix enough issues to get us going again. I am going to be scaling back my operation to 5 computers to give the new guys a chance to try this out. When they swap over seticlassic we will probably be having these upsets a lot.
ID: 81395
1mp0£173
Volunteer tester
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 81401 - Posted: 21 Feb 2005, 0:57:19 UTC - in response to Message 81357.  


> That was at 18:45 UTC - February 20, 2005
> As of 20 Feb 2005 22:40:08 UTC all is green other than the splitters.
>
> Maybe they need another status box?

We know that they need a bigger server closet before they can add another status box.

I suspect that when Classic ends they'll have more hardware and more room.
ID: 81401
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13961
Credit: 208,696,464
RAC: 304
Australia
Message 81552 - Posted: 21 Feb 2005, 12:29:19 UTC - in response to Message 81401.  

> We know that they need a bigger server closet before they can add another
> status box.

Nah, by another status box I was referring to the red/green indicator boxes on the Server Status page.
Might be worth adding one specifically for the NAS: when it falls over it doesn't matter if the Schedulers are working or not, no data can move in either direction.
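
To illustrate the idea, here is a rough sketch of how a status page might fold a NAS indicator into the colour volunteers see. The component names (`nas`, `scheduler`) and the function are made up for illustration; nothing in this thread says how the real Server Status page is built.

```python
def transfer_status(components):
    """Return "green" or "red" for data transfers.

    `components` is a hypothetical dict mapping a component name to
    True (up) or False (down). Missing components count as down.
    """
    # If the storage box is down, no data moves in either direction,
    # whether or not the scheduler is answering.
    if not components.get("nas", False):
        return "red"
    # With storage up, transfers still need a working scheduler.
    if not components.get("scheduler", False):
        return "red"
    return "green"


if __name__ == "__main__":
    # Scheduler up but NAS down: the indicator should not be green.
    print(transfer_status({"nas": False, "scheduler": True}))
```

The point of the sketch is only that the NAS check comes first, so a healthy scheduler cannot mask a storage outage.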
Grant
Darwin NT
ID: 81552
Aurora Borealis
Volunteer tester
Joined: 14 Jan 01
Posts: 3075
Credit: 5,631,463
RAC: 0
Canada
Message 81594 - Posted: 21 Feb 2005, 15:35:30 UTC
Last modified: 21 Feb 2005, 16:07:46 UTC

I finally got to UL/DL a few WUs today for the first time in a week. I guess the servers may actually be up for real.

---EDIT---
Looks like I was premature in my exuberance.
I cleared a WU that had previously been ULed, but no new upload.
I DLed a WU, but it ended with
2/21/05 10:47:56 AM|SETI@home|signature verification error for setiathome_4.09_windows_intelx86.pdb
2/21/05 10:47:56 AM|SETI@home|Checksum or signature error for setiathome_4.09_windows_intelx86.pdb
2/21/05 10:47:56 AM|SETI@home|Unrecoverable error for result 04ja05aa.28108.10001.229830.102_4 (app_version download error: couldn't get input files: setiathome_4.09_windows_intelx86.pdb -120 signature verification error)
2/21/05 10:47:57 AM|SETI@home|Deferring communication with project for 59 seconds

Three attempts have now failed.



Boinc V7.2.42
Win7 i5 3.33G 4GB, GTX470
ID: 81594
Claudius
Joined: 26 Mar 01
Posts: 21
Credit: 23,892,200
RAC: 13
Germany
Message 81605 - Posted: 21 Feb 2005, 15:57:19 UTC - in response to Message 81594.  

Nothing working here... I can still compute some unfinished WUs but I can't upload finished WUs.


SETI@home - 2005-02-21 16:46:10 - Giving up on download of 04ja05aa.28108.9712.140892.151: Downloaded file had wrong size: expected 361971, got 0
SETI@home - 2005-02-21 16:46:10 - MD5 computation error for 04ja05aa.28108.9712.140892.151: -108
SETI@home - 2005-02-21 16:46:10 - Checksum or signature error for 04ja05aa.28108.9712.140892.151
SETI@home - 2005-02-21 16:46:10 - Unrecoverable error for result 04ja05aa.28108.9712.140892.151_6 (WU download error: couldn't get input files:
04ja05aa.28108.9712.140892.151: MD5 computation error
)
SETI@home - 2005-02-21 16:46:13 - Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
SETI@home - 2005-02-21 16:46:13 - No schedulers responded
SETI@home - 2005-02-21 16:46:13 - Deferring communication with project for 1 minutes and 0 seconds
--- - 2005-02-21 16:47:14 - May run out of work in 1.00 days; requesting more
SETI@home - 2005-02-21 16:47:14 - Requesting 71382 seconds of work
SETI@home - 2005-02-21 16:47:14 - Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
SETI@home - 2005-02-21 16:47:21 - Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded
SETI@home - 2005-02-21 16:47:21 - Message from server: Not sending work - last RPC too recent: 92 sec
SETI@home - 2005-02-21 16:47:21 - No work from project
SETI@home - 2005-02-21 16:47:21 - Deferring communication with project for 10 minutes and 0 seconds
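
For anyone trying to make sense of a log like the one above, here is a minimal sketch that counts what kind of messages it contains. It assumes each entry has the shape `<project> - YYYY-MM-DD HH:MM:SS - <message>` as in this excerpt; the category names and the phrase list are my own picks from the lines shown here, not anything defined by the BOINC client.

```python
import re
from collections import Counter

# Assumed line shape, taken from the excerpt above:
#   "<project> - YYYY-MM-DD HH:MM:SS - <message>"
LINE_RE = re.compile(
    r"^(?P<project>\S+) - "
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) - "
    r"(?P<msg>.*)$"
)

# Hypothetical categories keyed on phrases that appear in the excerpt.
PATTERNS = {
    "download_error": "Giving up on download",
    "scheduler_failed": "No schedulers responded",
    "rpc_too_recent": "last RPC too recent",
    "deferred": "Deferring communication",
}


def classify(line):
    """Return a category label, "other", or None for non-log lines."""
    m = LINE_RE.match(line.strip())
    if m is None:
        # Continuation lines (e.g. a lone ")") do not match the shape.
        return None
    for label, phrase in PATTERNS.items():
        if phrase in m.group("msg"):
            return label
    return "other"


def summarize(lines):
    """Count log entries per category, skipping lines that do not parse."""
    counts = Counter()
    for line in lines:
        label = classify(line)
        if label is not None:
            counts[label] += 1
    return counts
```

Running `summarize` over a saved message log gives a quick picture of whether the client is mostly failing on downloads, not reaching the scheduler at all, or reaching it and being told to back off.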



What the heck ?

Is that normal ? Whats going on ?



http://www.martin-karch.de
ID: 81605
Arm
Joined: 12 Sep 03
Posts: 308
Credit: 15,584,777
RAC: 0
Message 81608 - Posted: 21 Feb 2005, 16:13:07 UTC

Maybe it is a good idea to upgrade the client to 4.19 before trying to UL. 4.19 handles ULs and DLs better.

ID: 81608
EdwardPF
Volunteer tester
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 81609 - Posted: 21 Feb 2005, 16:13:38 UTC

In the last 10 minutes I have gotten a full download to and upload from all 5 of my computers ... It looks like things are working ...

EdwardPF
ID: 81609
Aurora Borealis
Volunteer tester
Joined: 14 Jan 01
Posts: 3075
Credit: 5,631,463
RAC: 0
Canada
Message 81611 - Posted: 21 Feb 2005, 16:22:29 UTC - in response to Message 81608.  

> Maybe it is good idea to upgrade the client to 4.19 before trying to UL. 4.19
> can handle better ULs and DLs.
>
>
I'm already using 4.22. No joy for now.



Boinc V7.2.42
Win7 i5 3.33G 4GB, GTX470
ID: 81611
Arm
Joined: 12 Sep 03
Posts: 308
Credit: 15,584,777
RAC: 0
Message 81616 - Posted: 21 Feb 2005, 16:30:00 UTC - in response to Message 81611.  

> > Maybe it is good idea to upgrade the client to 4.19 before trying to UL. 4.19
> > can handle better ULs and DLs.
> >
> >
> I'm already using 4.22. No joy for now.
>
>
>
Strange. For me 4.19 is fine on Windows XP. I've UL-ed more than 40 WUs. Most of them had a personal time-counter above 35 mins and UL-ed w/o errors...

ID: 81616
UBT - PaulT
Volunteer tester
Joined: 17 Dec 00
Posts: 25
Credit: 173,834
RAC: 0
United Kingdom
Message 81617 - Posted: 21 Feb 2005, 16:33:19 UTC - in response to Message 81616.  

> Strange. For me 4.19 is fine on Windows XP. Ive UL-ed more than 40 WUs. Most
> of them had personal time-counter above 35 mins and UL-ed wo errors...
>
>

Same here. Took about 3 hours but I have managed to upload about 35 WUs and download another 20
ID: 81617
Claudius
Joined: 26 Mar 01
Posts: 21
Credit: 23,892,200
RAC: 13
Germany
Message 81619 - Posted: 21 Feb 2005, 16:35:10 UTC - in response to Message 81616.  

> > > Maybe it is good idea to upgrade the client to 4.19 before trying to UL. 4.19
> > > can handle better ULs and DLs.
> > >
> > >
> > I'm already using 4.22. No joy for now.
> >
> >
> >
> Strange. For me 4.19 is fine on Windows XP. Ive UL-ed more than 40 WUs. Most
> of them had personal time-counter above 35 mins and UL-ed wo errors...
>
>

I have different client versions (4.16 / 4.19 / 4.66) on different computers and none of them can communicate.



http://www.martin-karch.de
ID: 81619


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.