Panic Mode On (9) Server problems



Message boards : Number crunching : Panic Mode On (9) Server problems

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 814097 - Posted: 2 Oct 2008, 20:21:38 UTC - in response to Message 814085.

I think it's fascinating that 22mr08aa still has the same two active channels as it had three weeks ago on 11 September, and 23mr08aa one.


Yeah.. I think those channels contain data that's causing our SETI@home splitters to spin for a loooong time. Maybe an angle range thing. Understanding this stuff is Eric's department. In any case, splitters don't run for more than a week at a time due to the weekly outage, and when they restart they start at the beginning of the block they were processing. So if they are stopped in the middle of a data block that takes more than a week to process, they'll never finish. Maybe that's what's happening. I'll make a note to bring it up next time we're all in the same place at the same time (a week or two from now?).

- Matt
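
The restart behaviour described in the post above can be illustrated with a small simulation. This is only a sketch, not project code: the function name, the 160-hour weekly uptime and the 200-hour block are made-up numbers chosen to show the effect, and the "resume from checkpoint" variant is a hypothetical alternative, not something the splitters are known to support.

```python
def hours_to_finish(block_hours, uptime_hours, resumes_from_checkpoint, max_weeks=52):
    """Return the total running hours needed to finish one block,
    or None if it is still unfinished after max_weeks restarts.

    block_hours  -- hypothetical processing time the block needs
    uptime_hours -- hypothetical uninterrupted run time between weekly outages
    """
    done = 0.0    # progress on the block, in hours of work
    total = 0.0   # total hours spent running
    for _ in range(max_weeks):
        if not resumes_from_checkpoint:
            done = 0.0  # restart at the beginning of the block after each outage
        remaining = block_hours - done
        if remaining <= uptime_hours:
            return total + remaining
        done += uptime_hours
        total += uptime_hours
    return None


# A block needing more work than one uptime window never completes
# when every restart begins from scratch.
print(hours_to_finish(200, 160, resumes_from_checkpoint=False))
print(hours_to_finish(200, 160, resumes_from_checkpoint=True))
```

With these assumed numbers, the restart-from-scratch case never finishes, while a splitter that kept its progress would finish in the second week.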
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

KWSN Ekky Ekky Ekky
Joined: 25 May 99
Posts: 922
Credit: 12,052,439
RAC: 13,878
United Kingdom
Message 814114 - Posted: 2 Oct 2008, 21:15:49 UTC - in response to Message 814091.

02/10/2008 22:15:05||Internet access OK - project servers may be temporarily down.

Problems?
____________

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5864
Credit: 60,564,497
RAC: 47,689
Australia
Message 814340 - Posted: 3 Oct 2008, 10:12:31 UTC - in response to Message 814267.

And what's up with these wacko Cricket Graphs.....???

Some mighty big swings in bandwidth goin' on........

Traffic blips probably due to Astropulse.
____________
Grant
Darwin NT.

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46517
Credit: 36,860,219
RAC: 5,202
United States
Message 814446 - Posted: 3 Oct 2008, 15:40:49 UTC - in response to Message 814340.

And what's up with these wacko Cricket Graphs.....???

Some mighty big swings in bandwidth goin' on........

Traffic blips probably due to Astropulse.

And downloads seem to be stuck.
____________
My Facebook, War Commander, 2015

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 814455 - Posted: 3 Oct 2008, 16:13:56 UTC

Yep.. one of the two download servers was having NFS issues again. I kicked it. It's sending out work again.

- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46517
Credit: 36,860,219
RAC: 5,202
United States
Message 814490 - Posted: 3 Oct 2008, 17:57:35 UTC - in response to Message 814478.

Yep.. one of the two download servers was having NFS issues again. I kicked it. It's sending out work again.

- Matt
Thanks for the kick Matt.......
Hope you have invested in some steel-toed safety shoes for work with all of the kicking you've been doing lately............

Yeah, we don't want Matt to break anything.
____________
My Facebook, War Commander, 2015

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46517
Credit: 36,860,219
RAC: 5,202
United States
Message 814599 - Posted: 4 Oct 2008, 1:25:23 UTC

Connect failures are happening here; uploads are failing.
____________
My Facebook, War Commander, 2015

arkayn
Project donor
Volunteer tester
Joined: 14 May 99
Posts: 3689
Credit: 48,728,916
RAC: 6,344
United States
Message 814614 - Posted: 4 Oct 2008, 3:03:31 UTC

I think the validator is stuck as well, as my pending is going up and up.
____________

Misfit
Volunteer tester
Joined: 21 Jun 01
Posts: 21790
Credit: 2,510,901
RAC: 0
United States
Message 815004 - Posted: 5 Oct 2008, 8:27:09 UTC
Last modified: 5 Oct 2008, 8:30:16 UTC

10/5/2008 1:25:10 AM|SETI@home|Sending scheduler request: Requested by user. Requesting 34906 seconds of work, reporting 0 completed tasks
10/5/2008 1:25:32 AM||Project communication failed: attempting access to reference site
10/5/2008 1:25:33 AM||Access to reference site succeeded - project servers may be temporarily down.
10/5/2008 1:25:35 AM|SETI@home|Scheduler request failed: Couldn't connect to server
10/5/2008 1:28:40 AM|SETI@home|Scheduler request failed: HTTP service unavailable
____________

Join BOINC Synergy!

Uli
Project donor
Volunteer tester
Joined: 6 Feb 00
Posts: 9842
Credit: 5,465,758
RAC: 206
Germany
Message 815009 - Posted: 5 Oct 2008, 8:39:59 UTC

No problems now, got plenty of WUs today.
____________
Pluto will always be a planet to me.
Order your 15th Seti Anniversary Shirt today. Just PM me for details.
Cash Donation Specialist

Seti Ambassador

Mike
Project donor
Volunteer tester
Joined: 17 Feb 01
Posts: 24535
Credit: 33,859,930
RAC: 23,612
Germany
Message 815012 - Posted: 5 Oct 2008, 9:07:25 UTC

Same here.
135 units left.

____________

Richard Haselgrove
Project donor
Volunteer tester
Joined: 4 Jul 99
Posts: 8633
Credit: 51,562,119
RAC: 48,684
United Kingdom
Message 815037 - Posted: 5 Oct 2008, 12:45:20 UTC

These three-hourly spikes on the Cricket Graphs are getting to be a bit of a nuisance - I had a ghostly visitation this morning:

05/10/2008 10:34:31|SETI@home|Sending scheduler request: To fetch work. Requesting 90178 seconds of work, reporting 0 completed tasks

1011715794 331160961 5 Oct 2008 9:34:44 UTC (and twelve others)

05/10/2008 10:39:40||Project communication failed: attempting access to reference site
05/10/2008 10:39:42||Internet access OK - project servers may be temporarily down.
05/10/2008 10:39:42|SETI@home|Scheduler request failed: Timeout was reached

Mike
Project donor
Volunteer tester
Joined: 17 Feb 01
Posts: 24535
Credit: 33,859,930
RAC: 23,612
Germany
Message 815049 - Posted: 5 Oct 2008, 13:49:35 UTC

No problems here.


05.10.2008 15:08:31|SETI@home|[file_xfer] Started download of file 22au08ag.4447.7025.6.8.18
05.10.2008 15:08:37|SETI@home|[file_xfer] Finished download of file 22au08ag.4447.7025.6.8.18
05.10.2008 15:08:37|SETI@home|[file_xfer] Throughput 77739 bytes/sec
05.10.2008 15:08:40|SETI@home|Sending scheduler request: To fetch work
05.10.2008 15:08:40|SETI@home|Requesting 219 seconds of new work, and reporting 1 completed tasks
05.10.2008 15:08:45|SETI@home|Scheduler RPC succeeded [server version 603]
05.10.2008 15:08:45|SETI@home|Deferring communication for 11 sec
05.10.2008 15:08:45|SETI@home|Reason: requested by project
05.10.2008 15:08:48|SETI@home|[file_xfer] Started download of file 22au08ag.4447.7025.6.8.174
05.10.2008 15:08:53|SETI@home|[file_xfer] Finished download of file 22au08ag.4447.7025.6.8.174
05.10.2008 15:08:53|SETI@home|[file_xfer] Throughput 84286 bytes/sec

____________

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46517
Credit: 36,860,219
RAC: 5,202
United States
Message 815262 - Posted: 6 Oct 2008, 2:24:22 UTC

OK, I'm getting a few problems uploading.

10/5/2008 7:19:10 PM||Project communication failed: attempting access to reference site
10/5/2008 7:19:10 PM|SETI@home|[file_xfer] Temporarily failed upload of 17au08aa.28073.5797.12.8.190_1_0: HTTP error
10/5/2008 7:19:10 PM|SETI@home|Backing off 1 min 9 sec on upload of file 17au08aa.28073.5797.12.8.190_1_0
10/5/2008 7:19:12 PM||Access to reference site succeeded - project servers may be temporarily down.
10/5/2008 7:19:54 PM||Time passed...reporting result(s) now.
10/5/2008 7:19:54 PM|SETI@home|Sending scheduler request: To report completed tasks
10/5/2008 7:19:54 PM|SETI@home|Reporting 1 tasks
10/5/2008 7:20:21 PM|SETI@home|[file_xfer] Started upload of file 17au08aa.28073.5797.12.8.190_1_0
10/5/2008 7:20:43 PM||Project communication failed: attempting access to reference site
10/5/2008 7:20:43 PM|SETI@home|[file_xfer] Temporarily failed upload of 17au08aa.28073.5797.12.8.190_1_0: connect() failed
10/5/2008 7:20:43 PM|SETI@home|Backing off 2 min 32 sec on upload of file 17au08aa.28073.5797.12.8.190_1_0
10/5/2008 7:20:44 PM||Access to reference site succeeded - project servers may be temporarily down.
10/5/2008 7:22:10 PM||Project communication failed: attempting access to reference site
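
For anyone who wants to track how the upload back-off grows in a log like the one above, here is a minimal sketch that pulls the back-off durations out of such lines. It assumes the "Backing off N min N sec on upload of file ..." wording seen in this log; other wordings (for example a back-off with only seconds) would not match, and the function name is made up for illustration.

```python
import re

# Matches the message text as it appears in the log above; other BOINC
# client versions may word this differently (assumption).
BACKOFF_RE = re.compile(r"Backing off (\d+) min (\d+) sec on upload of file (\S+)")


def upload_backoffs(lines):
    """Return a list of (file_name, backoff_seconds) for every matching line."""
    results = []
    for line in lines:
        match = BACKOFF_RE.search(line)
        if match:
            minutes, seconds, file_name = match.groups()
            results.append((file_name, int(minutes) * 60 + int(seconds)))
    return results


log = [
    "10/5/2008 7:19:10 PM|SETI@home|Backing off 1 min 9 sec on upload of file 17au08aa.28073.5797.12.8.190_1_0",
    "10/5/2008 7:20:43 PM|SETI@home|Backing off 2 min 32 sec on upload of file 17au08aa.28073.5797.12.8.190_1_0",
]
for file_name, seconds in upload_backoffs(log):
    print(file_name, seconds)
```

For the two lines quoted from the log, this gives back-offs of 69 and 152 seconds for the same file.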
____________
My Facebook, War Commander, 2015

Uli
Project donor
Volunteer tester
Joined: 6 Feb 00
Posts: 9842
Credit: 5,465,758
RAC: 206
Germany
Message 815264 - Posted: 6 Oct 2008, 2:30:43 UTC

Yah, that graph can make you seasick.
@ Joker, I just uploaded a minute or less ago and things seem to be fine.

I know, hiccups are no fun.
____________
Pluto will always be a planet to me.
Order your 15th Seti Anniversary Shirt today. Just PM me for details.
Cash Donation Specialist

Seti Ambassador

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46517
Credit: 36,860,219
RAC: 5,202
United States
Message 815267 - Posted: 6 Oct 2008, 2:56:58 UTC - in response to Message 815264.
Last modified: 6 Oct 2008, 2:58:50 UTC

Yah, that graph can make you seasick.
@ Joker, I just uploaded a minute or less ago and things seem to be fine.

I know, hiccups are no fun.

It seems to have cleared up. Why, I don't know, as I had to stop 3 programs (FF3, BV1.42, Crunch3r's BOINC 6.10 and AK_v8 SSSE3x). Go figure.

Oh, and my dad got seasick; I don't, and neither did my brother for that matter. Only 3 out of 4 WUs were being worked on, as FF3 was hogging all the CPU time.
____________
My Facebook, War Commander, 2015

DJStarfox
Joined: 23 May 01
Posts: 1045
Credit: 560,168
RAC: 442
United States
Message 815283 - Posted: 6 Oct 2008, 3:59:15 UTC - in response to Message 814097.

I think it's fascinating that 22mr08aa still has the same two active channels as it had three weeks ago on 11 September, and 23mr08aa one.


Yeah.. I think those channels contain data that's causing our SETI@home splitters to spin for a loooong time. Maybe an angle range thing. Understanding this stuff is Eric's department. In any case, splitters don't run for more than a week at a time due to the weekly outage, and when they restart they start at the beginning of the block they were processing. So if they are stopped in the middle of a data block that takes more than a week to process, they'll never finish. Maybe that's what's happening. I'll make a note to bring it up next time we're all in the same place at the same time (a week or two from now?).

- Matt


Matt:

You may have just hit on something big. Is there a way to separate the splitting from the WU creation? I'm thinking the splitter could create all the files, write its metadata to a flat file (details to figure out later), and have the feeder or another process actually pick up the fresh WUs and add them to the DB.

I guess there's a lot of cool stuff you could do if you had a few extra terabytes lying around. :g:
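
A rough sketch of the idea in the post above, under heavy assumptions: the splitter stand-in only appends one line of metadata per workunit to a flat file, and a separate loader later reads that file and adds the records to a database. None of this is BOINC or SETI@home code. The function names, the JSON-lines file format, the table layout and the workunit naming are all invented for illustration, and SQLite stands in for the project database.

```python
import json
import sqlite3


def split_block(block_name, n_workunits, manifest_path):
    """Hypothetical splitter: record metadata for each workunit in a flat file.

    It never touches the database, so an interrupted run leaves behind
    whatever metadata it had already written.
    """
    with open(manifest_path, "a", encoding="utf-8") as manifest:
        for i in range(n_workunits):
            record = {"name": f"{block_name}.{i}", "block": block_name}
            manifest.write(json.dumps(record) + "\n")


def load_manifest(manifest_path, db):
    """Hypothetical loader: add manifest records to the DB.

    Records already present are skipped, so the loader can be re-run safely.
    Returns the number of newly added workunits.
    """
    db.execute(
        "CREATE TABLE IF NOT EXISTS workunit (name TEXT PRIMARY KEY, block TEXT)"
    )
    added = 0
    with open(manifest_path, encoding="utf-8") as manifest:
        for line in manifest:
            record = json.loads(line)
            cursor = db.execute(
                "INSERT OR IGNORE INTO workunit (name, block) VALUES (?, ?)",
                (record["name"], record["block"]),
            )
            added += cursor.rowcount  # 1 if inserted, 0 if it already existed
    db.commit()
    return added
```

A typical use would be to call `split_block("22mr08aa", 3, "manifest.jsonl")` from the splitter side and `load_manifest("manifest.jsonl", sqlite3.connect(":memory:"))` from the separate process. Keying the table on the workunit name is what makes a repeated load harmless in this sketch.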


Copyright © 2014 University of California