Panic Mode On (71) Server problems?

Message boards : Number crunching : Panic Mode On (71) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · Next

AuthorMessage
Profile Dimly Lit Lightbulb 😀
Volunteer tester
Avatar

Send message
Joined: 30 Aug 08
Posts: 15399
Credit: 7,423,413
RAC: 1
United Kingdom
Message 1205955 - Posted: 14 Mar 2012, 23:15:27 UTC - in response to Message 1205933.  
Last modified: 14 Mar 2012, 23:17:09 UTC

Quite true, turnaround time is what counts and the BOINC database keeps that statistic for each app version on a host. The available BOINC feature uses that plus consecutive valid and having a quota at least equal to the basic setting of 100 to judge. The documentation is slightly out of date, but gives a reasonable overall view. Reading the code in sched_send.cpp and a few other source files gives accurate current info on how it works.

Having that feature turned on would of course be an additional load on the Scheduler processes, but with improved servers that may become feasible. What actual settings would be appropriate would take some thought, the general advice that about 25% of hosts ought to be considered reliable seems sensible. If set too tight, reissue tasks might occupy positions in the Feeder queue for too long.

Could we kill two birds with one stone and have two feeder queues in different instances of the scheduler? Ideally one with a big queue for the relable hosts that would include resends and a second with a smaller queue for the others.

LHC@Home 1.0 uses reliable hosts for resends (quick turnaround and high validated tasks) but I don't know how that's going for them, I've not read the forums for a while. But I do have a problem with two schedulers. Inevitably that would be seen as one for the high crunchers and one for the low crunchers. Besides, as has been said the servers are doing enough already, and that's quite a strain on them already despite the 100MB limit. The tasks will be done sooner or later, whether it's a day or months is irrelevant.
And to stay on topic, anybody have any idea on how many astropulses are flying around? I wouldn't mind a few :)

[edit]that's 5.05's I'm yakking about :)[/edit]

Member of the People Encouraging Niceness In Society club.

ID: 1205955 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1205957 - Posted: 14 Mar 2012, 23:19:56 UTC - in response to Message 1205955.  

And to stay on topic, anybody have any idea on how many astropulses are flying around? I wouldn't mind a few :)

[edit]that's 5.05's I'm yakking about :)[/edit]

Yeah, just look at the server status page. The AP numbers have not been updated to include v6 yet. :D
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1205957 · Report as offensive
Profile Dimly Lit Lightbulb 😀
Volunteer tester
Avatar

Send message
Joined: 30 Aug 08
Posts: 15399
Credit: 7,423,413
RAC: 1
United Kingdom
Message 1205978 - Posted: 15 Mar 2012, 0:46:38 UTC - in response to Message 1205957.  

And to stay on topic, anybody have any idea on how many astropulses are flying around? I wouldn't mind a few :)

[edit]that's 5.05's I'm yakking about :)[/edit]

Yeah, just look at the server status page. The AP numbers have not been updated to include v6 yet. :D

Aye, not updated and from the crickets it looks like astropulses V6 are being split like 5.05's. Hold on to yer socks :)

Member of the People Encouraging Niceness In Society club.

ID: 1205978 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1206131 - Posted: 15 Mar 2012, 14:02:49 UTC

Making steady progress here.

Looking at cricket, anybody feels like jinxing it?
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1206131 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1206140 - Posted: 15 Mar 2012, 14:23:24 UTC - in response to Message 1206131.  

Looking at cricket, anybody feels like jinxing it?

I'll do it. QUACK! Still about 3 hours away from that, but it's too late now. :p

Also of note, I started my stats spreadsheet back up since we're over on v6 now. aside from stock not reporting how much was blanked, it will provide a good baseline to see how much better the optimization is compared to stock.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1206140 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1206151 - Posted: 15 Mar 2012, 14:42:20 UTC - in response to Message 1206140.  

Looking at cricket, anybody feels like jinxing it?

I'll do it. QUACK! Still about 3 hours away from that, but it's too late now. :p

Also of note, I started my stats spreadsheet back up since we're over on v6 now. aside from stock not reporting how much was blanked, it will provide a good baseline to see how much better the optimization is compared to stock.


CPU? on benches between 2x-3x with another 20-25% extra on AVX.
r548/r555 is also faster than r409, but I can't find the values in a hurry.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1206151 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51469
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1206161 - Posted: 15 Mar 2012, 15:08:49 UTC - in response to Message 1206131.  

Making steady progress here.

Looking at cricket, anybody feels like jinxing it?

No jinxes please....
All is going very well here in MB land.
Even with the new AP rollout, all boxen are up to their limits and keeping topped off.

Kitties chase away the duckens.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1206161 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1206195 - Posted: 15 Mar 2012, 16:47:48 UTC - in response to Message 1206161.  

Kitties chase away the duckens.

Could the kitties please chase away the geesen that crappen all over all the sidewalken?

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1206195 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51469
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1206214 - Posted: 15 Mar 2012, 17:52:38 UTC - in response to Message 1206195.  

Kitties chase away the duckens.

Could the kitties please chase away the geesen that crappen all over all the sidewalken?

Dunno if it's the duckens or the geesen, but something crappen on the servers.
Although the cricket is still going full bore, and I can't see anything amiss on server status, suddenly none of my rigs have been issued a single MB task for about 45 minutes.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1206214 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51469
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1206220 - Posted: 15 Mar 2012, 18:12:49 UTC

And then it came back, but you can see the chink in the Cricket graph.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1206220 · Report as offensive
Blake Bonkofsky
Volunteer tester
Avatar

Send message
Joined: 29 Dec 99
Posts: 617
Credit: 46,383,149
RAC: 0
United States
Message 1206221 - Posted: 15 Mar 2012, 18:13:41 UTC - in response to Message 1206220.  

I was gonna say, I just got 25 MB's in one hit... Maybe they had to reboot something?
ID: 1206221 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1206222 - Posted: 15 Mar 2012, 18:27:54 UTC

All it took was one mention of that yellow thing.

ID: 1206222 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1206228 - Posted: 15 Mar 2012, 18:40:36 UTC

10 MB-splitters and 12 AP-splitters are working now together. Are they testing new servers?
ID: 1206228 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1206239 - Posted: 15 Mar 2012, 19:25:52 UTC - in response to Message 1206237.  

I'll let my computers run here and on Beta, and whatever they get they will crunch. I expect my RAC to crash, but as long as I have a high enough RAC to post on the boards, it's fine with me.

Your (if you really are Sten-Arne; this new attitude is cause for doubt) RAC is higher than mine, so you should be okay.

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1206239 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1206240 - Posted: 15 Mar 2012, 19:29:18 UTC - in response to Message 1206228.  

10 MB-splitters and 12 AP-splitters are working now together. Are they testing new servers?

Where do you get that information? Server status page shows AP1-6 (6), and MB 6,7,10,11,12 (5).
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1206240 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1206248 - Posted: 15 Mar 2012, 20:04:16 UTC - in response to Message 1206240.  

10 MB-splitters and 12 AP-splitters are working now together. Are they testing new servers?

Where do you get that information? Server status page shows AP1-6 (6), and MB 6,7,10,11,12 (5).


Obviously those new splitters are not counting at the status page as well as APv6 tasks, eh?
Look at the "channels in progress": 10 MB & 12 AP, and so at the graphical "Splitter status"

ID: 1206248 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65795
Credit: 55,293,173
RAC: 49
United States
Message 1206253 - Posted: 15 Mar 2012, 20:15:40 UTC - in response to Message 1206248.  
Last modified: 15 Mar 2012, 20:19:38 UTC

10 MB-splitters and 12 AP-splitters are working now together. Are they testing new servers?

Where do you get that information? Server status page shows AP1-6 (6), and MB 6,7,10,11,12 (5).


Obviously those new splitters are not counting at the status page as well as APv6 tasks, eh?
Look at the "channels in progress": 10 MB & 12 AP, and so at the graphical "Splitter status"

Of course if one is on the anonymous platform, the server now says one does not have a v6 AP app in the app_info.xml file.

SETI@home 3/15/2012 12:27:40 PM Message from server: Your app_info.xml file doesn't have a usable version of AstroPulse v6.

The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1206253 · Report as offensive
Profile shizaru
Volunteer tester
Avatar

Send message
Joined: 14 Jun 04
Posts: 1130
Credit: 1,967,904
RAC: 0
Greece
Message 1206255 - Posted: 15 Mar 2012, 20:18:20 UTC

Wow!

Synergy finally got to see some action and now there's no stopping her!
ID: 1206255 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1206264 - Posted: 15 Mar 2012, 20:31:25 UTC - in response to Message 1206253.  


Of course if one is on the anonymous platform, the server now says one does not have a v6 AP app in the app_info.xml file.


No, no, no! I mean that AP 6.01 tasks which was splitted and awaiting for crunchers or even sent to them, doesn't counting now at the server status page, there are only old AP 5.05 tasks.
And new splitters doesn't counting there too.

ID: 1206264 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1206299 - Posted: 15 Mar 2012, 22:22:11 UTC
Last modified: 15 Mar 2012, 22:22:56 UTC

I think the listed splitters ARE the new v6 ones. No more 505's will ever be split. I have seen in the past where "channels in progress" are more than the number of splitters. It is actually somewhat common.

I don't believe it means there are more splitters than what are listed, it may just be that if one channel is taking too long or has a lot of I/O wait, that particular instance may move on to another channel, so two channels appear as active even though only one process is working on them.


of course, Occam's Razor would suggest that there are indeed a lot of splitters now but the list hasn't been updated to show them.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1206299 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · Next

Message boards : Number crunching : Panic Mode On (71) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.