Alexei (Apr 01 2009)


log in

Advanced search

Message boards : Technical News : Alexei (Apr 01 2009)

Author Message
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 1 Mar 99
Posts: 1391
Credit: 74,079
RAC: 10
United States
Message 881432 - Posted: 1 Apr 2009, 22:01:27 UTC

Let's see.. we're *still* waiting for the RAID resync's to finish and likewise the pulse table rebuild. Another day or two? Meanwhile, I cleared off enough space on the workunit machine such that we can keep producing/sending out work. We still can't assimilate very much until the pulse table rebuild is over, but at least the people can do science and get credit. I'm worried about mysql bloat with the large result table (over 2 million waiting for assimilation), but we've been here many times before and lived.

Lost in the chaos of outage recovery yesterday was a bunch of "make science status page" processes piling up on top of each other, causing extra stress on the science database, and eventually making the splitters jam up. Oops. I killed all those this morning and that particular dam broke. Now that we're catching up on satisfying workunit demand I think we'll be maxed out traffic-wise for a while, which isn't the worst of problems (that means work *is* flowing as fast as we can send it).

Lots of code walkthroughs with Jeff today regarding the NTPCker. It's getting to be a mature piece of code. Scoring mechanisms are almost all in place (though they still may need major tuning once we sift through enough real data). We're still concerned about our ability to actually keep it running "near time," i.e. will the database be able to handle the load? We shall see. A lot of database improvements to help this have unfortunately been blocked on the last couple of weeks' worth of problems with thumper.

Happy April Fool's Day! Don't believe anything anybody says! Actually that's good advice regardless of the day of the year.

____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Profile Fred J. Verster
Volunteer tester
Avatar
Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,520
RAC: 119
Netherlands
Message 881436 - Posted: 1 Apr 2009, 22:18:54 UTC - in response to Message 881432.
Last modified: 1 Apr 2009, 22:19:23 UTC

Thanks Matt, is the 'extra CUDA-LOAD', troublesome or does it work well, if you look at the amount work done.
Anyway, thanks for your UPdate, my keyboard is sluggish sometime, (think) it's the CUDA use.
Only, when I want to 'burn' an 'image' on CD/DVD, a will stop BOINC completeley.
There seems to be little CUDA MB WU's but also 'normal (6.03)MB WU's .
____________

Profile Borgholio
Avatar
Send message
Joined: 2 Aug 99
Posts: 653
Credit: 12,235,904
RAC: 2,026
United States
Message 881437 - Posted: 1 Apr 2009, 22:22:30 UTC - in response to Message 881432.



Lost in the chaos of outage recovery yesterday was a bunch of "make science status page" processes piling up on top of each other, causing extra stress on the science database, and eventually making the splitters jam up. Oops. I killed all those this morning and that particular dam broke. Now that we're catching up on satisfying workunit demand I think we'll be maxed out traffic-wise for a while, which isn't the worst of problems (that means work *is* flowing as fast as we can send it).



So the status page was causing the slow workunit creation last night? Will the page remain frozen until things settle down?


____________


You will be assimilated...bunghole!

Profile KWSN Ekky Ekky Ekky
Avatar
Send message
Joined: 25 May 99
Posts: 928
Credit: 12,562,890
RAC: 10,809
United Kingdom
Message 881439 - Posted: 1 Apr 2009, 22:25:16 UTC

Judging by the Cricket graphs you guys have certainly unplugged something. Let's hope it holds!
Congrats to you all for all the hard work.
____________

Profile Gundolf Jahn
Send message
Joined: 19 Sep 00
Posts: 3184
Credit: 361,286
RAC: 37
Germany
Message 881443 - Posted: 1 Apr 2009, 22:36:13 UTC - in response to Message 881437.

So the status page was causing the slow workunit creation last night? Will the page remain frozen until things settle down?

The server status page is frozen too (since 11:20 UTC).

Profile Dr. C.E.T.I.
Avatar
Send message
Joined: 29 Feb 00
Posts: 15993
Credit: 690,597
RAC: 0
United States
Message 881448 - Posted: 1 Apr 2009, 22:47:11 UTC


. . . got that one AP that was 'sitting' there forever ;)

Thank You for the Updates Matt - Accolades to ALl @ Berkeley . . .


____________
BOINC Wiki . . .

Science Status Page . . .

Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 1 Mar 99
Posts: 1391
Credit: 74,079
RAC: 10
United States
Message 881458 - Posted: 1 Apr 2009, 23:17:08 UTC

..yeah there was some additional mounting/network clogging gobbledygook that was blocking the regular server status page for a while. Separate problem, and I'm fixing that now. The science status page will be continue to be stuck on hold for the near term...

- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Profile Borgholio
Avatar
Send message
Joined: 2 Aug 99
Posts: 653
Credit: 12,235,904
RAC: 2,026
United States
Message 881468 - Posted: 1 Apr 2009, 23:53:22 UTC - in response to Message 881458.

Seems like work is flowing nicely now. The workunit creation rate is 4x faster than it's been in the last week. Thanks Matt!
____________


You will be assimilated...bunghole!

Profile RottenMutt
Avatar
Send message
Joined: 15 Mar 01
Posts: 999
Credit: 209,282,819
RAC: 50,707
United States
Message 881543 - Posted: 2 Apr 2009, 4:04:24 UTC - in response to Message 881468.

i would say 10x
woot, thanks for unjamming the splitters, good catch.
____________

Profile DarkRyder
Volunteer tester
Send message
Joined: 10 Sep 01
Posts: 5
Credit: 12,795,257
RAC: 959
United States
Message 881697 - Posted: 2 Apr 2009, 15:49:15 UTC

Hows all the problems going? How soon should we see work for cuda?
____________

Zydor
Send message
Joined: 4 Oct 03
Posts: 172
Credit: 491,111
RAC: 0
United Kingdom
Message 881708 - Posted: 2 Apr 2009, 16:34:20 UTC - in response to Message 881697.
Last modified: 2 Apr 2009, 16:35:06 UTC

Matt turned on the splitters full tilt on weds night, and the splitters are currently pouring out the WUs as fast as the 100Mb bandwidth from the Site will take, at last sight around 40+ per second, maybe more by now. It will take a while - probably a couple of days or more to refill everyone out there.

An earlier post put it nicely "Form an orderly queue at the Router :) "

Regards
Zy
____________

Profile DarkRyder
Volunteer tester
Send message
Joined: 10 Sep 01
Posts: 5
Credit: 12,795,257
RAC: 959
United States
Message 881742 - Posted: 2 Apr 2009, 18:18:10 UTC

lol. alrighty. thanks buddy.
____________

Profile RottenMutt
Avatar
Send message
Joined: 15 Mar 01
Posts: 999
Credit: 209,282,819
RAC: 50,707
United States
Message 881898 - Posted: 3 Apr 2009, 5:43:06 UTC - in response to Message 881543.
Last modified: 3 Apr 2009, 5:44:24 UTC

i would say 10x
woot, thanks for unjamming the splitters, good catch.


looks like the splitters are jammed up again:( result generation rate is near zero again...
____________

Profile RottenMutt
Avatar
Send message
Joined: 15 Mar 01
Posts: 999
Credit: 209,282,819
RAC: 50,707
United States
Message 881982 - Posted: 3 Apr 2009, 14:28:45 UTC - in response to Message 881900.

good chance we are out of disk storage space, but the cricket graphs now look like they did all week. :p
____________

Message boards : Technical News : Alexei (Apr 01 2009)

Copyright © 2014 University of California