Alexei (Apr 01 2009)

Message boards : Technical News : Alexei (Apr 01 2009)
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 881432 - Posted: 1 Apr 2009, 22:01:27 UTC

Let's see... we're *still* waiting for the RAID resyncs to finish, and likewise the pulse table rebuild. Another day or two? Meanwhile, I cleared off enough space on the workunit machine that we can keep producing/sending out work. We still can't assimilate very much until the pulse table rebuild is over, but at least people can do science and get credit. I'm worried about mysql bloat with the large result table (over 2 million waiting for assimilation), but we've been here many times before and lived.

Lost in the chaos of outage recovery yesterday was a bunch of "make science status page" processes piling up on top of each other, causing extra stress on the science database, and eventually making the splitters jam up. Oops. I killed all those this morning and that particular dam broke. Now that we're catching up on satisfying workunit demand I think we'll be maxed out traffic-wise for a while, which isn't the worst of problems (that means work *is* flowing as fast as we can send it).
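One common way to keep periodic jobs like the status-page builders described above from stacking up is to have each run take a non-blocking exclusive lock and skip itself if a previous run still holds it. The following is only a minimal sketch of that general technique on a Unix-like system, not a description of how the project's scripts actually work; the lock path, `run_once`, and `build_status_page` are hypothetical names used for illustration.

```python
import fcntl
import os
import tempfile

# Hypothetical lock file location; a real job would pick its own path.
LOCK_PATH = os.path.join(tempfile.gettempdir(), "make_science_status_page.lock")


def try_acquire_lock(path):
    """Return an open file holding an exclusive lock, or None if it is already held."""
    handle = open(path, "w")
    try:
        # LOCK_NB makes the call fail immediately instead of waiting in line,
        # which is what stops runs from piling up behind a slow one.
        fcntl.flock(handle, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        handle.close()
        return None
    return handle


def release_lock(handle):
    """Release the lock and close the underlying file."""
    fcntl.flock(handle, fcntl.LOCK_UN)
    handle.close()


def build_status_page():
    """Placeholder for the real (possibly slow) database work."""
    return "status page rebuilt"


def run_once(job, path=LOCK_PATH):
    """Run job() only if no other run holds the lock; return its result or None."""
    handle = try_acquire_lock(path)
    if handle is None:
        return None  # a previous run is still in progress, so skip this one
    try:
        return job()
    finally:
        release_lock(handle)
```

A scheduler entry would then call `run_once(build_status_page)` on every tick; when the database is slow, the later ticks return `None` right away instead of adding more load.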

Lots of code walkthroughs with Jeff today regarding the NTPCker. It's getting to be a mature piece of code. Scoring mechanisms are almost all in place (though they still may need major tuning once we sift through enough real data). We're still concerned about our ability to actually keep it running "near time," i.e. will the database be able to handle the load? We shall see. A lot of database improvements to help this have unfortunately been blocked on the last couple of weeks' worth of problems with thumper.

Happy April Fool's Day! Don't believe anything anybody says! Actually that's good advice regardless of the day of the year.

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 881432 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 881436 - Posted: 1 Apr 2009, 22:18:54 UTC - in response to Message 881432.  
Last modified: 1 Apr 2009, 22:19:23 UTC

Thanks Matt. Is the 'extra CUDA load' troublesome, or does it work well, judging by the amount of work done?
Anyway, thanks for your update. My keyboard is sluggish sometimes; I think it's the CUDA use.
Only when I want to burn an image to CD/DVD will I stop BOINC completely.
There seem to be few CUDA MB WUs, but also 'normal' (6.03) MB WUs.
ID: 881436 · Report as offensive
Profile Borgholio
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 881437 - Posted: 1 Apr 2009, 22:22:30 UTC - in response to Message 881432.  



Lost in the chaos of outage recovery yesterday was a bunch of "make science status page" processes piling up on top of each other, causing extra stress on the science database, and eventually making the splitters jam up. Oops. I killed all those this morning and that particular dam broke. Now that we're catching up on satisfying workunit demand I think we'll be maxed out traffic-wise for a while, which isn't the worst of problems (that means work *is* flowing as fast as we can send it).



So the status page was causing the slow workunit creation last night? Will the page remain frozen until things settle down?


You will be assimilated...bunghole!

ID: 881437 · Report as offensive
Profile KWSN Ekky Ekky Ekky
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 881439 - Posted: 1 Apr 2009, 22:25:16 UTC

Judging by the Cricket graphs you guys have certainly unplugged something. Let's hope it holds!
Congrats to you all for all the hard work.

ID: 881439 · Report as offensive
Profile Gundolf Jahn

Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 881443 - Posted: 1 Apr 2009, 22:36:13 UTC - in response to Message 881437.  

So the status page was causing the slow workunit creation last night? Will the page remain frozen until things settle down?

The server status page is frozen too (since 11:20 UTC).
ID: 881443 · Report as offensive
Profile Dr. C.E.T.I.
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 881448 - Posted: 1 Apr 2009, 22:47:11 UTC


. . . got that one AP that was 'sitting' there forever ;)

Thank You for the Updates Matt - Accolades to All @ Berkeley . . .


BOINC Wiki . . .

Science Status Page . . .
ID: 881448 · Report as offensive
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 881458 - Posted: 1 Apr 2009, 23:17:08 UTC

...yeah, there was some additional mounting/network clogging gobbledygook that was blocking the regular server status page for a while. Separate problem, and I'm fixing that now. The science status page will continue to be stuck on hold for the near term...

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 881458 · Report as offensive
Profile Borgholio
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 881468 - Posted: 1 Apr 2009, 23:53:22 UTC - in response to Message 881458.  

Seems like work is flowing nicely now. The workunit creation rate is 4x faster than it's been in the last week. Thanks Matt!
You will be assimilated...bunghole!

ID: 881468 · Report as offensive
Profile RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 881543 - Posted: 2 Apr 2009, 4:04:24 UTC - in response to Message 881468.  

i would say 10x
woot, thanks for unjamming the splitters, good catch.
ID: 881543 · Report as offensive
Profile DarkRyder
Volunteer tester

Joined: 10 Sep 01
Posts: 5
Credit: 25,554,963
RAC: 27
United States
Message 881697 - Posted: 2 Apr 2009, 15:49:15 UTC

How are all the problems going? How soon should we see work for CUDA?
ID: 881697 · Report as offensive
Zydor

Joined: 4 Oct 03
Posts: 172
Credit: 491,111
RAC: 0
United Kingdom
Message 881708 - Posted: 2 Apr 2009, 16:34:20 UTC - in response to Message 881697.  
Last modified: 2 Apr 2009, 16:35:06 UTC

Matt turned on the splitters full tilt on Wednesday night, and they are currently pouring out WUs as fast as the site's 100 Mbit bandwidth will take: at last sight around 40+ per second, maybe more by now. It will take a while, probably a couple of days or more, to refill everyone out there.

An earlier post put it nicely: "Form an orderly queue at the Router :)"

Regards
Zy
ID: 881708 · Report as offensive
Profile DarkRyder
Volunteer tester

Joined: 10 Sep 01
Posts: 5
Credit: 25,554,963
RAC: 27
United States
Message 881742 - Posted: 2 Apr 2009, 18:18:10 UTC

lol. alrighty. thanks buddy.
ID: 881742 · Report as offensive
Profile RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 881898 - Posted: 3 Apr 2009, 5:43:06 UTC - in response to Message 881543.  
Last modified: 3 Apr 2009, 5:44:24 UTC

i would say 10x
woot, thanks for unjamming the splitters, good catch.


looks like the splitters are jammed up again:( result generation rate is near zero again...
ID: 881898 · Report as offensive
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 881900 - Posted: 3 Apr 2009, 6:09:16 UTC - in response to Message 881898.  

i would say 10x
woot, thanks for unjamming the splitters, good catch.


looks like the splitters are jammed up again:( result generation rate is near zero again...

Unfortunately....see Matt's explanation in today's tech news post.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 881900 · Report as offensive
Profile RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 881982 - Posted: 3 Apr 2009, 14:28:45 UTC - in response to Message 881900.  

good chance we are out of disk storage space, but the cricket graphs now look like they did all week. :p
ID: 881982 · Report as offensive

©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.