Holiday Checkup (Dec 23 2007)

Message boards : Technical News : Holiday Checkup (Dec 23 2007)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 693979 - Posted: 23 Dec 2007, 19:05:25 UTC

Quick note:

We never really did recover from the science database issues from a couple days ago due to DOS'ing ourselves with fast workunits. Whatever. We chose to let things naturally pass through the system. Kinda like kidney stones. Meanwhile, one of the assimilators is failing with a brand new error. If any of us have time we'll try to check into that over the coming days, but we may be out of luck until we're all in the lab doing "extreme debugging" together on Wednesday. Hang in there!

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 693979 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 693995 - Posted: 23 Dec 2007, 21:03:36 UTC


. . . been waiting / crunchin' for almost 8 Years - another day passes [c'est la vie] ;) - Thanks for the Post Matt

and Happy Holidays Sir . . .

richard
BOINC Wiki . . .

Science Status Page . . .
ID: 693995 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 693996 - Posted: 23 Dec 2007, 21:11:16 UTC

Thanx Matt....
Very glad that you are keeping tabs on things, but do enjoy your holiday season.

Looks like tha 01ja07 series is getting closer to being finished, hope that the 02dc06 next up yields some larger WUs and gives the splitters and servers some time to catch up.

Merry Christmas!

Mark.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 693996 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 693998 - Posted: 23 Dec 2007, 21:57:59 UTC

Just done a quick check on all WUs logged here.

530 out of 631 (83.99%) of all tasks issued after 12:00:00 UTC 22 Dec 2007 are shown as Angle Range 2.46 to 2.64

That compares with 399 out of 9766 (4.09%) of tasks issued in the previous four months.

Either something very odd about that data, or something very odd has happened to the splitters.

I think it's a very sensible decision to let it runs its course over the weekend, and I don't begrudge you your holiday break - Happy Christmas, and have a great time as far away from the lab as possible!

But when you come in for that "extreme debugging" session (as early as Wednesday? Shame on you!), please have a look for causes as well as coping strategies.
ID: 693998 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 694078 - Posted: 24 Dec 2007, 4:11:31 UTC - in response to Message 693998.  

Just done a quick check on all WUs logged here.

530 out of 631 (83.99%) of all tasks issued after 12:00:00 UTC 22 Dec 2007 are shown as Angle Range 2.46 to 2.64

That compares with 399 out of 9766 (4.09%) of tasks issued in the previous four months.

Either something very odd about that data, or something very odd has happened to the splitters.

I think it's a very sensible decision to let it runs its course over the weekend, and I don't begrudge you your holiday break - Happy Christmas, and have a great time as far away from the lab as possible!

But when you come in for that "extreme debugging" session (as early as Wednesday? Shame on you!), please have a look for causes as well as coping strategies.

I have to wonder what is going on as well.....
The new 02dc06aa series also appears to be coming up with all shorties....something doesn't seem right.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 694078 · Report as offensive
lwblack1051

Send message
Joined: 16 Dec 07
Posts: 1
Credit: 40,837
RAC: 0
United States
Message 694106 - Posted: 24 Dec 2007, 6:09:54 UTC - in response to Message 694078.  

Just done a quick check on all WUs logged here.

530 out of 631 (83.99%) of all tasks issued after 12:00:00 UTC 22 Dec 2007 are shown as Angle Range 2.46 to 2.64

That compares with 399 out of 9766 (4.09%) of tasks issued in the previous four months.

Either something very odd about that data, or something very odd has happened to the splitters.

I think it's a very sensible decision to let it runs its course over the weekend, and I don't begrudge you your holiday break - Happy Christmas, and have a great time as far away from the lab as possible!

But when you come in for that "extreme debugging" session (as early as Wednesday? Shame on you!), please have a look for causes as well as coping strategies.

I have to wonder what is going on as well.....
The new 02dc06aa series also appears to be coming up with all shorties....something doesn't seem right.


Allot of project server down messages of late. Spending more and more processor time on getting can't connect messages.

ID: 694106 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 694111 - Posted: 24 Dec 2007, 6:36:09 UTC - in response to Message 694106.  


Allot of project server down messages of late. Spending more and more processor time on getting can't connect messages.


'S 'cuz the splitters cannot create enough WUs to keep up with the demand from the host computers.....
They are being crunched and reported as fast as they can be created....
And the kitties say....'Stay tuned for further developements.......'
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 694111 · Report as offensive
Profile Jan Schotsmans
Avatar

Send message
Joined: 27 Oct 00
Posts: 98
Credit: 92,693
RAC: 0
Belgium
Message 694134 - Posted: 24 Dec 2007, 7:52:57 UTC - in response to Message 694111.  


Allot of project server down messages of late. Spending more and more processor time on getting can't connect messages.


'S 'cuz the splitters cannot create enough WUs to keep up with the demand from the host computers.....
They are being crunched and reported as fast as they can be created....
And the kitties say....'Stay tuned for further developements.......'


Lets hope the next tape is nearly all 80+ point WU's so that the splitters can create a new WU queue.

These 17 pointer ones are finishing in just over 20 minutes for me, that makes 2 units per 20 minutes on my C2D laptop, SETI would need to have 144 WU's ready per C2D level machine to keep this up, with a few million hosts running the app its obvious why the splitters can't keep up.
ID: 694134 · Report as offensive
Profile Rev. Tim Olivera

Send message
Joined: 15 Jan 06
Posts: 20
Credit: 1,717,714
RAC: 0
United States
Message 694158 - Posted: 24 Dec 2007, 10:50:44 UTC - in response to Message 693979.  

True to from just like on every long holiday no work! But the thing that made me really laugh this time was a letter asking for more folks to help with your program!! You can't keep the folks you hve now in work so you need more folks to leave there computers on to do nothing but use power and wear out parts!!! For 3 days now BOINC is Idle!! Keep up the good work guys don't let me down for Easter ot the 3 day in Feb I love to have my computers sucking up power and wearing out parts for nothing!!!

rev. Tim Olivera
ID: 694158 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 694164 - Posted: 24 Dec 2007, 11:28:34 UTC - in response to Message 694158.  

True to from just like on every long holiday no work! But the thing that made me really laugh this time was a letter asking for more folks to help with your program!! You can't keep the folks you hve now in work so you need more folks to leave there computers on to do nothing but use power and wear out parts!!! For 3 days now BOINC is Idle!! Keep up the good work guys don't let me down for Easter ot the 3 day in Feb I love to have my computers sucking up power and wearing out parts for nothing!!!

rev. Tim Olivera


If you want to keep your systems busy when SAH is out of work, then attach to a backup project.

S@H has specifically stated they WILL NOT always have work available and advises users to attach to a backup project for this exact situation.
ID: 694164 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 694165 - Posted: 24 Dec 2007, 11:34:39 UTC - in response to Message 694164.  

True to from just like on every long holiday no work! But the thing that made me really laugh this time was a letter asking for more folks to help with your program!! You can't keep the folks you hve now in work so you need more folks to leave there computers on to do nothing but use power and wear out parts!!! For 3 days now BOINC is Idle!! Keep up the good work guys don't let me down for Easter ot the 3 day in Feb I love to have my computers sucking up power and wearing out parts for nothing!!!

rev. Tim Olivera


If you want to keep your systems busy when SAH is out of work, then attach to a backup project.

S@H has specifically stated they WILL NOT always have work available and advises users to attach to a backup project for this exact situation.

For once - just once - I think the Rev. Tim has a point. The "please help our project" letter doesn't say "please help our backup project". Co-ordination and timing, that's the thing: if you ask for help, it's only polite to be ready to accept it.
ID: 694165 · Report as offensive
Profile Jan Schotsmans
Avatar

Send message
Joined: 27 Oct 00
Posts: 98
Credit: 92,693
RAC: 0
Belgium
Message 694179 - Posted: 24 Dec 2007, 13:04:21 UTC

Instead of focussing and whining about the CPU cycle request letter sent arround, focus on the "WE NEED FUNDS" one that went arround recently too.

These guys are working severely underpaid for their skillevel, with maybe a quarter of the hardware, half the bandwidth and barely any of the cash they need to keep operating.
ID: 694179 · Report as offensive
Profile Neil Blaikie
Volunteer tester
Avatar

Send message
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 694191 - Posted: 24 Dec 2007, 13:45:17 UTC

Am sure the guys get a ton of messages for help every day and physically don't have time to check every single one. If they did then they would have too much time on their hands.

I have just started to downloaded new tasks, I have a fast computer so it turns a task around in a little near 2 hours. I don't expect them to download quick, currently getting 2.26kpbs with a majority of the tasks in line to download pending.

I don't have a backup project so will just have to sit it out and get work when I can, am sure the guys at Berkeley would appreciate a few days off to relax at home and not have to worry about the work line being empty.

No work, ah well, will get it when it comes.

Happy Holidays all!
ID: 694191 · Report as offensive
Profile John Neale
Volunteer tester
Avatar

Send message
Joined: 16 Mar 00
Posts: 634
Credit: 7,246,513
RAC: 9
South Africa
Message 694201 - Posted: 24 Dec 2007, 14:31:59 UTC - in response to Message 694158.  
Last modified: 24 Dec 2007, 14:33:24 UTC

True to from (sic) just like on every long holiday no work!


Not true, Tim. The SETI servers have never churned out as many new workunits as they have in the past few days. They're actually at their limit, demonstrating what they're capable of.

For 3 days now BOINC is Idle!!


Not true, Tim. Your two active rigs have been receiving and returning work for the past three days.

Rig 1 Rig 2

I've taken issue with you before for your incessant whingeing about this project. It's not justified, in my opinion, but using untruths to propagate your message is disingenuous.
ID: 694201 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 694236 - Posted: 24 Dec 2007, 15:55:07 UTC

Well, the splitters are now screaming away at over 20 WUs/second creation rate. And they are not 17-19 pointers either. The ready to send cache is building, and the bandwidth is starting to relax. Most everybody (including the Reverend) should be slowly starting to get work to keep their crunchers busy.

@ Matt.......
Did this come about in the normal progress of things, or did you deep-six some data for now to ease things up a bit?

And might you create a process whereby the splitters monitor things and automatically suspend a dataset that consists of such short work and return to it once they have built the cache up again? Not to say that it should not be processed, but could it be done in smaller doses so as not to decimate the ready to send cache and send the bandwidth to the limits?
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 694236 · Report as offensive
Profile Neil Blaikie
Volunteer tester
Avatar

Send message
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 694239 - Posted: 24 Dec 2007, 15:58:33 UTC

Maybe if more people take heed of the donation drive that the guys at Berkeley spent time sending to us all and more people sent money then perhaps things would be a bit better.

There will always be the "few" that have things to say regarding the running and subsequent problems associated with a massive project like this.

Come on though people.

It's A) Christmas and the guys at Berkeley have to work / check in on things, for what?, to keep us "joe public" happy. As someone mentioned earlier, attach to a backup project if you want to keep crunching workunits.

B) Anyone who runs servers on a large scale will know that sometimes things don't run smoothly and you get hiccups.

Matt and everyone at Berkeley does a fantastic job to keep everything running as smoothly as possible and they are human like the rest of us. They are allowed to have a break occasionally. With the resources they have and behind the scenes stuff that goes on that we don't see / get to hear about. They should be given as much credit as they can. No donations = ultimately no project and hundreds of hours of valuable time wasted due to lack of funding.

Enjoy the holidays and get a well deserved break. Thanks for all the hard work as always.
ID: 694239 · Report as offensive
seti@elrcastor.com
Volunteer tester

Send message
Joined: 30 Jan 00
Posts: 35
Credit: 4,879,559
RAC: 0
United States
Message 694265 - Posted: 24 Dec 2007, 17:22:14 UTC
Last modified: 24 Dec 2007, 17:22:41 UTC

@Seti People

Great Job on makeing things run so well, and have a great Christmas and New Years.
ID: 694265 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 694268 - Posted: 24 Dec 2007, 17:34:33 UTC
Last modified: 24 Dec 2007, 17:35:28 UTC

On the "clients recently connected" front, I'm getting "last updated: December 31 1969 16:00:00" at 0930 PST 12/24/07, so there are still date issues with the apache logs...
.

Hello, from Albany, CA!...
ID: 694268 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 694276 - Posted: 24 Dec 2007, 18:12:19 UTC - in response to Message 694236.  


And might you create a process whereby the splitters monitor things and automatically suspend a dataset that consists of such short work and return to it once they have built the cache up again? Not to say that it should not be processed, but could it be done in smaller doses so as not to decimate the ready to send cache and send the bandwidth to the limits?


why couldn't the download process give the short time wu's to the slower clients and the longer (normal?) ones to the faster clients. I think biasing the work distribution could be used to even out the network loading, right?
ID: 694276 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 694278 - Posted: 24 Dec 2007, 18:26:30 UTC - in response to Message 694276.  


And might you create a process whereby the splitters monitor things and automatically suspend a dataset that consists of such short work and return to it once they have built the cache up again? Not to say that it should not be processed, but could it be done in smaller doses so as not to decimate the ready to send cache and send the bandwidth to the limits?


why couldn't the download process give the short time wu's to the slower clients and the longer (normal?) ones to the faster clients. I think biasing the work distribution could be used to even out the network loading, right?

That concept has been discussed, don't know if it is being considered or not....

BUT......it would not help in the recent situation where at least 3 entire datasets consisted of nothing but 17-19 point WUs and that's all that was being split for days on end..... The splitters would have to be able to have some logic involved that would cause them to switch off to some different datasets in such a situation.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 694278 · Report as offensive
1 · 2 · Next

Message boards : Technical News : Holiday Checkup (Dec 23 2007)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.