Think Again Thursday (Apr 10 2008)


log in

Advanced search

Message boards : Technical News : Think Again Thursday (Apr 10 2008)

1 · 2 · Next
Author Message
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 736948 - Posted: 10 Apr 2008, 17:53:43 UTC

We thought we had the hardware problem with the workunit download server diagnosed, but looks like we were wrong. False positive. The good news is that the kind folks who donated the thing have another ready to ship. But until we get it, that probably means potential random resets all weekend. Jeff just put an /etc/rc script in place so that upon reset/reboot there's a chance it'll be operational, meaning short glitches instead of multi-hour outages. That's the hope anyway. We might actually test that later today (if it doesn't reset itself on its own). There was discussion about how to implement a second workunit storage server so we don't have this single point of failure anymore. Not as easy as it sounds.

- Matt

____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Profile John Clark
Volunteer tester
Avatar
Send message
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 736949 - Posted: 10 Apr 2008, 17:56:20 UTC

Thanks for the update.

Back tracking is a pain, and implementing a solution is never easy when the reason appeared to have been identified.

Keep your chin up, as it will sort out eventually
____________
It's good to be back amongst friends and colleagues



Profile Andy Worth
Volunteer tester
Avatar
Send message
Joined: 23 Oct 02
Posts: 5807
Credit: 10,408,581
RAC: 0
United Kingdom
Message 736957 - Posted: 10 Apr 2008, 18:06:25 UTC

Ah, "Thinking Thursday" - whoever would have thought it? :D LOL.
Thanks for the update AND the hard work :)
____________


gomeyer
Volunteer tester
Send message
Joined: 21 May 99
Posts: 488
Credit: 50,157,953
RAC: 0
United States
Message 737041 - Posted: 10 Apr 2008, 20:39:07 UTC

Matt, When you get a chance could you kick BETA? Thanks again for the hard work!

Richard HaselgroveProject donor
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8632
Credit: 51,519,532
RAC: 48,009
United Kingdom
Message 737048 - Posted: 10 Apr 2008, 20:54:52 UTC - in response to Message 737041.

Matt, When you get a chance could you kick BETA? Thanks again for the hard work!

And the download server, if the current slowdown isn't a deliberate test/repair.

Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 737053 - Posted: 10 Apr 2008, 21:08:59 UTC

Don't expect much joy for the next 10 hours in beta or the public project - the latest reset triggered a full RAID resync, which will take at least that long. In theory we could run the project while the devices sync up, but due to the extra I/O that will take about two weeks to complete. Sigh.

- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Profile Dr. C.E.T.I.
Avatar
Send message
Joined: 29 Feb 00
Posts: 15993
Credit: 690,597
RAC: 0
United States
Message 737077 - Posted: 10 Apr 2008, 22:09:59 UTC
Last modified: 10 Apr 2008, 22:12:46 UTC

. . . Thanks for Posting Matt - Nice goin' @ Berkeley > things are workin' better here w/ my boxes @ least ;O

< thought You might like a 'slight' distraction;

. . . Thinking Thursday: Magic and Ethics



. . . an ethical dilemma that’s obscure and on the surface, meaningless. However, to a certain subset of a subset of people,

it’s a huge problem that we wrestle with constantly, almost every single time we interact with the public




edit: nice edit Matt ;)
____________
BOINC Wiki . . .

Science Status Page . . .

gomeyer
Volunteer tester
Send message
Joined: 21 May 99
Posts: 488
Credit: 50,157,953
RAC: 0
United States
Message 737099 - Posted: 10 Apr 2008, 23:10:53 UTC - in response to Message 737053.

Don't expect much joy for the next 10 hours in beta or the public project - the latest reset triggered a full RAID resync, which will take at least that long. In theory we could run the project while the devices sync up, but due to the extra I/O that will take about two weeks to complete. Sigh.

- Matt

C'est la vie . . .

Savant
Send message
Joined: 13 Mar 08
Posts: 12
Credit: 250,927
RAC: 0
Canada
Message 737161 - Posted: 11 Apr 2008, 0:46:59 UTC
Last modified: 11 Apr 2008, 0:48:01 UTC

You do your best and we'll do ours (or is that hours?
I do mine with recycled electrons to keep the cost down.
Stan Fockner

zoom314Project donor
Avatar
Send message
Joined: 30 Nov 03
Posts: 46513
Credit: 36,855,611
RAC: 5,125
United States
Message 737211 - Posted: 11 Apr 2008, 3:18:25 UTC - in response to Message 737053.

Don't expect much joy for the next 10 hours in beta or the public project - the latest reset triggered a full RAID resync, which will take at least that long. In theory we could run the project while the devices sync up, but due to the extra I/O that will take about two weeks to complete. Sigh.

- Matt

Well 10 hours is better than 336 hours(2 weeks). :D
____________
My Facebook, War Commander, 2015

PhonAcq
Send message
Joined: 14 Apr 01
Posts: 1622
Credit: 22,367,958
RAC: 6,345
United States
Message 737213 - Posted: 11 Apr 2008, 3:23:29 UTC

It's at times like these I wonder if it would be better to scale back the project, fix/perfect the issues, and then restart the growth cycle. Not doing so seems like a huge psycho-strain by everyone and a source of volunteer-distress.

Another solution would be to rally the user base so that at a given UTC time we all chug 20oz of our individually favorite local brew together, as a team. That may lead to the only real manifestation of global warming, and concurrently may be the only true relief of our user/volunteer angst. I sugeest 1400UTC on Friday, April 11, but am open to alternative times and dates. (Obviously, I'm willing to risk the short term global water shortage when 20m later we all process our beer down the toilet. Life is not worth living unless you risk a bit.)

1mp0£173
Volunteer tester
Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 737217 - Posted: 11 Apr 2008, 3:36:19 UTC - in response to Message 737213.

It's at times like these I wonder if it would be better to scale back the project, fix/perfect the issues, and then restart the growth cycle. Not doing so seems like a huge psycho-strain by everyone and a source of volunteer-distress.

Another solution would be to rally the user base so that at a given UTC time we all chug 20oz of our individually favorite local brew together, as a team. That may lead to the only real manifestation of global warming, and concurrently may be the only true relief of our user/volunteer angst. I sugeest 1400UTC on Friday, April 11, but am open to alternative times and dates. (Obviously, I'm willing to risk the short term global water shortage when 20m later we all process our beer down the toilet. Life is not worth living unless you risk a bit.)

I won't argue against beer drinking, although 1400UTC is a little early here.

I may run out of work for the first time in years, which is saying something since I usually don't carry much of a cache. Then again, I might not run out of work.

So, just in case, I've reattached to LHC, since they seem to have work, and I'll still be here tomorrow when the download server will likely be back up.
____________

Profile Scarecrow
Avatar
Send message
Joined: 15 Jul 00
Posts: 4385
Credit: 459,193
RAC: 1
United States
Message 737233 - Posted: 11 Apr 2008, 4:43:53 UTC

In keeping with the current theme (beer drinking not excluded) I have the perfect title for tomorrow's thread.
(Trying Tuesday, Working Wednesday, Think again Thursday).....


ooops... never mind.... the censor won't let me post my thread title suggestion. ;)

1mp0£173
Volunteer tester
Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 737236 - Posted: 11 Apr 2008, 4:52:39 UTC - in response to Message 737233.

In keeping with the current theme (beer drinking not excluded) I have the perfect title for tomorrow's thread.
(Trying Tuesday, Working Wednesday, Think again Thursday).....


ooops... never mind.... the censor won't let me post my thread title suggestion. ;)

Think Battlestar Galactica....
____________

OzzFan
Volunteer tester
Avatar
Send message
Joined: 9 Apr 02
Posts: 13625
Credit: 31,026,897
RAC: 20,918
United States
Message 737240 - Posted: 11 Apr 2008, 5:27:32 UTC
Last modified: 11 Apr 2008, 5:28:00 UTC

Could always do Fukitol Friday. For those that don't know what that is, I can provide a link to the infomercial if you PM me. ;-)

Profile Scarecrow
Avatar
Send message
Joined: 15 Jul 00
Posts: 4385
Credit: 459,193
RAC: 1
United States
Message 737241 - Posted: 11 Apr 2008, 5:31:25 UTC - in response to Message 737240.

Could always do Fukitol Friday. For those that don't know what that is, I can provide a link to the infomercial if you PM me. ;-)


BINGO!

Iona
Avatar
Send message
Joined: 12 Jul 07
Posts: 567
Credit: 2,913,766
RAC: 2,342
United Kingdom
Message 737242 - Posted: 11 Apr 2008, 5:32:15 UTC - in response to Message 737233.

In keeping with the current theme (beer drinking not excluded) I have the perfect title for tomorrow's thread.
(Trying Tuesday, Working Wednesday, Think again Thursday).....


ooops... never mind.... the censor won't let me post my thread title suggestion. ;)



Your thought was easy to follow and appreciated by me and its great to start the day with a good giggle - although you guys may think differently! I suppose I'll just have to be patient; the odd thing is one of my PCs got more work, only a few minutes before the other tried to get more (and failed). Thats Life, I suppose.



____________
Don't take life too seriously, as you'll never come out of it alive!

Profile MarkJProject donor
Volunteer tester
Avatar
Send message
Joined: 17 Feb 08
Posts: 942
Credit: 25,104,586
RAC: 27,904
Australia
Message 737333 - Posted: 11 Apr 2008, 12:56:33 UTC - in response to Message 737053.

Don't expect much joy for the next 10 hours in beta or the public project - the latest reset triggered a full RAID resync, which will take at least that long. In theory we could run the project while the devices sync up, but due to the extra I/O that will take about two weeks to complete. Sigh.

- Matt


I think the next post should have the subject "running on empty" :-)

My machines will soon be out of work, even with 2 days extra cache on most of them. JD's Thurston won't know what to do with itself.
____________
BOINC blog

WinterKnight
Volunteer tester
Send message
Joined: 18 May 99
Posts: 8685
Credit: 24,978,307
RAC: 29,164
United Kingdom
Message 737342 - Posted: 11 Apr 2008, 13:49:47 UTC

You might be spared running on empty, someone got up early and opened the gates.
Cricket graph
____________
Only two things are infinite: the universe and human stupidity, and I am not sure about the former. - Albert Einstein

Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar
Send message
Joined: 20 Dec 05
Posts: 1960
Credit: 10,475,646
RAC: 10,798
United States
Message 737351 - Posted: 11 Apr 2008, 14:49:43 UTC - in response to Message 737342.
Last modified: 11 Apr 2008, 14:51:59 UTC

You might be spared running on empty, someone got up early and opened the gates.
Cricket graph


No, that's the end of the "ten hours" that Matt mentioned ;),

[add] the d/l server is exhibiting its usual signs of laboring under stress - breaking d/l's in the middle, not starting them, refusing to even consider them, etc.[/add]
____________
.

1 · 2 · Next

Message boards : Technical News : Think Again Thursday (Apr 10 2008)

Copyright © 2014 University of California