Oh Yeah.. That.. (Mar 28 2007)

Message boards : Technical News : Oh Yeah.. That.. (Mar 28 2007)
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 537991 - Posted: 28 Mar 2007, 21:35:16 UTC

We never did claim to have totally solved the "slow feeder query" problem plaguing us a month ago (and well before that). The adjustments we made to the database and the way we set up our queries have helped, but last night and well into today mysql fell back into its old habits again. We have a policy to not care about this anymore as we don't have the time, the problem is relatively transient, and we'll be upgrading mysql versions soon enough. My gut tells me this is caused by some kind of mysql housecleaning that gets tickled every so often depending on load.

Aside from that we went ahead with our changes to the science database and employed new solaris versions of the assimilators and splitters. Later (i.e. tomorrow or beyond) we'll install linux versions of the assimilators and validators (thus getting the last remaining backend bits off of kryten). One thing at a time, folks.

The validator queue was growing again. Seems like kryten perhaps needed a reboot to clear its network pipes so I did that. Now the queue is draining. Damn pesky mounts! Soon the validators will run on bruno, i.e. the same machine with the result files. That'll be much better.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 537991 · Report as offensive
Wander Saito
Volunteer tester

Send message
Joined: 7 Jul 03
Posts: 555
Credit: 2,136,061
RAC: 0
Brazil
Message 538023 - Posted: 28 Mar 2007, 22:10:33 UTC

I don't recall in recent years such a flow of good news: new servers, more donations, astropulse released in Beta (the Windows version has just been released), processes getting more efficient... Good work guys! Bruno is certainly a welcome addition, but no server can replace all your hard work. Thanks!

Regards,
Wander

ID: 538023 · Report as offensive
KB7RZF
Volunteer tester
Avatar

Send message
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 538036 - Posted: 28 Mar 2007, 22:30:44 UTC

I have to agree with Wander, this is great. Its nice to see this project getting the attention and needed donations that are desperately needed. Each and every person who has done any kind of hardware/money donation, thank you. To the rest of those that crunch, thanks for putting up with the problems that UCB has had. It sounds like soon things will be a whole lot more reliable than they ever have been for the project, and that is something that each and every one of us should be very proud about helping with.

To the project folks, thank you for the hard work and long hours you all have put in to keep things running. We are all very thankful, more than you all will ever know.

Long live SETI@Home, and happy crunching to all.

Jeremy
ID: 538036 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 538049 - Posted: 28 Mar 2007, 23:03:29 UTC


Keep Up with the Great Work @ Berkeley . . . A Fine Job @ ALL . . . Thanks
ID: 538049 · Report as offensive
Profile Fuzzy Hollynoodles
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 9659
Credit: 251,998
RAC: 0
Message 538061 - Posted: 28 Mar 2007, 23:22:39 UTC - in response to Message 538023.  
Last modified: 28 Mar 2007, 23:23:04 UTC

I don't recall in recent years such a flow of good news: new servers, more donations, astropulse released in Beta (the Windows version has just been released), processes getting more efficient... Good work guys! Bruno is certainly a welcome addition, but no server can replace all your hard work. Thanks!

Regards,
Wander


And no servers without the donations, both money and hardware.

Thank you all, who have donated.



"I'm trying to maintain a shred of dignity in this world." - Me

ID: 538061 · Report as offensive
Profile Pappa
Volunteer tester
Avatar

Send message
Joined: 9 Jan 00
Posts: 2562
Credit: 12,301,681
RAC: 0
United States
Message 538082 - Posted: 29 Mar 2007, 1:18:04 UTC - in response to Message 538061.  
Last modified: 29 Mar 2007, 1:18:41 UTC

To add to Fuzzy's Note, I received an email that was sent to the poeple that donated roughly over $6000.00 of hardware to make the hardware "fixes" possible. Bruno is not new with exception of the CPU's, RAM and Drives... Making things "fit" took weeks and getting it settled in is taking weeks...

In NO Way should any of us start thinking we are out of the woods yet... The path that we are walking has a large clearing that we are going through... Donations are Still Important!

I don't recall in recent years such a flow of good news: new servers, more donations, astropulse released in Beta (the Windows version has just been released), processes getting more efficient... Good work guys! Bruno is certainly a welcome addition, but no server can replace all your hard work. Thanks!

Regards,
Wander


And no servers without the donations, both money and hardware.

Thank you all, who have donated.



Pappa

Please consider a Donation to the Seti Project.

ID: 538082 · Report as offensive
Cherokee150

Send message
Joined: 11 Nov 99
Posts: 192
Credit: 58,513,758
RAC: 74
United States
Message 538290 - Posted: 29 Mar 2007, 13:11:01 UTC
Last modified: 29 Mar 2007, 13:23:30 UTC

Matt,
It appears the upload/download process may have a problem. I have three computers that process SETI on two separate ISPs. One uses cable/modem and two use telephone modems.

They do not have any hard coded addresses, either.

While they have been able to upload completed work units, none have been able to download new work units or update their stats for over six hours now.

Recycling Boinc and SETI and rebooting have not helped, either.

This leads me to believe there is a good chance the problem is not on my end.

I hate to start your day this way. Good luck!
ID: 538290 · Report as offensive
William Roeder
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 69
Credit: 523,414
RAC: 0
United States
Message 538324 - Posted: 29 Mar 2007, 14:49:28 UTC - in response to Message 538290.  

Matt,
While they have been able to upload completed work units, none have been able to download new work units or update their stats for over six hours now.


Same symptoms here on dialup.
ID: 538324 · Report as offensive
William Roeder
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 69
Credit: 523,414
RAC: 0
United States
Message 538327 - Posted: 29 Mar 2007, 15:06:09 UTC - in response to Message 538324.  

Same symptoms here on dialup.


I tried ipconfig /flushdns and bonic -> Advanced -> retry communcations
didn't help at 10:56 EDT
project communication failed at 11:00:33 EDT
Downloads started at 11:00:48 EDT
ID: 538327 · Report as offensive
Profile Paul Hayslett Project Donor
Avatar

Send message
Joined: 3 Aug 00
Posts: 15
Credit: 14,207,862
RAC: 0
United States
Message 538333 - Posted: 29 Mar 2007, 15:28:39 UTC - in response to Message 538327.  

Same problem here, all machines, for a couple of hours. Cycled the router, flushed the caches, no joy.

BUT, Matt shouldn't waste his time looking at the SETI servers. I'm getting the same thing at Rosetta, starting at the same time. So the problem must lie elsewhere.
ID: 538333 · Report as offensive
Profile elendil
Avatar

Send message
Joined: 7 May 02
Posts: 28
Credit: 1,908,698
RAC: 0
Netherlands
Message 538338 - Posted: 29 Mar 2007, 15:37:35 UTC - in response to Message 538290.  

Matt,
It appears the upload/download process may have a problem. I have three computers that process SETI on two separate ISPs. One uses cable/modem and two use telephone modems.

*snip*

This leads me to believe there is a good chance the problem is not on my end.



I'm glad that I'm not the only one with that problem. First thought was that my provider was acting up again :(

-=[ Not all who wander are lost ]=-
ID: 538338 · Report as offensive
Profile Labbie
Avatar

Send message
Joined: 19 Jun 06
Posts: 4083
Credit: 5,930,102
RAC: 0
United States
Message 538349 - Posted: 29 Mar 2007, 16:17:02 UTC

I had this problem earlier today on 5 machines. Rebooted them all and no more problems. They have all uploaded, downloaded and reported.

But maybe I just got lucky.
ID: 538349 · Report as offensive
Profile Fuzzy Hollynoodles
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 9659
Credit: 251,998
RAC: 0
Message 538356 - Posted: 29 Mar 2007, 16:40:34 UTC - in response to Message 538338.  
Last modified: 29 Mar 2007, 16:41:14 UTC

Matt,
It appears the upload/download process may have a problem. I have three computers that process SETI on two separate ISPs. One uses cable/modem and two use telephone modems.

*snip*

This leads me to believe there is a good chance the problem is not on my end.



I'm glad that I'm not the only one with that problem. First thought was that my provider was acting up again :(


I had two WU's stuck in download, one from here and one from Seti Beta (they come from the same server), and since I couldn't see any others report about problems with downloading, I assumed it was my host. But they seemed stuck so I aborted the transfer of both, and soon after I successfully downloaded a Seti Beta WU. And now I have an Astropulse WU in my cache.

So some WU's seem stuck and only aborting the transfer of them can help the problem to go away.


"I'm trying to maintain a shred of dignity in this world." - Me

ID: 538356 · Report as offensive
J. Chris Willmann

Send message
Joined: 14 Dec 99
Posts: 5
Credit: 15,966,380
RAC: 52
Canada
Message 538362 - Posted: 29 Mar 2007, 17:12:13 UTC

WTF?? I've got 3 machines running at two different lacations. They have not been able to upload, or download anything for the past 8 hours or so, despite plenty of work being available.

Oh, looks like the problem has cured itself as I was writting the above. All of a sudden both machines at my current location "miraculosly" fixed themeslves... Hmmm, looks fishy....
ID: 538362 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65747
Credit: 55,293,173
RAC: 49
United States
Message 538368 - Posted: 29 Mar 2007, 18:42:31 UTC
Last modified: 29 Mar 2007, 18:44:20 UTC

I've also been having this problem, I've done ipconfig /release /renew and /flushdns, I've repaired the connection and one time the server was offline and then for one time It worked, then It was back to what It was before, SNAFUed. And I have 4 PCs, 2-Intel QX6700 cpus and 3-AMD Opteron cpus, It's very frustating and depressing.

http://setiathome.berkeley.edu/forum_thread.php?id=38562
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 538368 · Report as offensive
Profile Paul Hayslett Project Donor
Avatar

Send message
Joined: 3 Aug 00
Posts: 15
Credit: 14,207,862
RAC: 0
United States
Message 538386 - Posted: 29 Mar 2007, 19:59:40 UTC - in response to Message 538368.  

All my machines are back up, SETI-crunching happily. I hope that means whatever was broken is now fixed and everyone else's clients will get new work soon.

The SETI crew can brag that they beat Rosetta today. R@H is still wedged tight.

ID: 538386 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65747
Credit: 55,293,173
RAC: 49
United States
Message 538423 - Posted: 29 Mar 2007, 21:41:03 UTC - in response to Message 538386.  

All my machines are back up, SETI-crunching happily. I hope that means whatever was broken is now fixed and everyone else's clients will get new work soon.

The SETI crew can brag that they beat Rosetta today. R@H is still wedged tight.

Ooops, So Rosetta is still borked, Oh well, That just proves the guys here can get things fixed pretty quickly as they are a pretty talented bunch of bananas. ;) Seti couldn't get along with less.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 538423 · Report as offensive
William Roeder
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 69
Credit: 523,414
RAC: 0
United States
Message 538689 - Posted: 30 Mar 2007, 13:39:27 UTC - in response to Message 538423.  

[quote]All my machines are back up, SETI-crunching happily. I hope that means whatever was broken is now fixed and everyone else's clients will get new work soon.


Unfortunately the problem has returned the next day. Now I can't upload either.
(Access succeeded - servers down)
ID: 538689 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 538699 - Posted: 30 Mar 2007, 15:54:10 UTC - in response to Message 538689.  
Last modified: 30 Mar 2007, 15:54:39 UTC

[quote]All my machines are back up, SETI-crunching happily. I hope that means whatever was broken is now fixed and everyone else's clients will get new work soon.


Unfortunately the problem has returned the next day. Now I can't upload either.
(Access succeeded - servers down)


Servers are now (0853 PDT) back online - splitters are still offline, but they're not THAT important.
.

Hello, from Albany, CA!...
ID: 538699 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 538702 - Posted: 30 Mar 2007, 16:05:23 UTC - in response to Message 538362.  

WTF?? I've got 3 machines running at two different lacations. They have not been able to upload, or download anything for the past 8 hours or so, despite plenty of work being available.

Oh, looks like the problem has cured itself as I was writting the above. All of a sudden both machines at my current location "miraculosly" fixed themeslves... Hmmm, looks fishy....

Generally speaking, the best way to "fix" problems is to just wait them out.

BOINC is designed for less than 99.999% reliability at the project/server end, because five-nines reliability would put volunteer/public computing out of reach of most projects.

SETI is probably running somewhere between 90% and 99% reliable right now, and when there is a problem overnight, it gets fixed when Matt, Jeff, Bob, etc. come to work.

... and the client software says "woo hoo!" and sends whatever they've been holding for the last few hours.

If you read the fora (particularly technical news) you'll see that Kryten is getting really tired, and Kryten's load is being moved to another server.

Expect to see things get better as the changes get done..
ID: 538702 · Report as offensive
1 · 2 · 3 · Next

Message boards : Technical News : Oh Yeah.. That.. (Mar 28 2007)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.