Panic Mode On (7) Server Problems! Closed for Renovation

Message boards : Number crunching : Panic Mode On (7) Server Problems! Closed for Renovation
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 12 · Next

AuthorMessage
Profile John Clark
Volunteer tester
Avatar

Send message
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 759936 - Posted: 28 May 2008, 16:02:19 UTC

On the latest Cricket update the hiccups seemed to have stopped.

Will they find anything when they get in, or should someone give them a clue?
It's good to be back amongst friends and colleagues



ID: 759936 · Report as offensive
Profile SATAN
Avatar

Send message
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 759937 - Posted: 28 May 2008, 16:05:21 UTC

Lets tease them, give them a clue later on.
ID: 759937 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 759947 - Posted: 28 May 2008, 16:18:50 UTC
Last modified: 28 May 2008, 16:19:33 UTC

I thought those were power glitches caused by the light going on in an ancient bar fridge, when Eric opens the fridge door looking for another beer .... I think we should really donate towards a more modern dedicated bar fridge that won't risk compromising data or network connectivity :O
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 759947 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 759959 - Posted: 28 May 2008, 16:48:24 UTC - in response to Message 759947.  
Last modified: 28 May 2008, 16:55:56 UTC

I thought those were power glitches caused by the light going on in an ancient bar fridge, when Eric opens the fridge door looking for another beer .... I think we should really donate towards a more modern dedicated bar fridge that won't risk compromising data or network connectivity :O

Nahhh. It was the cleaners sequentially unplugging various bits of critical kit to plug in the vac's. 's what cleaners are for, isn't it?

F.

[Edit]But Jason's suggestion paints a much more interesting picture.[/Edit]
ID: 759959 · Report as offensive
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 761671 - Posted: 1 Jun 2008, 13:45:07 UTC

Looks like the assimilators are all failing for some reason. And on a weekend, too. Maybe somebody should alert the staff. Oh wait - that's me. I guess I'll look into it.

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 761671 · Report as offensive
gomeyer
Volunteer tester

Send message
Joined: 21 May 99
Posts: 488
Credit: 50,370,425
RAC: 0
United States
Message 761676 - Posted: 1 Jun 2008, 13:52:21 UTC - in response to Message 761671.  

Looks like the assimilators are all failing for some reason. And on a weekend, too. Maybe somebody should alert the staff. Oh wait - that's me. I guess I'll look into it.

- Matt

That's dedication. Awake before 7:00 AM on a Sunday morning and checking his servers already. I'm impressed.
ID: 761676 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51519
Credit: 1,018,363,574
RAC: 1,004
United States
Message 761700 - Posted: 1 Jun 2008, 15:35:15 UTC - in response to Message 761676.  

Looks like the assimilators are all failing for some reason. And on a weekend, too. Maybe somebody should alert the staff. Oh wait - that's me. I guess I'll look into it.

- Matt

That's dedication. Awake before 7:00 AM on a Sunday morning and checking his servers already. I'm impressed.

Many's the time that one of the boyz has given up weekend time to run into the lab and give things a good kick to get things flowing again......they truly are a dedicated bunch....
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 761700 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 761788 - Posted: 1 Jun 2008, 19:41:23 UTC - in response to Message 758155.  

THanks for clearing that up MArk, seems to be an intermittent problem. I think i'll just kick the router in a minute.

Mind your toe when you give it a kick..........


Good advice... About 30 years ago I was having problems getting a "punch card" program to read through the IBM 2540 card reader/punch. The program was about 700 cards long and kept getting read errors. Out of frustration I hauled off and kicked the darn card reader! Broke a toe and walked with a limp for a year. The foolish things we do while still a teen....

Regards,
JDWhale

I kicked an IBM 1622, caught it right on the corner.
ID: 761788 · Report as offensive
Profile Steve Dodd

Send message
Joined: 29 May 99
Posts: 23
Credit: 8,695,373
RAC: 1
United States
Message 762082 - Posted: 2 Jun 2008, 15:27:14 UTC

So Matt,
I see the assimilators are still offline. Since you caught this on Sunday morning and they're still not working, wondering if it's serious?
ID: 762082 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13903
Credit: 208,696,464
RAC: 304
Australia
Message 762365 - Posted: 3 Jun 2008, 7:28:56 UTC - in response to Message 762082.  

wondering if it's serious?

Yep, there's a little message on the home page.
And Scarecrow's Graphs show just how constipated things are.

Grant
Darwin NT
ID: 762365 · Report as offensive
_heinz
Volunteer tester

Send message
Joined: 25 Feb 05
Posts: 744
Credit: 5,539,270
RAC: 0
France
Message 762370 - Posted: 3 Jun 2008, 8:10:39 UTC - in response to Message 761788.  

THanks for clearing that up MArk, seems to be an intermittent problem. I think i'll just kick the router in a minute.

Mind your toe when you give it a kick..........


Good advice... About 30 years ago I was having problems getting a "punch card" program to read through the IBM 2540 card reader/punch. The program was about 700 cards long and kept getting read errors. Out of frustration I hauled off and kicked the darn card reader! Broke a toe and walked with a limp for a year. The foolish things we do while still a teen....

Regards,
JDWhale

I kicked an IBM 1622, caught it right on the corner.

Hi Ned,
I remember me, a frustrated programmer opened the door of the "Central Processor Unit" and kicked into the circuit-boards, but this was his last day, he get fired immediately, so you and Whale had have very big luck....
It was a nice time to work with MFT, MVT etc on this machines.
heinz

ID: 762370 · Report as offensive
Profile hiamps
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 762403 - Posted: 3 Jun 2008, 12:51:09 UTC - in response to Message 762365.  

wondering if it's serious?

Yep, there's a little message on the home page.
And Scarecrow's Graphs show just how constipated things are.

Just upped my cache just in case.
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 762403 · Report as offensive
MarkJ Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 08
Posts: 1139
Credit: 80,854,192
RAC: 5
Australia
Message 762406 - Posted: 3 Jun 2008, 12:55:34 UTC - in response to Message 762082.  

So Matt,
I see the assimilators are still offline. Since you caught this on Sunday morning and they're still not working, wondering if it's serious?


Judging from this snippet (below) taken from the home page news, i'd say things are serious:

June 2, 2008
The assimilators started malfunctioning over the weekend, which indicates that there may be a problem in a portion of the science database. We've had to turn off work generation while we discuss our options. We will probably be out of work in 8 hours or so.

BOINC blog
ID: 762406 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 762456 - Posted: 3 Jun 2008, 15:41:23 UTC - in response to Message 762406.  

So Matt,
I see the assimilators are still offline. Since you caught this on Sunday morning and they're still not working, wondering if it's serious?


Judging from this snippet (below) taken from the home page news, i'd say things are serious:

June 2, 2008
The assimilators started malfunctioning over the weekend, which indicates that there may be a problem in a portion of the science database. We've had to turn off work generation while we discuss our options. We will probably be out of work in 8 hours or so.


Ready to send is down to 52K. It was over 500K last night.
ID: 762456 · Report as offensive
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 762469 - Posted: 3 Jun 2008, 16:06:05 UTC

Not super-serious. The current understanding is that some database spaces filled up, or at least became unwriteable. The solution may be simple but time consuming (off for another day or two). No data loss or server crash or anything like that. More details after the usual outage and the forums are back on line.

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 762469 · Report as offensive
Profile hiamps
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 762471 - Posted: 3 Jun 2008, 16:07:30 UTC - in response to Message 762469.  

Not super-serious. The current understanding is that some database spaces filled up, or at least became unwriteable. The solution may be simple but time consuming (off for another day or two). No data loss or server crash or anything like that. More details after the usual outage and the forums are back on line.

- Matt

Thanks for the update Matt. Have a great day!
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 762471 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13903
Credit: 208,696,464
RAC: 304
Australia
Message 762724 - Posted: 4 Jun 2008, 8:01:15 UTC


Don't want to jinx things, but the system's looking pretty good.
The Work Units & Results Awaiting Deletion cleared not long after the daily outage. The Ready to Send buffer is almost full, and while doing that the Waiting for Validation & Waiting for Assimilation queues have been slowly draining; i expect once the RtS buffer is full they'll start to drain faster?

It would have been interesting to see how well things would have gone if we'd had a big batch of short Work Units going through, i expect these longer ones have helped minimise the server load a lot.
Grant
Darwin NT
ID: 762724 · Report as offensive
Profile Logan
Volunteer tester
Avatar

Send message
Joined: 26 Jan 07
Posts: 743
Credit: 918,353
RAC: 0
Spain
Message 764075 - Posted: 7 Jun 2008, 11:57:31 UTC
Last modified: 7 Jun 2008, 11:59:25 UTC

ID: 764075 · Report as offensive
Profile John Clark
Volunteer tester
Avatar

Send message
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 764076 - Posted: 7 Jun 2008, 11:59:04 UTC
Last modified: 7 Jun 2008, 12:00:05 UTC

Just reporting the same experience in my thread.

Sorry to duplicate.
It's good to be back amongst friends and colleagues



ID: 764076 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51519
Credit: 1,018,363,574
RAC: 1,004
United States
Message 764077 - Posted: 7 Jun 2008, 11:59:17 UTC - in response to Message 764075.  

Well...

Another time the comunications are down...

Cricket graph

Our Mother of cache.....don't fail me now.......LOL.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 764077 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 12 · Next

Message boards : Number crunching : Panic Mode On (7) Server Problems! Closed for Renovation


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.