Panic Mode On (100) Server Problems?

Message boards : Number crunching : Panic Mode On (100) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 24 · 25 · 26 · 27 · 28 · 29 · 30 . . . 32 · Next

AuthorMessage
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1731212 - Posted: 2 Oct 2015, 17:29:23 UTC - in response to Message 1731209.  
Last modified: 2 Oct 2015, 17:47:54 UTC

Thanks Matt, I'll stop fiddling further then.

Isn't it an idea to put that up as a news item, though?

Edit: BOINC domain popped up, just in time for me to fall into a big fight in the making over there. What is it with you people?
ID: 1731212 · Report as offensive
Profile ReiAyanami
Avatar

Send message
Joined: 6 Dec 05
Posts: 116
Credit: 222,900,202
RAC: 174
Japan
Message 1731218 - Posted: 2 Oct 2015, 17:41:55 UTC

Thank you Matt for the update.
I could connect to the server finally and it's been OK for last 2 hours.
I keep my fingers crossed ;)
ID: 1731218 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1731237 - Posted: 2 Oct 2015, 19:30:47 UTC
Last modified: 2 Oct 2015, 19:35:56 UTC

So looks like resolved? Nope, I guess I just got lucky once and was able to report the 300+ tasks and fill the cache once. For now, I guess that will have to do ...
BTW, the replica DB is offline again ...
ID: 1731237 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1731249 - Posted: 2 Oct 2015, 20:59:06 UTC - in response to Message 1731177.  

183 WUs were acknowledged at 14:27 UTC and I have work coming in again. Life is good.



I had to go and call a no-hitter didn't I. Will I ever learn? Now 32 are waiting to be acknowledged and no new work.
ID: 1731249 · Report as offensive
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1731254 - Posted: 2 Oct 2015, 21:28:58 UTC

This is life, la laa laa li la,,,

lala la laa li la, la laa laa li la,

la la la, la life.

[opus5]

262+ not reported,

problems stil.
- master file download successful
- ,,, error.


la 3. lokakuuta 2015 00.15.40 | SETI@home | Scheduler request failed: Couldn't connect to server
la 3. lokakuuta 2015 00.15.41 | SETI@home | update requested by user
la 3. lokakuuta 2015 00.15.42 | | Internet access OK - project servers may be temporarily down.
la 3. lokakuuta 2015 00.15.45 | SETI@home | Fetching scheduler list
la 3. lokakuuta 2015 00.15.47 | SETI@home | Master file download succeeded
la 3. lokakuuta 2015 00.15.52 | SETI@home | Sending scheduler request: Requested by user.
la 3. lokakuuta 2015 00.15.52 | SETI@home | Reporting 262 completed tasks
la 3. lokakuuta 2015 00.15.52 | SETI@home | Requesting new tasks for CPU and NVIDIA
la 3. lokakuuta 2015 00.15.55 | | Project communication failed: attempting access to reference site
la 3. lokakuuta 2015 00.15.55 | SETI@home | Scheduler request failed: Couldn't connect to server
la 3. lokakuuta 2015 00.15.56 | | Internet access OK - project servers may be temporarily down.
la 3. lokakuuta 2015 00.15.56 | SETI@home | update requested by user
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1731254 · Report as offensive
Profile ReiAyanami
Avatar

Send message
Joined: 6 Dec 05
Posts: 116
Credit: 222,900,202
RAC: 174
Japan
Message 1731255 - Posted: 2 Oct 2015, 21:29:33 UTC

OK, it lasted only for another half an hour. Now I can't connect for last 3 hours....mmmm
ID: 1731255 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1731258 - Posted: 2 Oct 2015, 21:33:18 UTC

I know I made a long post yesterday explaining that engineering a network isn't trivial and straight-forward, and I still think that it is a routing table problem. I remember in the Cisco labs, the whole class got together and we each divided and conquered and configured a router and had it connected to a second NIC in each of our workstations (there were 24 of us in the class), and we were using a variety of routing table protocols (RIP, RIPv2, IGRP, EIGRP, OSPF, and BGP) to see how each of those propagates and to see the pros and cons of them all.

I don't remember the specifics of which ones did what, but I do remember BGP (which is what the Internet at large uses) taking the longest to propagate and to have the complete map of the network, and it is also one that worked as expected for a few minutes, until most of the workstations were unreachable, and it was because one person's finger slipped when typing in a string of characters or numbers and ended up doing a 3 instead of a 2 or something akin to that. Basically, everything was fine until their misconfigured router tried syncing up with everything else.

It happens, and it took the whole class a while to figure it out. We even went so far as to go into the admin console and do 'clear nvram; restart' and started over with a blank router.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1731258 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1731260 - Posted: 2 Oct 2015, 21:34:08 UTC
Last modified: 2 Oct 2015, 21:35:41 UTC

So much for the sound of knuckles rapping loudly on wood........[edit](and now it is Friday).

"Sour Grapes make a bitter Whine." <(0)>
ID: 1731260 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 1731263 - Posted: 2 Oct 2015, 21:43:51 UTC - in response to Message 1731160.  
Last modified: 2 Oct 2015, 22:07:29 UTC

2 days work now all reported ok and new work downloaded. Thank you to whoever fixed it.

It's still broken.

Still taking multiple attempts to report & get new work.
Still getting sticky downloads.


EDIT- it doesn't appear to be taking as many attempts as it was before finally getting through.
Grant
Darwin NT
ID: 1731263 · Report as offensive
Herb Smith
Volunteer tester

Send message
Joined: 28 Jan 07
Posts: 76
Credit: 31,615,205
RAC: 0
United States
Message 1731264 - Posted: 2 Oct 2015, 21:44:07 UTC

It appears the problem came back for me at about 18:00 UTC. But at least I have full cache. So good for 12 hours on my big box, 24 on others.

Herb
ID: 1731264 · Report as offensive
Herb Smith
Volunteer tester

Send message
Joined: 28 Jan 07
Posts: 76
Credit: 31,615,205
RAC: 0
United States
Message 1731265 - Posted: 2 Oct 2015, 21:46:12 UTC
Last modified: 2 Oct 2015, 21:49:29 UTC

.aa
ID: 1731265 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1731278 - Posted: 2 Oct 2015, 22:10:02 UTC - in response to Message 1731268.  

Works pretty good here now. No need for manual updates. My computers are well fed with work, and new work keeps getting in when needed.

But they continue to not split APs.
ID: 1731278 · Report as offensive
Baiteh

Send message
Joined: 10 Sep 15
Posts: 34
Credit: 7,705,483
RAC: 0
United Kingdom
Message 1731289 - Posted: 2 Oct 2015, 22:34:09 UTC

Can't get new credit here at all. It seems to have uploaded my work, just not getting new ones.
ID: 1731289 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1731296 - Posted: 2 Oct 2015, 23:00:39 UTC

Campus is aware of the issues. It's out of our control.
Is it a fair question to ask if this situation affects any other sectors of Campus IT or just SETI?

"Sour Grapes make a bitter Whine." <(0)>
ID: 1731296 · Report as offensive
OGM

Send message
Joined: 14 Apr 15
Posts: 12
Credit: 1,001,458
RAC: 0
Portugal
Message 1731297 - Posted: 2 Oct 2015, 23:04:24 UTC - in response to Message 1731182.  

... and it's not working again...
ID: 1731297 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1731316 - Posted: 3 Oct 2015, 0:16:56 UTC - in response to Message 1731296.  

Campus is aware of the issues. It's out of our control.
Is it a fair question to ask if this situation affects any other sectors of Campus IT or just SETI?


I am willing to bet it isn't affecting chancellor Dirks and his staff. :)
ID: 1731316 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 1731317 - Posted: 3 Oct 2015, 0:18:53 UTC - in response to Message 1731316.  

Ah, we're back.
For a while there I could get the home page, but nothing else. Then I couldn't even get the home page.
Lets see how long this lasts.
Grant
Darwin NT
ID: 1731317 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1731324 - Posted: 3 Oct 2015, 0:46:41 UTC - in response to Message 1731317.  

Ah, we're back.
For a while there I could get the home page, but nothing else. Then I couldn't even get the home page.
Lets see how long this lasts.


Not for everyone. Still seeing "Internet access OK - project servers may be temporarily down." with every connection. 102 times in a row now. The last good connection was at 18:00:17 UTC.
ID: 1731324 · Report as offensive
Profile Oz
Avatar

Send message
Joined: 6 Jun 99
Posts: 233
Credit: 200,655,462
RAC: 212
United States
Message 1731348 - Posted: 3 Oct 2015, 1:47:17 UTC
Last modified: 3 Oct 2015, 1:49:14 UTC

Along with posting here, might I suggest that people send a "problem with service" email to:

itcsshelp@berkeley.edu

A few dozen (or hundred or thousand) emails might help Berkeley IST grasp the scope of the problem.

(Be polite!)
Member of the 20 Year Club



ID: 1731348 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1731352 - Posted: 3 Oct 2015, 2:07:08 UTC - in response to Message 1731348.  

Along with posting here, might I suggest that people send a "problem with service" email to:

itcsshelp@berkeley.edu

A few dozen (or hundred or thousand) emails might help Berkeley IST grasp the scope of the problem.

(Be polite!)

And I would suggest NOT doing that.

Matt said they are aware of the problem and are working on it.
Flooding the Help Desk with Trouble reports about a problem they are already working on may serve only to piss-off the folks on the help-desk, and possibly IT management.
Donald
Infernal Optimist / Submariner, retired
ID: 1731352 · Report as offensive
Previous · 1 . . . 24 · 25 · 26 · 27 · 28 · 29 · 30 . . . 32 · Next

Message boards : Number crunching : Panic Mode On (100) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.