Panic Mode On (80) Server Problems?

Message boards : Number crunching : Panic Mode On (80) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 24 · Next

AuthorMessage
Kevin Benfield

Send message
Joined: 29 Dec 03
Posts: 39
Credit: 30,085,439
RAC: 0
United Kingdom
Message 1324718 - Posted: 4 Jan 2013, 22:00:23 UTC - in response to Message 1324690.  

Yes, I see I have stated to get some, hopefully it will continue, cache was set to 1 day, although I did lower to 0.5 day to see if that helped.

as tasks hopefully come in I will see if I can gradually increase cache number.
ID: 1324718 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1324719 - Posted: 4 Jan 2013, 22:06:11 UTC - in response to Message 1324718.  
Last modified: 4 Jan 2013, 22:08:06 UTC

Yes, I see I have stated to get some, hopefully it will continue, cache was set to 1 day, although I did lower to 0.5 day to see if that helped.

as tasks hopefully come in I will see if I can gradually increase cache number.

There are two cache settings, for Boinc 7 you should set the first one to a higher number, and the second to a lower number, ie 1 + 0.01
(Boinc 7 will wait for the amount of work cached to fall below the 1 setting before asking again, if you put a higher number in the second setting and a low number in the 1st, Boinc 7 will wait until it's almost empty before asking again)

Claggy
ID: 1324719 · Report as offensive
Kevin Benfield

Send message
Joined: 29 Dec 03
Posts: 39
Credit: 30,085,439
RAC: 0
United Kingdom
Message 1324720 - Posted: 4 Jan 2013, 22:10:35 UTC - in response to Message 1324719.  

the number for the second cache is currently 0
ID: 1324720 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1324729 - Posted: 4 Jan 2013, 23:00:31 UTC

With the way the cricket graph is slowly building i wonder if some thing is being used to further track down our gremlins and see where the break point is from a different angle.
and still lots wu RTS.
ID: 1324729 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1324739 - Posted: 5 Jan 2013, 0:00:10 UTC - in response to Message 1324729.  

With the way the cricket graph is slowly building i wonder if some thing is being used to further track down our gremlins and see where the break point is from a different angle.
and still lots wu RTS.

My impression is that they are throttling the feeder, and may be slowly releasing the throttle. Either that or the way things are working the proportion of APs being sent out is increasing for <waves hands> some reason </w>.
ID: 1324739 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 38111
Credit: 261,360,520
RAC: 489
Australia
Message 1324760 - Posted: 5 Jan 2013, 2:05:57 UTC - in response to Message 1324739.  

Now that it seems that the feeder is back up to speed hopefully the backlog of unsent work over the last few days will clear up soon as well.

Cheers.
ID: 1324760 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1324806 - Posted: 5 Jan 2013, 5:25:27 UTC

If I get one more AP, I'll have a full 10-day cache. I guess that's about the only good thing to being CPU-only is it makes a large cache less than the limits.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1324806 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13946
Credit: 208,696,464
RAC: 304
Australia
Message 1324814 - Posted: 5 Jan 2013, 6:12:51 UTC - in response to Message 1324806.  
Last modified: 5 Jan 2013, 6:13:05 UTC

I guess that's about the only good thing to being CPU-only is it makes a large cache less than the limits.

With MB my Core2 Duo can last for about 4 days with long running WUs, probably 5 if they were all VLARs.
On my i7 even with all VLARs the present limits wouldn't give me a days work.
Grant
Darwin NT
ID: 1324814 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51533
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1324924 - Posted: 5 Jan 2013, 14:39:44 UTC

Bad kitty juju here.......
Some time last night there was a power glitch, and EVERYTHING reset.
Most of the rigs restarted OK, but a couple of them hung, including the Frozen One, who's compressor does not appreciate being restarted like that.
Or the freaking modem and router, who apparently cannot come back up without a few restarts by the kitty paws.
Not a great good welcome morning here.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1324924 · Report as offensive
Keith White
Avatar

Send message
Joined: 29 May 99
Posts: 392
Credit: 13,035,233
RAC: 22
United States
Message 1324929 - Posted: 5 Jan 2013, 15:03:15 UTC

I'm concern that they haven't powered down for the scheduled electrical work. I'm concerned because I could see that the staff was told the work was delayed but sometime today someone else will simply cut the power, leading to a mess that'll take days to restore.
"Life is just nature's way of keeping meat fresh." - The Doctor
ID: 1324929 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51533
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1324930 - Posted: 5 Jan 2013, 15:06:14 UTC - in response to Message 1324929.  
Last modified: 5 Jan 2013, 15:07:33 UTC

I'm concern that they haven't powered down for the scheduled electrical work. I'm concerned because I could see that the staff was told the work was delayed but sometime today someone else will simply cut the power, leading to a mess that'll take days to restore.

Keith...

I am sure they have it covered.
The work was rescheduled.
I am sure we shall see a notice when they are gonna do it.
They're not simply gonna cut out the line to the SSL without notice.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1324930 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22793
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1324945 - Posted: 5 Jan 2013, 15:40:24 UTC

The fact that Matt has sent out a message saying that the outage has been cancelled indicates that the outage for the repairs to the SSL-wide power systems has been cancelled for this weekend, and that the whole of the SSL will be breathing a communal sigh of relief. One of two things has happened - the uni has not been able to undertake the repairs this weekend for whatever reason, or the uni has found a way to conduct the repairs without shutting down the whole of the SSL. I suspect the former is the more probable, so there will be a planned outage for the work to be done, and there will be notice of it being done so the whole lab can prepared for it.

(We are looking forward to version four of a similar outage where I work.)
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1324945 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1324951 - Posted: 5 Jan 2013, 15:59:21 UTC

Looks like there can be some outage on Monday, though.
IST Service Status writes:
Scheduled Outage Space Sciences Laboratory

Outage Type: SCHEDULED OUTAGE
Date Submitted: Monday, January 7, 2013
Outage Start/End Time: 0630 – 0700
Groups Impacted: Space Sciences Laboratory
Equipment: sut1ds, sslringsut1fes

Description: Users on the PPCS-SSL-net_169.229.155.240/29 subnet will be without network connectivity while a switchport is reconfigured.

But perhaps that it only affects the forums, and not the data carrier.
ID: 1324951 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51533
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1324954 - Posted: 5 Jan 2013, 16:01:08 UTC - in response to Message 1324945.  

The fact that Matt has sent out a message saying that the outage has been cancelled indicates that the outage for the repairs to the SSL-wide power systems has been cancelled for this weekend, and that the whole of the SSL will be breathing a communal sigh of relief. One of two things has happened - the uni has not been able to undertake the repairs this weekend for whatever reason, or the uni has found a way to conduct the repairs without shutting down the whole of the SSL. I suspect the former is the more probable, so there will be a planned outage for the work to be done, and there will be notice of it being done so the whole lab can prepared for it.

(We are looking forward to version four of a similar outage where I work.)

If they have to do any transformer work, total outage is likely.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1324954 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1324957 - Posted: 5 Jan 2013, 16:05:14 UTC - in response to Message 1324951.  

Looks like there can be some outage on Monday, though.
IST Service Status writes:
Scheduled Outage Space Sciences Laboratory

Outage Type: SCHEDULED OUTAGE
Date Submitted: Monday, January 7, 2013
Outage Start/End Time: 0630 – 0700
Groups Impacted: Space Sciences Laboratory
Equipment: sut1ds, sslringsut1fes

Description: Users on the PPCS-SSL-net_169.229.155.240/29 subnet will be without network connectivity while a switchport is reconfigured.

But perhaps that it only affects the forums, and not the data carrier.

Only half an hour allotted, and before normal working hours at Berkeley - should be minimal impact on us, probably just the forums and front page.
Donald
Infernal Optimist / Submariner, retired
ID: 1324957 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51533
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1324958 - Posted: 5 Jan 2013, 16:07:27 UTC - in response to Message 1324957.  

Looks like there can be some outage on Monday, though.
IST Service Status writes:
Scheduled Outage Space Sciences Laboratory

Outage Type: SCHEDULED OUTAGE
Date Submitted: Monday, January 7, 2013
Outage Start/End Time: 0630 – 0700
Groups Impacted: Space Sciences Laboratory
Equipment: sut1ds, sslringsut1fes

Description: Users on the PPCS-SSL-net_169.229.155.240/29 subnet will be without network connectivity while a switchport is reconfigured.

But perhaps that it only affects the forums, and not the data carrier.

Only half an hour allotted, and before normal working hours at Berkeley - should be minimal impact on us, probably just the forums and front page.

Yeah, OK.
I am an electrician, and know what we can do, LOL.


"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1324958 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22793
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1325024 - Posted: 5 Jan 2013, 18:53:33 UTC

This is an ITS outage, not a power outage, and it only affects (one of) the out of building switches, so the servers will be able to carry on, but there may be no out of building connectivity depending on which switch they are working on and how that switch is being reconfigured.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1325024 · Report as offensive
Keith White
Avatar

Send message
Joined: 29 May 99
Posts: 392
Credit: 13,035,233
RAC: 22
United States
Message 1325032 - Posted: 5 Jan 2013, 19:15:13 UTC - in response to Message 1324930.  
Last modified: 5 Jan 2013, 19:21:09 UTC

I'm concern that they haven't powered down for the scheduled electrical work. I'm concerned because I could see that the staff was told the work was delayed but sometime today someone else will simply cut the power, leading to a mess that'll take days to restore.

Keith...

I am sure they have it covered.
The work was rescheduled.
I am sure we shall see a notice when they are gonna do it.
They're not simply gonna cut out the line to the SSL without notice.

I take it you haven't worked with contractors, utilities or a university maintenance department before. It makes those 6 hour windows for the cable guy seem downright convenient. ;)

Edit: Okay, they did canceled it. Now back to your previously scheduled grumblings already in progress.
"Life is just nature's way of keeping meat fresh." - The Doctor
ID: 1325032 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1325036 - Posted: 5 Jan 2013, 19:36:58 UTC - in response to Message 1325024.  

which switch they are working on and how that switch is being reconfigured.

That can be :-
software
firmware
hardware
reboot
steel toecap boot ....
hammer
wire cuters
trash IT
ID: 1325036 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22793
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1325042 - Posted: 5 Jan 2013, 19:51:33 UTC

Just now downloads are back to their normal - about as fast as a snail with arthritis moving across coarse glass paper...

Unlike when there were 4,000,000 awaiting download when I couldn't get any, but downloads were fast.

I just wonder if they were trying to see what happens if you overfill the tanks, and at what point the downloads "work well". Keep going down, its a long way below the current level.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1325042 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (80) Server Problems?


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.