Panic Mode On (82) Server Problems?

Message boards : Number crunching : Panic Mode On (82) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 24 · Next

AuthorMessage
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1348695 - Posted: 20 Mar 2013, 13:22:13 UTC - in response to Message 1348684.  

For forums without any form of network, it's weird to see them up. I would've expected both these and the BOINC forums to be unavailable, but albeit slow, they are available.

So it only truly affects uploads and downloads then.

I don't think anybody's quite figured that one out yet.
Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1348695 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1348697 - Posted: 20 Mar 2013, 13:38:25 UTC - in response to Message 1348695.  

Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.

The secret pathway. ;-)
ID: 1348697 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1348698 - Posted: 20 Mar 2013, 13:39:27 UTC - in response to Message 1348697.  

Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.

The secret pathway. ;-)

Indeed!
Maybe it shall never be revealed.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1348698 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1348726 - Posted: 20 Mar 2013, 14:49:32 UTC

Looks like it is recovering. Cricket says it is back. Uploads are still a pain, but I got enough through to get a work request and got a big bunch. Downloads are okay thanks to timestamps.

Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.

The secret pathway. ;-)

Indeed!
Maybe it shall never be revealed

Also very curious.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1348726 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1348745 - Posted: 20 Mar 2013, 16:27:50 UTC - in response to Message 1348726.  

Looks like it is recovering. Cricket says it is back. Uploads are still a pain, but I got enough through to get a work request and got a big bunch. Downloads are okay thanks to timestamps.

Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.

The secret pathway. ;-)

Indeed!
Maybe it shall never be revealed

Also very curious.

I envision they were upgrading hardware and moving cables from one piece of equipment to another. Which saturated the old equipment for a duration making things slow for us.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1348745 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1348810 - Posted: 20 Mar 2013, 19:00:59 UTC - in response to Message 1348745.  

Looks like it is recovering. Cricket says it is back. Uploads are still a pain, but I got enough through to get a work request and got a big bunch. Downloads are okay thanks to timestamps.

Would be interesting after the fact to know what the alternate, though very limited, bandwidth path was.

The secret pathway. ;-)

Indeed!
Maybe it shall never be revealed

Also very curious.

I envision they were upgrading hardware and moving cables from one piece of equipment to another. Which saturated the old equipment for a duration making things slow for us.

Now if they were only doing the optical cables then there's no mystery as the copper line that SETI has has likely been doing the work.

Cheers.
ID: 1348810 · Report as offensive
Profile Floyd
Avatar

Send message
Joined: 19 May 11
Posts: 524
Credit: 1,870,625
RAC: 0
United States
Message 1349526 - Posted: 22 Mar 2013, 18:20:14 UTC
Last modified: 22 Mar 2013, 18:34:16 UTC

Blue line is sunk

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d

Project back off for 5+ hours on uploads and downloads.

Edit: looks like it just went down for a byte ( LOL ) and came back up for air again...
ID: 1349526 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1349549 - Posted: 22 Mar 2013, 18:53:13 UTC - in response to Message 1349526.  

Blue line is sunk

Edit: looks like it just went down for a byte ( LOL ) and came back up for air again...

Yes, look like part of the 'net took a little nap. Remember, though, that that is a suppressed-zero graph; the blue line was at the bottom only because that was the lowest it had been in the graph's interval -- it only got down to about 5 Mbps.
ID: 1349549 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1349666 - Posted: 23 Mar 2013, 3:04:25 UTC
Last modified: 23 Mar 2013, 3:23:12 UTC

I could be wrong, but........

The kitties sense a disturbance in the force.
My cache has been holding steady near the limits since Tuesday's outage.

It has dropped over 100 tasks in the last half an hour.

And this is with 9 rigs running, not an anomaly with one rig.

Hope I am wrong.

Meowhmmmmmm.

EDIT...
Don't think I am wrong.
Cache is down almost another 100.
Something's gone asunder.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1349666 · Report as offensive
Profile Gatekeeper
Avatar

Send message
Joined: 14 Jul 04
Posts: 887
Credit: 176,479,616
RAC: 0
United States
Message 1349679 - Posted: 23 Mar 2013, 3:51:34 UTC - in response to Message 1349666.  
Last modified: 23 Mar 2013, 3:52:47 UTC

I could be wrong, but........

The kitties sense a disturbance in the force.
My cache has been holding steady near the limits since Tuesday's outage.

It has dropped over 100 tasks in the last half an hour.

And this is with 9 rigs running, not an anomaly with one rig.

Hope I am wrong.

Meowhmmmmmm.

EDIT...
Don't think I am wrong.
Cache is down almost another 100.
Something's gone asunder.


Yeah, something is messed up, but it doesn't seem to be the servers.

My work cache on three rigs is dropping as well, but if I go the projects tab, highlight seti, and check properties, I find that all three rigs are in "workfetch backoff" for both CPU and GPU, for time periods ranging from a few minutes to over 40 minutes.

The properties tab is the only place I can see this backoff. BOINC still runs scheduler requests every 5 or so minutes, and reports completed work, and asks for more. But the only return I get is "no work sent".

I've seen this behavior before, and have no idea why it occurs. I will say, though, that it eventually corrects itself.

Hope this post goes through though. I keep getting connection errors on the board here. Nowhere else, just the forums. This is the third time I've tried to post this.

EDIT: Ahh.. finally!
ID: 1349679 · Report as offensive
Horacio

Send message
Joined: 14 Jan 00
Posts: 536
Credit: 75,967,266
RAC: 0
Argentina
Message 1349686 - Posted: 23 Mar 2013, 4:17:40 UTC - in response to Message 1349679.  

EDIT...
Don't think I am wrong.
Cache is down almost another 100.
Something's gone asunder.


Yeah, something is messed up, but it doesn't seem to be the servers.

My work cache on three rigs is dropping as well, but if I go the projects tab, highlight seti, and check properties, I find that all three rigs are in "workfetch backoff" for both CPU and GPU, for time periods ranging from a few minutes to over 40 minutes.

The properties tab is the only place I can see this backoff. BOINC still runs scheduler requests every 5 or so minutes, and reports completed work, and asks for more. But the only return I get is "no work sent".

I've seen this behavior before, and have no idea why it occurs. I will say, though, that it eventually corrects itself.

Hope this post goes through though. I keep getting connection errors on the board here. Nowhere else, just the forums. This is the third time I've tried to post this.

EDIT: Ahh.. finally!


Of the 3 SETI hosts 2 have 200 tasks each and they are still getting replacements for the finished tasks, the 3rd one is slower and has only 145, but it is because is asking only for GPU tasks which have already reached the limits...

Wow... it's really a disturbance in the force... Im not having issu... mmhh... I'd better not say that... ;D
ID: 1349686 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1349698 - Posted: 23 Mar 2013, 6:04:24 UTC - in response to Message 1349679.  
Last modified: 23 Mar 2013, 6:05:18 UTC

The properties tab is the only place I can see this backoff. BOINC still runs scheduler requests every 5 or so minutes, and reports completed work, and asks for more. But the only return I get is "no work sent".

Just had a look in my log & noticed several "No work sent" responses in a row, then finally it allocated 10 WUs.
Requesting GPU & CPU work, i notice there's now a lot of VLARs about.
I suspect that's what's resulting in the "No work sent" messages- the Scheduler gets in a bit of a bind when CPU & GPU work is requested, but only work for the CPU is available.
Grant
Darwin NT
ID: 1349698 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1349760 - Posted: 23 Mar 2013, 11:46:17 UTC

Seems to have been a transient problem.
As of this morning, all is back up to snuff again.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1349760 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1350718 - Posted: 25 Mar 2013, 22:06:22 UTC

Was trying to figure out why my single-core machine was going through an AP and had like 25 or 30 restarts in the stderr output. Then I put two and two together and realized that my 20-month-old niece figured out there's a reset button on the front of that machine. Soooooo.. took the cover off and unplugged the button from the board. Problem solved.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1350718 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22199
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1350951 - Posted: 26 Mar 2013, 21:01:08 UTC

Is it just me?
But uploads are having a real struggle just now, with lots of re-tries and long periods of nothing.
Just a thought, is Bruno feeling OK, or is the thought of the change of environment getting to the old boy?

Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1350951 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1350955 - Posted: 26 Mar 2013, 21:16:48 UTC - in response to Message 1350951.  

Yep, uploading is like pulling hen's teeth. :-(

Cheers.
ID: 1350955 · Report as offensive
Lionel

Send message
Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1350998 - Posted: 26 Mar 2013, 22:54:02 UTC - in response to Message 1350955.  

Yeh, I was up a 6:30 this morning (Melb, Aus) and it was like that then ... sucks doesn't it ...
ID: 1350998 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1351002 - Posted: 26 Mar 2013, 22:59:02 UTC - in response to Message 1350998.  

It does and the likely culprit, as usual, is far too many AP's going out at once.

Cheers.
ID: 1351002 · Report as offensive
Profile Mark Wyzenbeek
Avatar

Send message
Joined: 28 Jun 99
Posts: 134
Credit: 6,203,079
RAC: 0
United States
Message 1351012 - Posted: 26 Mar 2013, 23:18:51 UTC

Yep, upload problems here too.
The Universe is not only stranger than you imagine, it's stranger than you can imagine.

SETI@home classic workunits 1,405 CPU time 57,318 hours
ID: 1351012 · Report as offensive
Profile Gatekeeper
Avatar

Send message
Joined: 14 Jul 04
Posts: 887
Credit: 176,479,616
RAC: 0
United States
Message 1351015 - Posted: 26 Mar 2013, 23:26:19 UTC
Last modified: 26 Mar 2013, 23:26:39 UTC

ditto on the u/l problems. Cricket graph shows spiky since the outage, now trending into the toilet.
ID: 1351015 · Report as offensive
Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (82) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.