Panic Mode On (56) Server problems?

Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1157178 - Posted: 29 Sep 2011, 19:13:05 UTC - in response to Message 1157172.  


Looks like things are going to be erratic for a while. I managed to download some work, but now downloads aren't happening, scheduler requests time out, and uploads aren't going through.
Grant
Darwin NT
ID: 1157178
Highlander
Joined: 5 Oct 99
Posts: 167
Credit: 37,987,668
RAC: 16
Germany
Message 1157184 - Posted: 29 Sep 2011, 19:23:46 UTC - in response to Message 1157180.  
Last modified: 29 Sep 2011, 19:24:23 UTC

We're going down again shortly.


Well, you're right again ...
- Performance is not a simple linear function of the number of CPUs you throw at the problem. -
ID: 1157184
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6660
Credit: 121,090,076
RAC: 0
United States
Message 1157199 - Posted: 29 Sep 2011, 19:56:59 UTC

I built up a 5 or 6 day cache on Einstein, and set it to NNT (No New Tasks). If Seti is functional when it depletes, I will load up with Seti work. I will keep evaluating what Seti is doing, although I am hoping that at some point the higher-throughput machines will be able to build a cache. I know this may take a couple of weeks or so, and I'm not concerned. The last couple of months have been a bit tough, but science still got done. ET may be in the data we crunched with flaky DCFs, or near-empty machines.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1157199
[B^S] madmac
Volunteer tester
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 1157219 - Posted: 29 Sep 2011, 20:30:06 UTC

Got some downloaded, but now I cannot upload, yet the server status page states that it is running. Or is this wrong again?
ID: 1157219
Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1157224 - Posted: 29 Sep 2011, 20:37:56 UTC

Cricket says we are down again. Hope it isn't for long. I have at least 2 days' work on both my G4s and have set NNT, but I hear others are running on fumes, so...
Donald
Infernal Optimist / Submariner, retired
ID: 1157224
DMMD
Joined: 14 Feb 00
Posts: 118
Credit: 71,564,960
RAC: 0
Message 1157229 - Posted: 29 Sep 2011, 20:53:20 UTC

My blushes Sten, surely there is a better venue for that ribawdwy!

Try this while you are waiting?

No Time To Lose
ID: 1157229
SupeRNovA
Volunteer tester
Joined: 25 Oct 04
Posts: 131
Credit: 12,741,814
RAC: 0
Bulgaria
Message 1157255 - Posted: 29 Sep 2011, 21:38:40 UTC

I'm running MilkyWay@home; I have finished all my Seti units.
And on MilkyWay@home, NVIDIA is on the losing side for best-performing cards... ATI is the king...
Let's hope they fix the problem soon.
ID: 1157255
soft^spirit
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1157257 - Posted: 29 Sep 2011, 21:39:42 UTC - in response to Message 1157253.  

My Guf is Empty for 36 hours now..can I please get some work!!!!


See News
Janice
ID: 1157257
bill
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1157259 - Posted: 29 Sep 2011, 21:41:20 UTC
Last modified: 29 Sep 2011, 21:42:06 UTC

Just uploaded all WUs. Just downloaded 50 WUs.
All CUDA Fermis.
ID: 1157259
Fred E.
Volunteer tester
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1157261 - Posted: 29 Sep 2011, 21:43:03 UTC

There may be work coming. Little guy just got GPU work on 3 consecutive fetches, all MB, a mix of long and short. No retries; Cricket says I'm not the only one, but not maxed out yet.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1157261
Claggy
Volunteer tester
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1157264 - Posted: 29 Sep 2011, 21:51:46 UTC - in response to Message 1157261.  
Last modified: 29 Sep 2011, 21:53:47 UTC

There may be work coming. Little guy just got GPU work on 3 consecutive fetches, all MB, a mix of long and short. No retries; Cricket says I'm not the only one, but not maxed out yet.

Yep:

29/09/2011 22:48:04 SETI@home [sched_op_debug] Starting scheduler request
29/09/2011 22:48:04 SETI@home Sending scheduler request: Requested by user.
29/09/2011 22:48:04 SETI@home Reporting 12 completed tasks, requesting new tasks for GPU
29/09/2011 22:48:04 SETI@home [sched_op_debug] CPU work request: 0.00 seconds; 0.00 CPUs
29/09/2011 22:48:04 SETI@home [sched_op_debug] NVIDIA GPU work request: 288072.26 seconds; 0.00 GPUs
29/09/2011 22:48:04 SETI@home [sched_op_debug] ATI GPU work request: 432000.00 seconds; 1.00 GPUs
29/09/2011 22:48:15 SETI@home Scheduler request completed: got 67 new tasks
29/09/2011 22:48:15 SETI@home [sched_op_debug] Server version 613
29/09/2011 22:48:15 SETI@home Message from server: No tasks are available for the applications you have selected
29/09/2011 22:48:15 SETI@home Message from server: No tasks are available for Astropulse v505
29/09/2011 22:48:15 SETI@home Message from server: Your preferences allow tasks from applications other than those selected
29/09/2011 22:48:15 SETI@home Message from server: Sending tasks from other applications
29/09/2011 22:48:15 SETI@home Project requested delay of 303 seconds
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total CPU job duration: 0 seconds
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total NVIDIA GPU job duration: 43059 seconds
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total ATI GPU job duration: 0 seconds
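
For anyone who wants to check how much of a request the server actually filled, here is a minimal Python sketch that pulls the requested and estimated seconds out of log lines like the ones above. The regular expressions and the `summarize` helper are illustrative assumptions based only on the lines shown here, not a documented BOINC log format.

```python
import re

# Sample lines copied from the log above. The parsing rules below are an
# assumption inferred from these lines only, not an official log grammar.
LOG = """\
29/09/2011 22:48:04 SETI@home [sched_op_debug] CPU work request: 0.00 seconds; 0.00 CPUs
29/09/2011 22:48:04 SETI@home [sched_op_debug] NVIDIA GPU work request: 288072.26 seconds; 0.00 GPUs
29/09/2011 22:48:04 SETI@home [sched_op_debug] ATI GPU work request: 432000.00 seconds; 1.00 GPUs
29/09/2011 22:48:15 SETI@home Scheduler request completed: got 67 new tasks
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total CPU job duration: 0 seconds
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total NVIDIA GPU job duration: 43059 seconds
29/09/2011 22:48:15 SETI@home [sched_op_debug] estimated total ATI GPU job duration: 0 seconds
"""

# Hypothetical patterns: device name, then the number of seconds.
REQUEST = re.compile(r"\] (?P<dev>.+?) work request: (?P<secs>[\d.]+) seconds")
GRANTED = re.compile(r"estimated total (?P<dev>.+?) job duration: (?P<secs>[\d.]+) seconds")
TASKS = re.compile(r"got (?P<n>\d+) new tasks")


def summarize(log_text):
    """Return (requested seconds per device, estimated seconds per device, task count)."""
    requested, granted, tasks = {}, {}, 0
    for line in log_text.splitlines():
        if m := REQUEST.search(line):
            requested[m["dev"]] = float(m["secs"])
        elif m := GRANTED.search(line):
            granted[m["dev"]] = float(m["secs"])
        elif m := TASKS.search(line):
            tasks = int(m["n"])
    return requested, granted, tasks


if __name__ == "__main__":
    requested, granted, tasks = summarize(LOG)
    print(f"new tasks: {tasks}")
    for dev, asked in requested.items():
        got = granted.get(dev, 0.0)
        share = got / asked if asked else 0.0
        print(f"{dev}: asked {asked:.0f} s, estimated {got:.0f} s ({share:.1%})")
```

On the log above, this reports 67 new tasks, with the NVIDIA GPU estimate (43059 s) covering roughly 15% of the 288072.26 s requested.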

Edit: and all downloaded now.

Claggy
ID: 1157264
Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1157290 - Posted: 29 Sep 2011, 23:30:00 UTC - in response to Message 1157172.  

Stupid machines, it had gone to split requests and asked for CPU tasks first. And Whoopie it got 40.
Next time it requested for the GPU and, lo and behold, reached a &TYR*UG%PI& limit.

I don't understand this one. I was well down on the test rig, but after a few fetches and failed fetches, I got up to the expected 450 (400 GPU and 50 CPU) limit without any problems from partial limits for one or other device on the way.

After a rocky recovery from the failed RAID, we seem to be running OK now (and with plenty of non-shorties in the mix, which should help a lot).
ID: 1157290
Mad Fritz
Joined: 20 Jul 01
Posts: 87
Credit: 11,334,904
RAC: 0
Switzerland
Message 1157315 - Posted: 30 Sep 2011, 1:21:14 UTC - in response to Message 1157290.  


...we seem to be running OK now (and with plenty of non-shorties in the mix, which should help a lot).



Well, not for me

30.09.2011 03:18:00 | SETI@home | Requesting new tasks for CPU and ATI GPU
30.09.2011 03:18:43 |  | Project communication failed: attempting access to reference site
30.09.2011 03:18:43 | SETI@home | Scheduler request failed: Failure when receiving data from the peer
30.09.2011 03:18:45 |  | Internet access OK - project servers may be temporarily down.


ID: 1157315
soft^spirit
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1157318 - Posted: 30 Sep 2011, 1:31:14 UTC

Pipes are running amazingly fast. It feels like something major got fixed.
Janice
ID: 1157318
bill
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1157323 - Posted: 30 Sep 2011, 1:45:39 UTC - in response to Message 1157318.  

Have you taken a look at the Cricket graph?

I think this is how it's supposed to be if the servers
aren't maxed out.
ID: 1157323
Terror Australis
Volunteer tester
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1157325 - Posted: 30 Sep 2011, 2:03:37 UTC

I got 30 units when the servers first came back up. Then there was the 2nd dip in the Cricket graphs, and now I can't connect to upload or download more.

Going direct I get a "connect() failed" message
Through a proxy (tried 2 different ones) I get "http error"

If someone wants to start a fundraiser for a new router to cure the "HE problem" I'll kick the can for $500 straight up

T.A.
ID: 1157325
bill
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1157329 - Posted: 30 Sep 2011, 2:24:39 UTC - in response to Message 1157325.  

I got 30 units when the servers first came back up. Then there was the 2nd dip in the Cricket graphs, and now I can't connect to upload or download more.

Going direct I get a "connect() failed" message
Through a proxy (tried 2 different ones) I get "http error"

If someone wants to start a fundraiser for a new router to cure the "HE problem" I'll kick the can for $500 straight up

T.A.


I want to know exactly what hardware SAH needs
the money for.

They have been asked this before and have been
silent on the issue. I want to know first before
I send in any money for it. Until then, nada.
ID: 1157329
Terror Australis
Volunteer tester
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1157331 - Posted: 30 Sep 2011, 2:35:52 UTC - in response to Message 1157329.  

I got 30 units when the servers first came back up. Then there was the 2nd dip in the Cricket graphs, and now I can't connect to upload or download more.

Going direct I get a "connect() failed" message
Through a proxy (tried 2 different ones) I get "http error"

If someone wants to start a fundraiser for a new router to cure the "HE problem" I'll kick the can for $500 straight up

T.A.


I want to know exactly what hardware SAH needs
the money for.

They have been asked this before and have been
silent on the issue. I want to know first before
I send in any money for it. Until then, nada.


From the "HE Connection Problems Thread"
Folks, the problem is KNOWN to be a flakey router (208.68.243.254) at the Hurricane Electric interconnect in Palo Alto, California. S@H staff is working with HE to get it either fixed or replaced, but until that happens, SOME folks' access to the S@H upload/download pipe will be blocked, and it may not always be the same ones.
ID: 1157331
Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1157348 - Posted: 30 Sep 2011, 3:58:56 UTC - in response to Message 1157331.  

I want to know exactly what hardware SAH needs
the money for.

They have been asked this before and have been
silent on the issue. I want to know first before
I send in any money for it. Until then, nada.

From the "HE Connection Problems Thread"
Folks, the problem is KNOWN to be a flakey router (208.68.243.254) at the Hurricane Electric interconnect in Palo Alto, California. S@H staff is working with HE to get it either fixed or replaced, but until that happens, SOME folks' access to the S@H upload/download pipe will be blocked, and it may not always be the same ones.

It's actually more complicated than that. See this message.
From Vic's research, a replacement Router will cost around $30K - not cheap.

Let's let Eric & co. see what they can do, but if they ask for a fund drive, I'll be in.
Donald
Infernal Optimist / Submariner, retired
ID: 1157348
bill
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1157362 - Posted: 30 Sep 2011, 4:45:48 UTC - in response to Message 1157331.  

(Some snippage)

From the "HE Connection Problems Thread"
Folks, the problem is KNOWN to be a flakey router (208.68.243.254) at the Hurricane Electric interconnect in Palo Alto, California. S@H staff is working with HE to get it either fixed or replaced, but until that happens, SOME folks' access to the S@H upload/download pipe will be blocked, and it may not always be the same ones.


Saw that. "Router" covers a lot of territory.
So who owns it? HE or SAH?

If they need a Data Blaster 9600 with Sapphire Knobs
and Dinochrome mud flaps let 'em say so. It's not like
I'm asking for the submission of a funding proposal.

I'd just like to know the name and model.
ID: 1157362
 