Panic Mode On (78) Server Problems?
Author | Message |
---|---|
Cherokee150 Joined: 11 Nov 99 Posts: 192 Credit: 58,513,758 RAC: 74 |
Thank you, Juan! Now we know that they know, and we all know that means they will fix the problem as soon as they can, as they have always done before. :) |
Keith White Joined: 29 May 99 Posts: 392 Credit: 13,035,233 RAC: 22 |
They will be acknowledged as report right now only if you select "No New Tasks" on the projects tab. Well it does on my system. Task gets done, it uploads and then the scheduler reports it, it goes through and the task vanishes from my client's task list. Don't know what to say. "Life is just nature's way of keeping meat fresh." - The Doctor |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
Something other than just bandwidth saturation is at play here. That's my feeling. There have been many times in the past where network traffic has been maxed out, and downloads are pretty much impossible, but you are still able to contact the Scheduler to report work & get more work allocated. The fact is that even now with the network traffic maxed out, if you do (somehow) manage to get some work, it's downloading fairly quickly. Certainly much, much faster than in the past, when you were still able to get a response from the Scheduler. Over the last few months we've had issues with Scheduler timeouts, but not for nearly as long as this time, nor nearly as severe. From memory i would get a response about 1 in 5 to 7 attempts. Now i'm lucky if it's 1 in 20, No New Tasks set or not. Hence i suspect it's a system configuration/load problem, not a network load one. Grant Darwin NT |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
They will be acknowledged as report right now only if you select "No New Tasks" on the projects tab. It gets really hard to quantify it when I have 9 rigs trying to report 1000s of WUs. Hits and misses go by unnoticed by me. Until I check the stats page and I see some rigs have not reported for hours. That page is usually my barometer for the rigs, if I see one has not reported for a while, I suspect a crash and check it out. Not a reliable barometer at the moment. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
They will be acknowledged as report right now only if you select "No New Tasks" on the projects tab. My client_state is 2.4MB in size, my sched_request_setiathome.berkeley.edu is 450kB in size. I suspect yours are a lot smaller. You're in the US, i'm a few thousand kms away. End result- you may be able to get work, i'm lucky if i can even report work- even after 30min of endless Update clicking with No New Tasks set. Grant Darwin NT |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
They will be acknowledged as report right now only if you select "No New Tasks" on the projects tab. Grant, don't feel special. The kitties are equally fukayed from the midwest USA..... I could be on the Berk campus and still be screwed as bad as you right now. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
Just to add to the present fun, i'm now getting some "Couldn't connect to server" messages in response to a Scheduler request. Grant Darwin NT |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
Been getting those for the last 24 hours or more. #2 rig has not been able to get through for an hour and a half. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Cosmic_Ocean Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
I'm having no trouble reporting 1-5 tasks every couple of hours. My cache got a little over-filled when I started hoarding APs, so I'm not asking for more work presently, which I suspect is nearly the equivalent of NNT, since both are effectively "not asking for more work." When I was downloading the APs, they were coming in at around 25KB/sec for each one, sometimes I had 5-8 of them going at a time. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving-up) |
Tron Joined: 16 Aug 09 Posts: 180 Credit: 2,250,468 RAC: 0 |
my linux boxes are reporting and obtaining normal work. but my one windoz box shows nothing but scheduler timeout. it seems to upload ok .. slowly but ok. |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
You folks don't have a clue about hosts processing and trying to return 100s of results an hour. Not the same as saying....oh, my rig did 2 tasks last hour, and both got reported just fine. Yah, right. The kitties got YOUR back, bud. Not dissing anybody, but it's a different class of problems here. The kitties need access to the servers 24/7. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
When I was downloading the APs, they were coming in at around 25KB/sec for each one, sometimes I had 5-8 of them going at a time. Which is more support for the Scheduler issues being server related, not network traffic. Grant Darwin NT |
Tron Joined: 16 Aug 09 Posts: 180 Credit: 2,250,468 RAC: 0 |
msattler wrote: The kitties need access to the servers 24/7. They ought to just send you 500gb unsplit raw drives to crunch :-) Then we'd all have some network bandwidth to spare. |
betreger Joined: 29 Jun 99 Posts: 11361 Credit: 29,581,041 RAC: 66 |
I can't believe it is Karma. It is just that the big guys are constipating the system. Everybody knows that. Until a politically acceptable and economically doable solution is proposed, this is just venting. |
betreger Joined: 29 Jun 99 Posts: 11361 Credit: 29,581,041 RAC: 66 |
Tron, that gets away from the concept of distributed computing. |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
msattler wrote: The kitties need access to the servers 24/7. Hey, that might work....LOL. I'll broach the subject with Eric in our next chat. I'd have to install the splitting software. I think the GPUUG has sent enough HDs for shuttle service. That would be a grand solution, I think. They would have to trust the kitties' science trail....err, tails. Something tells me that letting raw data out of the house would not work scientifically. The kitties are simply scintillated by the thought, though. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Tron Joined: 16 Aug 09 Posts: 180 Credit: 2,250,468 RAC: 0 |
Tron, that gets away from the concept of distributed computing. they still distribute the work, and only, say, the top 25 machines would participate in an HD exchange program. I don't think the work would need to be split either.. just one long stream with nominal checkpointing. |
Cosmic_Ocean Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
You folks don't have a clue about hosts processing and trying to return 100s of results an hour. I wasn't trying to say that it was a situation of "it must just be a problem on your end," I was merely pointing out that I don't have any/many scheduler contact issues, even when only reporting a very small number of tasks. Others are having connection issues when reporting a small number of tasks, and so are those who are reporting a large number. Consider my message as a data point on a graph, or a breadcrumb for trying to pin-point the actual problem. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving-up) |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
...and to rub salt into the wounds, on the one occasion where the Scheduler responded to a request for work, i got 1 WU. 1,000 would be nice, a couple of thousand would be better. Probably at least half as many again to fill my caches. 1 WU! Grant Darwin NT |
Cherokee150 Joined: 11 Nov 99 Posts: 192 Credit: 58,513,758 RAC: 74 |
Perhaps this might be a situation where there is more than one problem occurring at the same time. That, as most of us know from experience, makes diagnosis exceedingly difficult, which seems to fit the current crisis. Could this be a possibility? What do you think? |