Message boards :
Number crunching :
Panic Mode On (79) Server Problems?
Message board moderation
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 22 · Next
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13722 Credit: 208,696,464 RAC: 304 |
Just the Scheduler. Apparently (at least for now) they're able to use the campus network for the Scheduler traffic. If you look at the network graphs at present, instead of being around 14-20Mb/s it's been sitting around 10-12Mb/s inbound. I did some pings (posted a few posts before these from memory). No packet loss at all, where as the download server (i use .13 exclusively) is around 50-75% & the upload server is around 50% packet loss. EDIT- & the other real test will be to bump up the limits & see if things fall over again or not. Maybe 400 per core & 1200 per GPU to start with? Hint, hint. Grant Darwin NT |
kittyman Send message Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
Whatever works, I guess. Splitting the scheduler request comms from the download pipe makes a lot of sense. Over the last couple of weeks, I found that when my rigs did a scheduler request, most of the time when I checked my account page, contact WAS made by them at the time of the request. The problem was, they never got answered. So if the scheduler comms can be handled without too many errors, that should help to stop the ghost task generation. Then the only problem is downloads.....which kind of moderate things themselves, as when downloads are backed up, you get scheduler requests to report work which don't ask for new tasks. It's kinda like a salesman driving around in a Porche to take orders, but of course the delivery is by a much slower truck. And if da truck don't deliver da goods, the salesman don't get no more orders....LOL. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
ivan Send message Joined: 5 Mar 01 Posts: 783 Credit: 348,560,338 RAC: 223 |
Now we just need a small tweak to divide those into 'before tonight' and 'after tonight', so we know what effect Eric's changes have had. Here's a graph of my response times (UTC) for the last couple of days -- I couldn't get it to embed, perhaps because of the https. Timed-out requests were set to 330 seconds. https://lh4.googleusercontent.com/-dde5ywVYBuM/UK4sHPNdySI/AAAAAAAAAY0/KCmDzfOo6lI/s800/setiresponse.png [Edit] Spoke too soon; everything's dropped off the cliff and it's timing out again... |
mikeej42 Send message Joined: 26 Oct 00 Posts: 109 Credit: 791,875,385 RAC: 9 |
11/22/2012 8:06:02 AM | SETI@home | Sending scheduler request: Requested by user. <PanicMode>1</PanicMode> It is going to be a long weekend.... [Edit] A U.S. Holyday (sic) for many. 4 days till the work week resumes. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;ranges=d;view=octets Cricket graphics: Shows Dive, dive... dive! But DL rises to incredibles > 250kbps! Without proxy! Any clues??? (edit) <PanicMode>1</PanicMode> +1 |
Fred E. Send message Joined: 22 Jul 99 Posts: 768 Credit: 24,140,697 RAC: 0 |
Would also note that the Server Status Page hasn't updated for over 2 hours. I had good luck w/o a proxy last night, although download speeds were low. If interested, here's a Cricket link that also shows the weekly graph: http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw Another Fred Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
Allready notice that, stuck at 13:00 UTC hours ago... and belive nobody is in the lab because the Thanksgiving holiday besides the ghosts in the machine... i belive we could do nothing else beside open a beer or two and wait... will do my part ASAP, is normal working day here. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
OK, I've been having a scout around. So far.... Well, it happened while I was out at lunch, OK? Nothing to do with me. Jeez, can't I even trust you guys to mind the shop while I go and fetch a sandwich ... LOL :) Something seemed to happen to the scheduler - quite suddenly - at about 13:24 UTC. One host got a timeout, everything else has been "Couldn't connect to server" since then. Synergy has been responding to pings, so I guess the server itself is running, but the programs we need to handle work requests and reports clearly aren't - maybe Apache has failed. I also see that the Server Status Page hasn't updated since [As of 22 Nov 2012 | 13:00:07 UTC]. That usually means that one of the auxiliary servers in the lab, that handles the glue that holds the whole ball of string together, has crashed. Some of the lab servers are on remotely-controlled power strips, so they can be given a remote kicking (power down and power back up). If this failure can be handled like that, we might see some resumption after the staff have finished their holiday lie-ins. Otherwise, we're probably reduced to hoping that some member of staff will accept the excuse to evade the Black Friday shopping trip tomorrow... |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
Take a beer on my acount to help in the waiting task and thanks for the info. Have a good day |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
|
kittyman Send message Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
|
tbret Send message Joined: 28 May 99 Posts: 3380 Credit: 296,162,071 RAC: 40 |
|
dancer42 Send message Joined: 2 Jun 02 Posts: 455 Credit: 2,422,890 RAC: 1 |
One of the red flags that a site has been hacked or spoofed is that you find grammatical or spelling errors that are out of the ordinary, and that make a message hard to read. Did anyone else notice the most recent message on the front page seems to have such errors? For example, "the lookup of result in process", "hosts being assigned large number or [of?] results to compute", and "The host. think it received", among others. These are not normal for the seti@home front page or any technical message one usually finds on the site. No it is the lgm's i just know it. lol |
ivan Send message Joined: 5 Mar 01 Posts: 783 Credit: 348,560,338 RAC: 223 |
...and so we're cranking up to rolling speed again... |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
Master database queries/second 3,438 Congratulations Eric - my, that turkey is going to taste nice when you get home. |
tbret Send message Joined: 28 May 99 Posts: 3380 Credit: 296,162,071 RAC: 40 |
Master database queries/second 3,438 +1 ...etc. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
It´alive! Again... |
Gone Send message Joined: 31 May 99 Posts: 150 Credit: 125,779,206 RAC: 0 |
Yuuupp, thanks to Eric for fixing it on Thanksgiving. |
musicplayer Send message Joined: 17 May 10 Posts: 2430 Credit: 926,046 RAC: 0 |
Having problems reporting to the server? Why not try out the following: Check out your task list. The tasks that have finished up on your computer lists as "Ready to report" in your Tasks tab. There may be other tasks in this tab being in other states. Then check out your Messages tab or the separate Event log for the most recent versions of BOINC Manager. You may have tasks that are being uploaded in this list as well. This should work out for most of the time - at least I do not have to press the Retry button on the uploaded tasks, although they may sometimes hang a little while at 100 % uploaded before finishing up completely. Then choose the "Projects" tab and select "Update" for the selected project. After having done this return back to the tasks tab or the Messages tab / Event log and possibly alternating betweeen these tabs should tell you that the tasks have been reported. This could take a little while, sometimes a couple of minutes. If this does not work out, set "No new tasks" for the selected project and carry out the same process once more, namely push the "Update" button. This should work out, but if you are experienced on this project you may know when this does not work out without even trying it out. The only question is whether you should wait 5 minutes before trying to report with "Allow new tasks" before re-trying with "No new tasks" set active. Does the scheduler acknowledge a request when the client is unable to report to the scheduler? My assumption is that is so, but perhaps this is not correct. |
PCMS Send message Joined: 12 Aug 12 Posts: 2 Credit: 3,903,982 RAC: 0 |
I'm getting tired of that I can not receive tasks or send the calculated files back. I have downsized task in favor of another, which gives me access to upload and sending results. If this is not bedere, considering I stop to make my IDEL time with this service. hope that it will be this service will be bedere. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.