Message boards :
Number crunching :
The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation
Previous · 1 . . . 24 · 25 · 26 · 27 · 28 · 29 · 30 . . . 107 · Next
Author | Message |
---|---|
Jimbocous Send message Joined: 1 Apr 13 Posts: 1853 Credit: 268,616,081 RAC: 1,349 |
Yep, back to doing real work. Just got a few hundred for each of the starving clients. |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
Immediately after I got my caches filled, I started getting just '0 tasks' returns and the caches are depleting again... |
W-K 666 Send message Joined: 18 May 99 Posts: 19087 Credit: 40,757,560 RAC: 67 |
Immediately after I got my caches filled, I started getting just '0 tasks' returns and the caches are depleting again... For me it was only for about 30 mins around 12 noon GMT that there were no new tasks.The cache has been re-filled since then. |
Freewill Send message Joined: 19 May 99 Posts: 766 Credit: 354,398,348 RAC: 11,693 |
|
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13746 Credit: 208,696,464 RAC: 304 |
Results returned and awaiting validation hits another record high. I wonder if it'll make to to 18 million by the 31st? Grant Darwin NT |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13746 Credit: 208,696,464 RAC: 304 |
Starting to get "Project has no tasks available."All the hosts that were in backoff mode are now contacting the Scheduler & returning work and asking for more. Demand exceeds supply. Grant Darwin NT |
Jord Send message Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
And then I couldn't post to the BOINC forums anymore. Posts stay in perpetual trying to send mode. Oh well. Ooh same here |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Starting to get "Project has no tasks available."All the hosts that were in backoff mode are now contacting the Scheduler & returning work and asking for more. Demand exceeds supply. . . Well I swapped over to the NBN tonight and now the wheels have well and truly fallen off. I can contact other sites but BOINC is borked. One machine uploads, reports and downloads just fine. The next machine can upload OK but attempts to report get "cannot contact server, internet is OK". The other two machine cannot upload anything, total fail and almost immediate project backoff. :( . . AAArrrggghhh!!! :( Stephen :( |
W-K 666 Send message Joined: 18 May 99 Posts: 19087 Credit: 40,757,560 RAC: 67 |
I have no idea why but the word KISS popped into my head ;-) |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
Apparently the servers dried up again. The SSP stopped updating and both of my computers stopped getting any new work. Scheduler request still work but always return 0 tasks. The number of results in the database was at a new record when the SSP still updated: 24.186 million. |
Wiggo Send message Joined: 24 Jan 00 Posts: 34887 Credit: 261,360,520 RAC: 489 |
I can't say that I'm having any problems here at all really as my 2 main rigs are topping up on every 2nd or 3rd request. Cheers. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
No new WU, eventually when one drops in my backyard, obviously a resend, is crunched almost instantly because my host is programmed to automatically start any resend first. For now the large cache is holding. Hope that will be fixed during the night. |
Speedy Send message Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
This situation may sort itself out. Possibly when results and progress drop below a certain number work will get sent out again. I am just guessing I have no hard facts. I have 15 tasks mailing for my GPU and 23 currently running on my CPU when they are finished I will be out of work until the server gives me more. I think a lot of us are in this boat. As I posted what happened I got given around 20 CPU tasks. Good luck everyone |
Dr Who Fan Send message Joined: 8 Jan 01 Posts: 3228 Credit: 715,342 RAC: 4 |
The server will probably limp along on what can be done remotely until further notice: California Governor Issues ‘Stay at Home’ Order for Residents |
Unixchick Send message Joined: 5 Mar 12 Posts: 815 Credit: 2,361,516 RAC: 22 |
We have 14 partial blc files to split currently. The 16 splitting channels usually work on 11 files. The splitters are going at full tilt right now filling up the RTS queue (YES!), but I fear not only running out of files to split, but that the rate of splitting will be lower as it is splitting fewer files. We still have Aricebo files, so we won't run out today though. Just wondering if they will add more tomorrow. ??? |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13746 Credit: 208,696,464 RAC: 304 |
Well, the Web site is almost dead. Forums aren't completely dead- just mostly dead. And the Scheduler is MIA again. I'm sure we did this yesterday. Grant Darwin NT |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
The server will probably limp along on what can be done remotely until further noticeThe only things that can't be done remotely are hardware upgrades or pressing the reset button of a crashed and unresponsive server. Servers are managed remotely even when the person doing it is physically on site because servers in server racks rarely even have any local keyboards and displays. And even when there is a local console, no one wants to use it unless he has to because server rooms are too cold and too noisy places to work in. |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
What I think the Seti staff should do is to write and run a simple script that moves all the database rows of workunits waiting for assimilation that aren't waiting for any unreturned results, the result rows linked to those workunits and the workunit and result files associated with them into a separate backup storage. That would remove about half of all the results in the database and the memory pressure they cause, which would probably resolve the server problems for a long enough time to last until the work distribution is stopped. And than after it is stopped and the database starts shrinking, those backed up workunits could be returned back to the assimilation queue to be processed. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14654 Credit: 200,643,578 RAC: 874 |
I've visited two BOINC server rooms - the Einstein ATLAS cluster 10 years ago, and SETI's last summer. Both had a 'crash cart' parked in a corner - a trolley with monitor, keyboard and sundry useful tools. Probably a mouse too, though most server work seems to be done at the command line.The server will probably limp along on what can be done remotely until further noticeThe only things that can't be done remotely are hardware upgrades or pressing the reset button of a crashed and unresponsive server. Servers are managed remotely even when the person doing it is physically on site because servers in server racks rarely even have any local keyboards and displays. In SETI's case, each equipment rack is locked with its own security code. Eric opened one door to plug in the crash cart, and remotely shut down a different server which needed a hard disk replacing. That done and tested, he wheeled another trolley to the server we needed to upgrade, adjusted the working height, and slid that server out of the rack and onto the worksurface. I don't remember it being seriously cold, but certainly 'pleasantly cool' compared to the California summer outside. And I've still got the disposable earplugs I was advised to wear. Souvenir! Fun fact: Eric doesn't even have University authority to enter the server CoLo by himself. We had to meet Jeff Cobb outside to go through the formalities and be signed in. |
Speedy Send message Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Sounds good. In saying this I do not know how to write such a thing plus is it worth writing one for 11 or 12 days? |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.