Message boards :
Number crunching :
The Server Issues / Outages Thread - Panic Mode On! (118)
Message board moderation
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 94 · Next
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Starting to get dribs and drabs of work. But all downloads are stalling out and backing off 5 hours.Yep, instant timeouts on Linux system downloads now. Windows system (hosts file used) they don't time out instantly, just count away with no download activity, then timeout after a few seconds or a minute or so. It's probably a good thing we can't download any work, as the splitters aren't actually producing any to replace what's already gone from the Ready-to-send buffer. Things are still very, very broken. Grant Darwin NT |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
Starting to get dribs and drabs of work. But all downloads are stalling out and backing off 5 hours. Keith...why are you up?? hahaha |
Cosmic_Ocean Send message Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
After many hours and dozens of "no tasks available", I did the trick for resending lost tasks (even though there weren't any), and the scheduler requests started giving me small handfuls of them. Most of which are--as others have also reported--having transfer issues. But at least the scheduler is giving work out again. Thank you, staff. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving-up) |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
But at least the scheduler is giving work out again.I wish that were so. For those brief periods where downloads are considered to be downloading, the Scheduler response to a work request is "Project has no tasks available." Grant Darwin NT |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Downloads have started again, but "Project has no tasks available" seems the be the current response for work as the Ready-to-send is 0 & splitters aren't. Grant Darwin NT |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
Much the same here, except it slowly appears to be easing off and getting back to normal - both work fetch and downloads seem to be getting easier. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Hmm, managed to pick up some work (new work that is, not resends) in the last 30min or so, Ready-to-send showing 1200, but splitter output has been reported as 0 for about an hour now. Will see how things are come the morning. Night all. Grant Darwin NT |
NorthCup Send message Joined: 6 Jun 99 Posts: 108 Credit: 50,093,984 RAC: 5 |
The Slots are full - all linux-anonymous -systems are operational - thanks again to seti-staff and happy holidays at all - lg Klaus |
petri33 Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 |
OK, I've applied Retvari's workround on my Linux Mint host running special sauce and the spoofed client. There's good news and bad news. Hi, anyone with compilation ability can make the special app to read the gpu number from init_data.xml. in main.cpp you can define BOINC_MAJOR_VERSION to 8 just before the version check ... // Patch for Cuda device selection, Care of Juha Sointusalo. // Deals with boinc api 7.5 onwards breaking change (mid major version), // from standard use of command line to field not present on preferred earlier versions. // init_data.xml has e.g. <gpu_device_num>0</gpu_device_num> #define BOINC_MAJOR_VERSION 8 #if (BOINC_MAJOR_VERSION >= 8) || ((BOINC_MAJOR_VERSION == 7) && (BOINC_MINOR_VERSION >= 5)) if (app_init_data.gpu_device_num >= 0) { gCUDADevPref = app_init_data.gpu_device_num + 1; fprintf(stderr, "app_init.xml specified GPU %d\r\n", app_init_data.gpu_device_num); } #endif ... I tested this and it says now on stderr app_init.xml specified GPU 0 To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
Mike Send message Joined: 17 Feb 01 Posts: 34376 Credit: 79,922,639 RAC: 80 |
My caches are full again as well. With each crime and every kindness we birth our future. |
Wiggo Send message Joined: 24 Jan 00 Posts: 36765 Credit: 261,360,520 RAC: 489 |
My 2 cleaned rigs had their caches full again 45 mins before it's the day after Xmas here and I'm off to bed before that day gets here. Enjoy the festive season everyone. Cheers. |
betreger Send message Joined: 29 Jun 99 Posts: 11415 Credit: 29,581,041 RAC: 66 |
Both boxes are full, Merry Christmas to all. |
Tom M Send message Joined: 28 Nov 02 Posts: 5126 Credit: 276,046,078 RAC: 462 |
Any Stock hosts getting work? Is Resend lost tasks still on? My Windows box is purely stock and appears to have all its tasks reported and a full set of caches. Excepting Beta mind you. Which is still down. I will see if my Linux box has "finally" gotten all its gpu tasks cleared and hopefully the "reports" done. And turn back on the Anonymous platform. Right now I can complain about how slow Stock is under Linux. --Edit-- The Server immediately dumped a bunch of gpu tasks on me. When I started up my Weekend Warrior boxes they too immediately got tasks. May be a bit short on Cpu tasks though. Tom A proud member of the OFA (Old Farts Association). |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Hey, just for Future Reference, Anonymous platform doesn't work on the BETA Server, I discovered that a few months ago. All you receive is Server Errors when trying to download work. You can contact the Server when running No New Tasks, but, that won't get you any work. I suspect if that code is ever moved to Main, All Hell will break loose ;-) BTW, I finally found out how to keep the Main Server from sending all those OpenCL tasks when trying to Spoof the CUDA Special App as Stock. Just add <no_opencl>1</no_opencl> to cc_config.xml, then restart BOINC, and then it will only send tasks for CUDA. It means you also won't run any APs, but, who needs APs when you have the Special App? Now I just have to figure out what to do with all these Ghosts that use to be OpenCL tasks. I kinda though Resend Lost Tasks would be running and take care of it..oh well. Merry Christmas to You too! |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Woke up this morning to a Xmas present of tasks downloading and running. Went to fire up the hosts I had turned off for the weekend. Replica is still behind and not showing the work yet. The validated tasks and the pendings significantly dropped from before the Grand Mal Outrage event. Thanks to the staff for taking my advice of rolling back the server software. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Tom M Send message Joined: 28 Nov 02 Posts: 5126 Credit: 276,046,078 RAC: 462 |
Hey, just for Future Reference, Anonymous platform doesn't work on the BETA Server, I discovered that a few months ago. All you receive is Server Errors when trying to download work. You can contact the Server when running No New Tasks, but, that won't get you any work. I suspect if that code is ever moved to Main, All Hell will break loose ;-) Neat! So now you aren't actually running Anonymous anymore since you found the work around? Tom A proud member of the OFA (Old Farts Association). |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
I'm going to keep testing the App as Stock for now. I did find that boinc-master 7.5 doesn't work on the Mac as Stock, even though it seems to work on Linux. I might recompile the Apps with boinc-master 7.11 soon. |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
My anon boxes have work and everything seems to be fine but the server status page says the replica database is again being left behind at a steady rate, which I guess will predict trouble in the future... |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
My anon boxes have work and everything seems to be fine but the server status page says the replica database is again being left behind at a steady rate, which I guess will predict trouble in the future... I think it will grow to some value and come back down as things stabilize, just like what happened after the last server change. Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
Unixchick Send message Joined: 5 Mar 12 Posts: 815 Credit: 2,361,516 RAC: 22 |
No panic. The replica lag time is decreasing, and the RTS is increasing. Happy Holidays! Thank you Seti crew for working so long yesterday to fix the issues! |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.