Posts by Freewill

61) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2028017)
Posted 17 Jan 2020 by Profile Freewill Project Donor
Post:
For some time now, each of my PCs has been getting exactly the number of new jobs as it reported completed. So, not filling the caches, but at least holding stable...for now.
They have filled their caches. The cache limit is just lower now.


No, I have already accounted for the new cache limits. I am about half what I should have at full.
62) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2028007)
Posted 17 Jan 2020 by Profile Freewill Project Donor
Post:
For some time now, each of my PCs has been getting exactly the number of new jobs as it reported completed. So, not filling the caches, but at least holding stable...for now.
63) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027825)
Posted 16 Jan 2020 by Profile Freewill Project Donor
Post:
It's 2:30 am here but I can't go to bed before my computers can get work at the rate they can crunch it. Can't sleep if the fans start and stop randomly :(

I'm sure the heat is also nice to have in Finland. :) Maybe you'll hit the lottery and get a group of 100 tasks.
64) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027817)
Posted 16 Jan 2020 by Profile Freewill Project Donor
Post:
I think it's the same for all of us, Villa Saari. The servers are probably busy with other tasks for incoming completed work and are prioritizing that over sending out new tasks. Just my guess.
65) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027743)
Posted 15 Jan 2020 by Profile Freewill Project Donor
Post:
I'll just go do some crunching on other stuff :-) Can't really complain; in almost 21 years this has been the first really extended outage I've had.


Kind of disheartening for me, because as a newbie I went out with excitement to recommend the SETI stuff to my group of friends to get them involved, only to have it go down. Most of them sent me emails saying things like, "sorry, not worth my time if it cannot stay up" or, "It's broken, not gonna waste my time." I'll give it another day, but after that, I'll move on.


I understand the feeling. SETI@Home has been running for 20+ years and this is an atypically long weekly maintenance. Hang in there. I hope they'll be back online by end of day, but recovery could take another day as all those hungry PCs try to get data.
66) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027727)
Posted 15 Jan 2020 by Profile Freewill Project Donor
Post:
I haven't been able to report my completed work from yesterday, even with NNT set and the report limit reduced to 20. It either hits a timeout or an HTTP internal error.

I just had one go through with NNT and then the next failed, so hit or miss.
67) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027724)
Posted 15 Jan 2020 by Profile Freewill Project Donor
Post:
I just restarted using SETI again after many years, and the program is new to me. I downloaded it on my laptops, and tasks started running, but the tasks finished, and the program now says no work available to process.
I thought this would automatically upload the old work that had finished, and receive new work.
Do I need to do something, I am not sure what to do now?

We're having an unusually long server maintenance (always starts on Tuesdays). Give it another day or two and you should get some work.
68) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027677)
Posted 15 Jan 2020 by Profile Freewill Project Donor
Post:
the communication from the guys running the show to us is pretty lacking to say the least.

Probably my biggest frustration with this project. Business as usual, I'm afraid.


Yes! I know they are over-worked. Just a 2 or 3 line post in the news section of what is planned for an upcoming outage and 2 or 3 lines of what actually happened would relieve a lot of frustration from the volunteers.
69) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027665)
Posted 15 Jan 2020 by Profile Freewill Project Donor
Post:
Well, the boards are back this morning, USA ET, but still waiting on the servers to be enabled. What will we find?
70) Message boards : Number crunching : SETI/BOINC Milestones [ v2.0 ] - XXIX (Message 2027345)
Posted 11 Jan 2020 by Profile Freewill Project Donor
Post:
Congratulations! That is a really big milestone!
71) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2027061)
Posted 10 Jan 2020 by Profile Freewill Project Donor
Post:
Yep, every host in scheduler backoff due to "internal server error". Can't report work and caches falling.

Ditto for me. Almost a full moon to howl at...
72) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2026501)
Posted 5 Jan 2020 by Profile Freewill Project Donor
Post:
But now the server is out of tasks to send...oh well, at least something is happening.
73) Message boards : Number crunching : Special SETI@Home Fundraiser - 26 x 16TB datacentre drives required for the project's data storage (Message 2025712)
Posted 31 Dec 2019 by Profile Freewill Project Donor
Post:
You gave $100.00
to SETI@home
on December 31, 2019
via Visa

...and my company will match it for $200 total.
74) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2025495)
Posted 29 Dec 2019 by Profile Freewill Project Donor
Post:
Uh oh. I'm seeing no tasks as well, even though there are more than 200K ready to send.
75) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2025013)
Posted 26 Dec 2019 by Profile Freewill Project Donor
Post:
Just found it. In BOINC Manager, the computing preferences were set a bit differently (since the machines have different specs). I think it was an inadvertently low "use at most % of the CPUs" limit. Should have compared that earlier!
76) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2025012)
Posted 26 Dec 2019 by Profile Freewill Project Donor
Post:
I would recommend always using the sched_op_debug logging flag for the Event Log. That way at each scheduler connection you will get a printout of how many seconds of cpu work and gpu work you are requesting. If you don't ask for any seconds of cpu work, then you need to figure out why. Probably a configuration problem where you turned off the cpu for the host or the location venue.

Thanks for the suggestions, everyone. I do have that debugging flag set. It shows no CPU work request:
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | [sched_op] Starting scheduler request
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | Sending scheduler request: To fetch work.
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | Reporting 15 completed tasks
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | Requesting new tasks for NVIDIA GPU
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | [sched_op] NVIDIA GPU work request: 4629217.31 seconds; 0.00 devices
Wed 25 Dec 2019 07:25:02 PM EST | SETI@home | Scheduler request completed: got 15 new tasks
Wed 25 Dec 2019 07:25:02 PM EST | SETI@home | [sched_op] Server version 709
Wed 25 Dec 2019 07:25:02 PM EST | SETI@home | Project requested delay of 303 seconds
Wed 25 Dec 2019 07:25:03 PM EST | SETI@home | [sched_op] estimated total CPU task duration: 0 seconds
Wed 25 Dec 2019 07:25:03 PM EST | SETI@home | [sched_op] estimated total NVIDIA GPU task duration: 1482 seconds
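
A check like the one done by eye on the log above can be sketched in a few lines of Python. This is only an illustration: it assumes Event Log lines keep the exact "[sched_op] <resource> work request: N seconds; M devices" layout seen in this paste, and the helper names (work_requests, idle_resources) are made up for the example, not part of BOINC.

```python
import re

# Assumed line layout, copied from the log pasted above:
# "... | SETI@home | [sched_op] CPU work request: 0.00 seconds; 0.00 devices"
REQUEST_RE = re.compile(
    r"\[sched_op\] (?P<resource>.+?) work request: "
    r"(?P<seconds>\d+(?:\.\d+)?) seconds; (?P<devices>\d+(?:\.\d+)?) devices"
)


def work_requests(log_lines):
    """Map each resource name to the seconds requested in its latest log line."""
    requests = {}
    for line in log_lines:
        match = REQUEST_RE.search(line)
        if match:
            requests[match.group("resource")] = float(match.group("seconds"))
    return requests


def idle_resources(log_lines):
    """List resources for which the client asked for zero seconds of work."""
    return [name for name, secs in work_requests(log_lines).items() if secs == 0.0]


sample = [
    "Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | [sched_op] CPU work request: 0.00 seconds; 0.00 devices",
    "Wed 25 Dec 2019 07:24:59 PM EST | SETI@home | [sched_op] NVIDIA GPU work request: 4629217.31 seconds; 0.00 devices",
]
print(idle_resources(sample))
```

A resource that shows up in the result is one the client itself is not asking work for, which points at a local preference or configuration setting rather than at the server.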

Not sure why this is happening on this one host. It still has no CPU jobs, though it did before the server problems. I just copied back the folder, so none of that changed. I only left in the dont_check_file_sizes flag set to 1. Could that do something like this? But it is also set on the PC with CPU tasks. The other host at the "home" location is getting CPU tasks, and both CPU and GPU are set for that location. I checked the cc_config, app_config, and app_info files. They look the same; I haven't done a diff compare, however.
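
For the diff compare mentioned here, a minimal sketch using Python's standard difflib is shown below. The file paths in the usage note and the sample setting lines are placeholders for illustration only, not real BOINC options or locations.

```python
import difflib
from pathlib import Path


def diff_text(text_a, text_b, name_a="host_a", name_b="host_b"):
    """Return unified-diff lines between two config texts (empty list if identical)."""
    return list(
        difflib.unified_diff(
            text_a.splitlines(),
            text_b.splitlines(),
            fromfile=name_a,
            tofile=name_b,
            lineterm="",
        )
    )


def diff_files(path_a, path_b):
    """Same comparison, reading the two texts from files on disk."""
    return diff_text(
        Path(path_a).read_text(),
        Path(path_b).read_text(),
        str(path_a),
        str(path_b),
    )


# Placeholder contents standing in for the same config file copied from two hosts.
working_host = "shared_setting=1\nuse_cpu=1\n"
problem_host = "shared_setting=1\nuse_cpu=0\n"

for line in diff_text(working_host, problem_host, "T5810", "T3500"):
    print(line)
```

With copies of each host's files saved side by side, something like diff_files("t5810/cc_config.xml", "t3500/cc_config.xml") would list any lines that differ; an empty result means the two files match exactly.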

Wiggo, T3500 is the PC name in case someone wanted to look it up. I have restarted the BOINC manager to no effect. I didn't quite understand what you were suggesting in your reply. Can you explain in a bit more detail?
77) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024978)
Posted 25 Dec 2019 by Profile Freewill Project Donor
Post:
I have two of my rigs back on the special sauce app, a cuda90 version. One of them has both GPU and CPU tasks (T5810-Ubuntu) and the other only has GPU after quite a while (T3500-Ubuntu). All I did was restore the backed up setiathome.berkeley.edu folder. Before the server software issues, both were getting both GPU and CPU tasks. Any idea what I should check? Is it just not long enough for the server to make some CPU tasks?

Thanks and happy holidays!

Roger

I always find on my hosts that the gpu caches are always refilled first. Something to do with the APR of the apps and what the scheduler thinks is fastest. So wait until your gpu cache is fully filled before panicking on if your cpu cache is still not getting filled.


I just noticed the T3500 is only asking for GPU tasks. The preferences for its location should have both GPU and CPU tasks. Any other place where a variable could override that?
78) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024967)
Posted 25 Dec 2019 by Profile Freewill Project Donor
Post:
Got SETI64-Ubuntu, my last and fastest rig back on special sauce. :) Good to see it tearing through those tasks. All back now except for needing a few GPU tasks, but that's minor.

Thanks to the SETI team for getting the servers back in order and my fellow Setizens for helping cope with and spoof stock. Back to Christmas feasting now...
79) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024963)
Posted 25 Dec 2019 by Profile Freewill Project Donor
Post:
Thanks, Keith. Good to know. Neither looks quite full at the moment, but the 5810 may have been earlier.
80) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024959)
Posted 25 Dec 2019 by Profile Freewill Project Donor
Post:
I have two of my rigs back on the special sauce app, a cuda90 version. One of them has both GPU and CPU tasks (T5810-Ubuntu) and the other only has GPU after quite a while (T3500-Ubuntu). All I did was restore the backed up setiathome.berkeley.edu folder. Before the server software issues, both were getting both GPU and CPU tasks. Any idea what I should check? Is it just not long enough for the server to make some CPU tasks?

Thanks and happy holidays!

Roger




 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.