The Server Issues / Outages Thread - Panic Mode On! (117)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (117)

AuthorMessage
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2012879 - Posted: 22 Sep 2019, 14:55:00 UTC - in response to Message 2012876.  
Last modified: 22 Sep 2019, 15:00:17 UTC

I guess Richard's technique does work. But, when I did drink beer, I didn't drink before 5 or 6 in the afternoon. It's morning here. :)

You could drink coffee, tea, water, or whatever you like; what is important is to take your mind off the problem and relax. The problem will eventually be fixed, and normally we can't do anything to help, other than maybe sending an email to the lab guys to warn them about the problem, as some do.
ID: 2012879 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2012892 - Posted: 22 Sep 2019, 16:32:20 UTC - in response to Message 2012862.  

My problem is, I'm running Linux and don't have a clue about programming and compiling on Linux. :(

You don't have to understand programming, you just have to follow a recipe.

1) ./_autosetup
2) ./configure
3) make

Then your cake is baked.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2012892 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 2012898 - Posted: 22 Sep 2019, 17:31:42 UTC - in response to Message 2012892.  

My problem is, I'm running Linux and don't have a clue about programming and compiling on Linux. :(

You don't have to understand programming, you just have to follow a recipe.

1) ./_autosetup
2) ./configure
3) make

Then your cake is baked.

Hi Keith,

Yeah, that seems to be the easy part. I'm talking about the source code and how many files it comprises and what to go through to alter the code, etc... :\ Oh and where to put the source code and what to transfer to the BOINC directory etc., etc. I really do not think I'm up to attempting to try.

When I made that statement about "spoofing" I was just stating that I can see a point to doing it since it doesn't take long for a GPU to get through a cache of WUs. With expected or unexpected outages, it can come in handy. :)

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 2012898 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2012901 - Posted: 22 Sep 2019, 17:39:26 UTC
Last modified: 22 Sep 2019, 18:00:33 UTC

You could achieve the same goal without compiling by using rescheduling. There are lots of programs available to do that, some of them on these forums. You just need to dig a little.

I'm glad to hear you now understand why we need spoofing. The simple answer: the builds and GPUs are so fast that a 100 WU limit is unrealistic.
ID: 2012901 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2012903 - Posted: 22 Sep 2019, 17:59:42 UTC
Last modified: 22 Sep 2019, 18:01:13 UTC

Back to the thread topic.

Does anyone really know why the Apache server stops working every Sunday morning?

As Richard posted before, three times in a row is hard to believe as a coincidence.
ID: 2012903 · Report as offensive
Ville Saari
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2012906 - Posted: 22 Sep 2019, 18:22:16 UTC

With 'spoofing' I can get any amount of GPU tasks but if the outage is longer than about 6 hours, I'll run out of CPU tasks too.

Spoofing the CPU count would be useless, as Seti@Home only gives you 100 CPU tasks no matter how many CPU cores you have, but I do have an idea about how to get more. I just haven't implemented it yet.

My idea is this: when the CPU gets close to starving, make the GPU donate a bunch of tasks to the CPU. This can be done with an external program, with no need to modify the BOINC client.
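
A minimal sketch of just the decision step in that idea, written in Python. Everything named here (`QueueState`, `tasks_to_donate`, and the `cpu_low_mark`, `batch` and `gpu_reserve` thresholds) is hypothetical and chosen only for illustration. How an external program would read the queues or actually reassign tasks is not covered.

```python
from dataclasses import dataclass


@dataclass
class QueueState:
    cpu_tasks: int  # tasks currently queued for the CPU
    gpu_tasks: int  # tasks currently queued for the GPU


def tasks_to_donate(state: QueueState, cpu_low_mark: int = 8,
                    batch: int = 20, gpu_reserve: int = 50) -> int:
    """Return how many queued GPU tasks to hand over to the CPU.

    All thresholds are made-up illustration values, not BOINC settings.
    """
    if state.cpu_tasks > cpu_low_mark:
        return 0  # the CPU is not close to starving yet
    spare = state.gpu_tasks - gpu_reserve
    # Donate at most one batch, and never drain the GPU below its reserve.
    return max(0, min(batch, spare))


if __name__ == "__main__":
    print(tasks_to_donate(QueueState(cpu_tasks=3, gpu_tasks=200)))
```

Keeping a reserve on the GPU side is a design choice in this sketch: the GPU burns through its cache much faster than the CPU does, so handing over everything would just move the starvation from one device to the other.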
ID: 2012906 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2012914 - Posted: 22 Sep 2019, 19:31:09 UTC - in response to Message 2012906.  

My idea is this: when the CPU gets close to starving, make the GPU donate a bunch of tasks to the CPU. This can be done with an external program with no need to modify Boinc client.

That's the idea behind "rescheduling"
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2012914 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2012937 - Posted: 23 Sep 2019, 1:11:21 UTC - in response to Message 2012871.  

I don't have to, just follow the instructions in: https://boinc.berkeley.edu/trac/wiki/CompileClient
The code is in standard C++; the last time I saw something like it was more than 35 years ago in my engineering school.
About the outage, that was fast! I could only drink a couple of beers. You see, Richard's technique works perfectly. LOL


. . Thanks for that link but I think I will be up for a lot of reading before I would be ready to try it.

. . And I vote [1] for Richard's (and Wiggo's) technique.

Stephen

8^{
ID: 2012937 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2012939 - Posted: 23 Sep 2019, 1:17:25 UTC - in response to Message 2012876.  

Hi Juan,
Sorry, but I'm not even going to attempt that. I dabbled in C++ programming and some B.A.S.I.C., but gave it up because my brain was not up to the task. It takes way too long for me to learn things, especially complicated things like computer programming. So, I stay away from it even if I could benefit from it as in this case. :)
I guess Richard's technique does work. But, when I did drink beer, I didn't drink before 5 or 6 in the afternoon. It's morning here. :)
Have a great day! :)
Siran


. . I share your trepidation about tackling a task like this, but maybe I will venture out further down the track; it would be a very positive move.

. . On the second point, maybe it's time to change :) Needs do as needs must :) </joke>

Stephen

:)
ID: 2012939 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2012950 - Posted: 23 Sep 2019, 2:00:13 UTC - in response to Message 2012939.  

I seriously doubted my ability, having no experience with programming or compiling, until I was forced into the task by necessity: I had to fix the bug I discovered in the client with a max_concurrent statement in multiple projects.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2012950 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 2012966 - Posted: 23 Sep 2019, 8:33:14 UTC

Just had an upload not go through on its first attempt, and the web site here was MIA for a couple of minutes.
Grant
Darwin NT
ID: 2012966 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 2013016 - Posted: 23 Sep 2019, 18:16:00 UTC - in response to Message 2012763.  

While I'm personally finding this whole Linux thread interesting, perhaps a mod could move the whole thing to another thread as it clearly has nothing to do with the SETI servers being in Panic Mode?

Yeah, my bad. Sorry. I thought it would just be one or two posts and have the problem resolved, but it turned into more than that.

I posted here because this is probably the most-viewed thread where hopefully someone who knows the answer would see it.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 2013016 · Report as offensive
Profile cRunchy
Volunteer moderator
Joined: 3 Apr 99
Posts: 3555
Credit: 1,920,030
RAC: 3
United Kingdom
Message 2013054 - Posted: 23 Sep 2019, 22:05:18 UTC - in response to Message 2013016.  

While I'm personally finding this whole Linux thread interesting, perhaps a mod could move the whole thing to another thread as it clearly has nothing to do with the SETI servers being in Panic Mode?

Yeah, my bad. Sorry. I thought it would just be one or two posts and have the problem resolved, but it turned into more than that.

I posted here because this is probably the most-viewed thread where hopefully someone who knows the answer would see it.


No blame to anyone. It's mostly a community thing the way threads develop.

I have PM'ed arkayn (the thread owner.)

Hopefully arkayn will take a look.

Otherwise a name change might be the best solution.
ID: 2013054 · Report as offensive
Profile arkayn
Volunteer tester
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 2013058 - Posted: 23 Sep 2019, 23:03:29 UTC

I prefer to keep the name the same, as this is the name that everyone looks for when seeing if it is a universal problem or not.

So, if you want to talk about compiling code or how to switch to Linux, please use an appropriate thread as opposed to this one.

ID: 2013058 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2013063 - Posted: 24 Sep 2019, 0:01:40 UTC - in response to Message 2013058.  

I prefer to keep the name the same, as this is the name that everyone looks for when seeing if it is a universal problem or not.
So, if you want to talk about compiling code or how to switch to Linux, please use an appropriate thread as opposed to this one.


. . +1

Stephen

< shrug >
ID: 2013063 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 2013074 - Posted: 24 Sep 2019, 1:13:02 UTC - in response to Message 2013054.  

While I'm personally finding this whole Linux thread interesting, perhaps a mod could move the whole thing to another thread as it clearly has nothing to do with the SETI servers being in Panic Mode?

Yeah, my bad. Sorry. I thought it would just be one or two posts and have the problem resolved, but it turned into more than that.

I posted here because this is probably the most-viewed thread where hopefully someone who knows the answer would see it.


No blame to anyone. It's mostly a community thing the way threads develop.

I have PM'ed arkayn (the thread owner.)

Hopefully arkayn will take a look.

Otherwise a name change might be the best solution.

I went ahead and just made a thread in the Linux Q&A section, since it seems like this won't be a quick fix.

I'll see how it ends up going over there instead of littering this thread with it.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 2013074 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 2013190 - Posted: 24 Sep 2019, 22:10:25 UTC

Just wanted to mention an update here... I finally got it resolved, thanks to TBar.

He provided me with the boinc and boinccmd binaries for 6.10.37 and I had to do some fiddling-around to make it work, but..

2019-09-24 17:53:41 Starting BOINC client version 6.10.37 for x86_64-pc-linux-gnu
2019-09-24 17:53:41 log flags: file_xfer, sched_ops, task, sched_op_debug
2019-09-24 17:53:41 Libraries: libcurl/7.58.0 OpenSSL/1.0.2n zlib/1.2.11 libidn2/2.0.4 libpsl/0.19.1 (+libidn2/2.0.4) nghttp2/1.30.0 librtmp/2.3
2019-09-24 17:53:41 Data directory: /var/lib/boinc-client
2019-09-24 17:53:41 Processor: 8 GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3]
2019-09-24 17:53:41 OS: Linux: 4.15.0-62-generic
...
2019-09-24 17:36:59 SETI@home update requested by user
2019-09-24 17:37:00 SETI@home [sched_op_debug] Fetching master file
2019-09-24 17:37:00 SETI@home Fetching scheduler list
2019-09-24 17:37:05 SETI@home [sched_op_debug] Got master file; parsing
2019-09-24 17:37:05 SETI@home [sched_op_debug] Found 1 scheduler URLs in master file
2019-09-24 17:37:05 SETI@home Master file download succeeded
2019-09-24 17:37:10 SETI@home [sched_op_debug] Starting scheduler request
2019-09-24 17:37:10 SETI@home Sending scheduler request: Requested by user.
2019-09-24 17:37:10 SETI@home Reporting 100 completed tasks, requesting new tasks
2019-09-24 17:37:10 SETI@home [sched_op_debug] CPU work request: 1731456.00 seconds; 4.00 idle CPUs
2019-09-24 17:37:15 SETI@home Scheduler request completed: got 0 new tasks
2019-09-24 17:37:15 SETI@home [sched_op_debug] Server version 709
2019-09-24 17:37:15 SETI@home Message from server: No tasks sent
2019-09-24 17:37:15 SETI@home Message from server: SETI@home v8 needs 32.00MB more disk space. You currently have 0.00 MB available and it needs 32.00 MB.
2019-09-24 17:37:15 SETI@home Project requested delay of 303 seconds


And I figured out that disk space issue. Apparently, when you set a max log size in cc_config, that value counts as "currently used", even if the file isn't anywhere near that number yet. Fixed that, and tried again:

2019-09-24 17:57:52 SETI@home [sched_op_debug] Starting scheduler request
2019-09-24 17:57:52 SETI@home Sending scheduler request: To fetch work.
2019-09-24 17:57:52 SETI@home Requesting new tasks
2019-09-24 17:57:52 SETI@home [sched_op_debug] CPU work request: 1731456.00 seconds; 4.00 idle CPUs
2019-09-24 17:57:57 SETI@home Scheduler request completed: got 24 new tasks
2019-09-24 17:57:57 SETI@home [sched_op_debug] Server version 709
2019-09-24 17:57:57 SETI@home Project requested delay of 303 seconds
2019-09-24 17:57:57 SETI@home [sched_op_debug] estimated total CPU job duration: 102436 seconds
2019-09-24 17:57:57 SETI@home [sched_op_debug] Deferring communication for 5 min 3 sec
2019-09-24 17:57:57 SETI@home [sched_op_debug] Reason: requested by project


So I'm back up and running, finally.

Again, Thank you, TBar! Also Keith for trying to help as well.
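
For anyone comparing the two scheduler requests above, here is a small Python sketch that pulls the task counts and server messages out of client log lines. It assumes the lines look like the ones posted in this message; the function name `summarize` and both regular expressions are my own, not part of BOINC.

```python
import re

# Patterns based only on the log lines quoted in this post.
GOT_TASKS = re.compile(r"Scheduler request completed: got (\d+) new tasks")
SERVER_MSG = re.compile(r"Message from server: (.*)")


def summarize(log_lines):
    """Return (task counts, server messages) found in the given log lines."""
    tasks, messages = [], []
    for line in log_lines:
        match = GOT_TASKS.search(line)
        if match:
            tasks.append(int(match.group(1)))
            continue
        match = SERVER_MSG.search(line)
        if match:
            messages.append(match.group(1).strip())
    return tasks, messages


sample = [
    "2019-09-24 17:37:15 SETI@home Scheduler request completed: got 0 new tasks",
    "2019-09-24 17:37:15 SETI@home Message from server: No tasks sent",
    "2019-09-24 17:57:57 SETI@home Scheduler request completed: got 24 new tasks",
]

if __name__ == "__main__":
    print(summarize(sample))
```

A request that returns 0 tasks together with a "Message from server" line, as in the first attempt, is the case worth reading closely, because the message is where the server states why it sent nothing.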
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 2013190 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2013193 - Posted: 24 Sep 2019, 22:30:56 UTC

Glad you sorted it out. I thought about grabbing that 6.10.37 branch as it had more recent updates to the 6.10.60 branch, but you said you wanted the same as your 6.10.58 so that is what I tried. Should have gone with my gut instinct. Doesn't mean I would have been successful though as I didn't have an old 12.04 lying around.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2013193 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 2013208 - Posted: 25 Sep 2019, 0:13:52 UTC - in response to Message 2013193.  

Glad you sorted it out. I thought about grabbing that 6.10.37 branch as it had more recent updates to the 6.10.60 branch, but you said you wanted the same as your 6.10.58 so that is what I tried. Should have gone with my gut instinct. Doesn't mean I would have been successful though as I didn't have an old 12.04 lying around.

I just didn't want to go BEYOND the 6.10 branch. A slight rollback is fine. Just needed to get the openssl library to be newer than 0.9.8g is all, so that comms could happen properly.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 2013208 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 2013233 - Posted: 25 Sep 2019, 6:55:01 UTC

The servers are certainly struggling at the moment: 160k+/hr coming in, and while the splitters are occasionally able to crank out the work, they just can't sustain the output needed to rebuild the ready-to-send buffer.
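
A rough way to picture why the buffer does not recover, sketched in Python. The function `hours_to_rebuild` and every number other than the 160k/hr figure are assumptions for illustration, and treating 160k/hr as the rate at which work also has to go back out is itself an assumption.

```python
def hours_to_rebuild(buffer_now, buffer_target, split_rate, send_rate):
    """Hours until the ready-to-send buffer reaches its target.

    Rates are in tasks per hour. Returns None when the splitters do not
    outpace demand, in which case the buffer never refills.
    """
    net = split_rate - send_rate
    if net <= 0:
        return None
    return max(0.0, (buffer_target - buffer_now) / net)


if __name__ == "__main__":
    # Hypothetical splitter rates on either side of a 160k/hr demand.
    print(hours_to_rebuild(0, 100_000, 150_000, 160_000))
    print(hours_to_rebuild(0, 100_000, 180_000, 160_000))
```

The point is only that short bursts of splitter output do not help unless the average rate stays above the rate at which work is handed out.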
Grant
Darwin NT
ID: 2013233 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.