Panic Mode On (113) Server Problems?

Message boards : Number crunching : Panic Mode On (113) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 37 · Next

AuthorMessage
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1959112 - Posted: 7 Oct 2018, 19:50:10 UTC

Is it time to panic yet?? Oh wait, my computer been off for a week already haha, never mind....
ID: 1959112 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1959113 - Posted: 7 Oct 2018, 19:54:22 UTC - in response to Message 1959111.  

TBar repackaged the All-in-One recently to just a single download for both the 7.4.44 and 7.8.3 versions. The main files in the BOINC folder are the 7.8.3 versions. He mentioned in the message you quoted that the 7.4.44 version is in the /Docs folder in the archive.

If you want to run the 7.4.44 version with its 3000 task limit, then just copy the five main BOINC executable files into the BOINC directory. You need to copy boinc, boinmgr, boinccmd, boincscr and switcher.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1959113 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1959116 - Posted: 7 Oct 2018, 20:00:15 UTC

I don't know what to do about the repository versions since I never ran them. Probably can just delete the directories. No the TBar versions don't automatically start BOINC. You either start them manually or set them to run automatically with the Startup Applications utility.

I just start BOINC manually since the computers rarely get restarted and run all the time unless I need to reboot for an update or something. You can just mark boincmgr as a Favorite and Dock it to the Launcher panel for easy starting.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1959116 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1959117 - Posted: 7 Oct 2018, 20:00:51 UTC - in response to Message 1959113.  

If you want to run the 7.4.44 version with its 3000 task limit, then just copy the five main BOINC executable files into the BOINC directory. You need to copy boinc, boinmgr, boinccmd, boincscr and switcher.

I´m running repo version 7.6.31 under Mint 18.3, wondering if I can copy those files also.

I tried running 7.4.44 from the home folder which worked and I can attach the project, but it seems to be stuck in creating a computer ID.
ID: 1959117 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1959119 - Posted: 7 Oct 2018, 20:06:32 UTC - in response to Message 1959111.  

EDIT: nevermind, i got it. i missed that this package was not 7.4.44 natively and you had to extract the 7.4.44 components separately. got it now.

also, do i have to do anything special to remove my old service install of BOINC 7.9.3? when installing the dependencies and copying boinc folder to my home folder, i noticed my service install blanked out and removed my project.

running sudo apt-get remove boinc-client boinc-manager returned that neither were installed, which seemed odd to me. is it gone from something i did? can i just delete the boinc-client and boinc folders from my var/lib/ directory?

will this new version auto-start at system boot like the service install does?
No, the Berkeley version isn't set to autostart, you have to add the boincmgr to the startup list and be careful to name the Full path to boincmgr. You need to open the Synaptic Package Manager, search and remove everything in the System Folder, INCLUDING CONFIGURATION FILES, that has boinc or seti in the name. Be careful not to delete the Files in your Home Folder, but, any files related to boinc needs to be removed from the System folder. Most people find that difficult to do and leave configuration files in the System folder, which causes problems. It's best to do a Clean install of the System Folder and Never Install the Repository version...only way to be sure.
ID: 1959119 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1959120 - Posted: 7 Oct 2018, 20:07:10 UTC - in response to Message 1959117.  

Not sure about your issue. First I have heard of someone trying to run the TBar executables in a repository installation. Most just install it to the /Home folder as TBar intended. You can always just reuse your original host id by editing the client_state.xml and putting your old host ID back in. Just remember to increment your rep_seqno by 1 from what the website states is the number of times the host has contacted the project in the client_state file before you let the client contact the project.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1959120 · Report as offensive
RickToTheMax

Send message
Joined: 22 May 99
Posts: 105
Credit: 7,958,297
RAC: 0
Canada
Message 1959125 - Posted: 7 Oct 2018, 20:34:33 UTC
Last modified: 7 Oct 2018, 20:44:37 UTC

I was using the 7.8.3 from TBar, can i just replace the files and not care about host ID at all?
Or i still need to modify the client state file to keep the same ID?

Even tho i had 1000 WUs before the crash, i am about to run out on my single gpu host already because of a ton of noisy WUs =( 3000 limit would have helped probably!
ID: 1959125 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1959126 - Posted: 7 Oct 2018, 20:54:05 UTC - in response to Message 1959119.  
Last modified: 7 Oct 2018, 21:19:02 UTC

EDIT: nevermind, i got it. i missed that this package was not 7.4.44 natively and you had to extract the 7.4.44 components separately. got it now.

also, do i have to do anything special to remove my old service install of BOINC 7.9.3? when installing the dependencies and copying boinc folder to my home folder, i noticed my service install blanked out and removed my project.

running sudo apt-get remove boinc-client boinc-manager returned that neither were installed, which seemed odd to me. is it gone from something i did? can i just delete the boinc-client and boinc folders from my var/lib/ directory?

will this new version auto-start at system boot like the service install does?
No, the Berkeley version isn't set to autostart, you have to add the boincmgr to the startup list and be careful to name the Full path to boincmgr. You need to open the Synaptic Package Manager, search and remove everything in the System Folder, INCLUDING CONFIGURATION FILES, that has boinc or seti in the name. Be careful not to delete the Files in your Home Folder, but, any files related to boinc needs to be removed from the System folder. Most people find that difficult to do and leave configuration files in the System folder, which causes problems. It's best to do a Clean install of the System Folder and Never Install the Repository version...only way to be sure.


ok thanks. for now it seems to be fine for now with just the deletion of the /var/lib/boinc-client folder which contained all the previous config files (i copied them into the /home/BOINC/ folder to keep all my previous stuff). i'll searh through and see if there is anything with boinc in it outside of my home folder, EDIT: looks like the only package remaining was libboinc something. it didnt seem to be causing any problems but i removed it anyway.

i've given the system a few reboots and everything is working normally. i dont care too much about autostart for the same reasons Keith listed, I was just curious of its exact setup. If i want it later, it seems to be fairly trivial to add autostart.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1959126 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1959133 - Posted: 7 Oct 2018, 21:19:31 UTC

This is the "Panic Mode" thread

Can we keep discussions in that vein thank you.
ID: 1959133 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1959134 - Posted: 7 Oct 2018, 21:26:34 UTC - in response to Message 1959126.  

Fine with me. But, don't be surprised after the next system update to find a repository version of boinc back in var/lib with a new system ID and loads of tasks.

For the rest of you, the first thing to do would be to backup the boinc-client folder. Then open Synaptic Package Manager and Remove, Including Configuration Files, anything installed pertaining to boinc or seti. Then search the system folder for anything leftover. After that, copy the Boinc All-In-One to Home and populate it with the boinc files from your backed-up boinc-client folder. If you're lucky, all the repository files were removed correctly and the Berkeley BOINC will work fine in your Home folder.

If you are already running the Berkeley version of BOINC from your Home folder, then there isn't any trouble updating, or downgrading, the installed version of BOINC. It's a simple matter of replacing the 5 boinc files in your BOINC folder as mentioned earlier.
ID: 1959134 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1959142 - Posted: 7 Oct 2018, 22:10:35 UTC
Last modified: 7 Oct 2018, 22:10:56 UTC

is anyone seeing new tasks yet?

it looks like this system of mine is getting new work: https://setiathome.berkeley.edu/results.php?hostid=8571781

but nothing new for the rest
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1959142 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11379
Credit: 29,581,041
RAC: 66
United States
Message 1959143 - Posted: 7 Oct 2018, 22:13:29 UTC

Ah, the SSP shows some activity. Life may return to normal in a few hours.
ID: 1959143 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1959147 - Posted: 7 Oct 2018, 22:21:59 UTC

we have 14 splitters running now. I think it will be a while until everyone has full caches though.

Thank you to whoever got the splitters running for taking time out of your Sunday!!
ID: 1959147 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1959148 - Posted: 7 Oct 2018, 22:36:35 UTC

The assimilators are NOT running. At what point will this be a problem??

Results returned and awaiting validation is at 5,470,134
and
Workunit files waiting for deletion is at 981,085

I'm assuming at some point the lack of clean up will cause problems.
ID: 1959148 · Report as offensive
ShadowBot 3.75
Volunteer tester

Send message
Joined: 28 Mar 00
Posts: 9
Credit: 23,708,744
RAC: 176
United States
Message 1959150 - Posted: 7 Oct 2018, 22:49:32 UTC - in response to Message 1959148.  

Results returned and awaiting validation isn't really an issue, since that is the number of results that are waiting for other results from users.
ID: 1959150 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1959158 - Posted: 7 Oct 2018, 23:50:43 UTC - in response to Message 1959150.  

Results returned and awaiting validation isn't really an issue, since that is the number of results that are waiting for other results from users.



The Results returned and awaiting validation is usually around 3.5 million, and now it is over 5 million. I don't think we have an extra 1.5 million WUs that are awaiting another result. Resends have been going out as far as I can tell.

I think the results are being validated, but not moving from this bucket to the next bucket. I'm guessing that once a result is validated AND has hung around for 24 hours that it is then moved to the results waiting for db purge.

It probably needs to write the result to the science db before it goes to the db purge bucket, and maybe that is a problem?? but the s@h science db seems to be running, so *shrug*

I know my valids are more like 48 hours instead of the normal 24ish hours though so there will be a point when the db gets too large, I'm just playing "guess when or if it will crash" to keep myself entertained.
ID: 1959158 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1959159 - Posted: 7 Oct 2018, 23:51:27 UTC

If you are lucky enough to snag some of the work from the RTS buffer, it is all for naught because the tasks currently there are all noise bombs.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1959159 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1959160 - Posted: 7 Oct 2018, 23:56:17 UTC - in response to Message 1959159.  

If you are lucky enough to snag some of the work from the RTS buffer, it is all for naught because the tasks currently there are all noise bombs.


Yup. One of my machines had approx a zillion of those (on my 1080s, ran 15 secs rather than 10-15 mins and VALIDATED, so not my machine going bad) just before things (i.e., WUs being sent) went away.

Oddly enough, the other of my 2 crunchers had none of them before the world ended.
ID: 1959160 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1959163 - Posted: 8 Oct 2018, 0:08:38 UTC - in response to Message 1959160.  

I think the noise bombs cleared out finally. I had one machine that blew threw everything it got over a 3 or 4 task request period and they all validated. Finally getting good ones on that machine.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1959163 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1959165 - Posted: 8 Oct 2018, 0:36:59 UTC - in response to Message 1959163.  
Last modified: 8 Oct 2018, 0:39:51 UTC

I think the noise bombs cleared out finally. I had one machine that blew threw everything it got over a 3 or 4 task request period and they all validated. Finally getting good ones on that machine.


. . Overnight I dl'd 106 new tasks on this machine (1050ti) but this morning I only had about as many left as if I had dl'd none. They all evaporated like smoke, so one can only assume they were all noise bombs ...

. . But like you, this morning they seem to be healthier WUs.

. . Now all we need is someone to give the pfb splitter a kick and mount some more tapes in the next 24 hours before the current units are exhausted.

Stephen

:)
ID: 1959165 · Report as offensive
Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 37 · Next

Message boards : Number crunching : Panic Mode On (113) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.