Postponed: Waiting to acquire lock

Message boards : Number crunching : Postponed: Waiting to acquire lock
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 14 · Next

AuthorMessage
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911584 - Posted: 7 Jan 2018, 19:42:55 UTC - in response to Message 1911581.  
Last modified: 7 Jan 2018, 19:46:48 UTC

As I said, the Science App uses the BOINC libraries from bonic-master. The App will not compile without those BOINC libraries. The BOINC API is from those BOINC Libraries, Not the Science App.
libboinc_api.a : /Users/Tom/boinc/api
ID: 1911584 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14656
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1911585 - Posted: 7 Jan 2018, 19:45:35 UTC - in response to Message 1911584.  

As I said, the Science App uses the BOINC libraries from bonic-master. The App will not compile without those BOINC libraries. The BOINC API is from those BOINC Libraries, Not the Science App.
Correct. That's how it's made.

But it lives and runs inside the application. If it has bugs, you can edit, compile, and test fixes. I can liaise with the BOINC developers and pass back any successful fixes for other people to use.
ID: 1911585 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911586 - Posted: 7 Jan 2018, 19:49:40 UTC - in response to Message 1911585.  

You know, there are quite a few other people NOT having this problem. I'm not convinced there is a problem worth addressing.
As far as I know, None of my recommendations have been completed. I'm still waiting.
ID: 1911586 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911594 - Posted: 7 Jan 2018, 20:03:09 UTC - in response to Message 1911586.  

You know, there are quite a few other people NOT having this problem. I'm not convinced there is a problem worth addressing.
As far as I know, None of my recommendations have been completed. I'm still waiting.

I already do today. maybe you miss one of my post.

Dry the cache made some reschedules and see if anything happening
Nothing happening Back to work as normal
Was running the SSE4.1 Builds when i do the test.

But as i tried to explain, the issue is not happening all the time, i just notice 3 or 4 times in the last weeks.
All when the cache was running at the end.
And there is something different now, we don't have a mix of WU just blc.

Please not misunderstand i not know nothing about code just a user who try to test what you guyÅ› tell me to do.

Since i not believe it's the app itself I'm back to the AVX2 builds

If that happening again i will post.
ID: 1911594 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14656
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1911595 - Posted: 7 Jan 2018, 20:08:27 UTC - in response to Message 1911594.  

OK, I was thinking it was about time to head out to the pub - I think we've about ground ourselves down to a halt for tonight, anyway.
ID: 1911595 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1911596 - Posted: 7 Jan 2018, 20:10:15 UTC - in response to Message 1911595.  

I concur.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1911596 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911599 - Posted: 7 Jan 2018, 20:19:16 UTC - in response to Message 1911594.  
Last modified: 7 Jan 2018, 20:24:23 UTC

The Major part of running the cache empty was to Delete the Client_state.xml once the cache was empty so a New Client_state .xml would be created. I suggest turning Off Ubuntu networking before deleting the file from the BOINC folder and removing the slot folders. With networking disabled, move both the Client_state files to the Root level of Home and delete the slot folders. Start BOINC so a new Client_state.xml will be created, then Stop BOINC and look at the new Client_state.xml and see if it has the correct <hostid></hostid> & <rpc_seqno></rpc_seqno>. Use the numbers from the old Client_state.xml for the new one, I think you know the drill. Then Run BOINC Normally with a reduced cache setting, Without any Rescheduling, and see if you have the same trouble.

If you do have the same trouble with BOINC 7.8.3, try it with BOINC 7.4.44 as mentioned previously.
ID: 1911599 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14656
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1911600 - Posted: 7 Jan 2018, 20:23:30 UTC - in response to Message 1911599.  

Why on earth would anyome deliberately destroy all possible chance of recovering evidence? My comment to Jeff last night by PM was that the most likely outcome of following that advice was the dumping of all tasks, with them being marked 'abandoned' on the website.

I'm outa here - I've never liked the 'sledgehammer and two short planks' school of computer maintenance.
ID: 1911600 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911602 - Posted: 7 Jan 2018, 20:26:43 UTC - in response to Message 1911600.  

I guess you missed the part about running the cache empty...somehow.
Drink one for Me, cause I stopped drinking Years ago.
ID: 1911602 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1911608 - Posted: 7 Jan 2018, 20:56:45 UTC

There's absolutely ZERO justification for trashing an existing client_state file based on the evidence accumulated here over the past two days, both from the actual incidents that have occurred in normal use and from the testing scenarios attempted so far. That's awful advice.
ID: 1911608 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911610 - Posted: 7 Jan 2018, 21:02:19 UTC - in response to Message 1911608.  
Last modified: 7 Jan 2018, 21:07:39 UTC

What is going to happen if the cache is EMPTY? Please do tell.
If NOTHING is going to happen, why is it Awful?
Why are you PMing about things that CAN'T Happen with an EMPTY CACHE.
Am I dealing with drunks here or what?
The Current Client_State file has been EDITED countless times, that is very good reason to try a New one, that hasn't been edited.
ID: 1911610 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911612 - Posted: 7 Jan 2018, 21:09:25 UTC

Please guy's not take that personal.

I i'm doing the kill of the client just to take out that from the questions.

Doing that now (i keep my cache very low).

Back ASAP
ID: 1911612 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911616 - Posted: 7 Jan 2018, 21:36:08 UTC
Last modified: 7 Jan 2018, 21:36:25 UTC

OK Done back with a new client_state.xml

Rebuilding the cache now.

Leave running AVX2. Any news i post.
ID: 1911616 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1911619 - Posted: 7 Jan 2018, 21:46:45 UTC - in response to Message 1911616.  

OK Done back with a new client_state.xml
Just to verify that it truly is a "new" file, take a look in your current Event Log and make sure that there's no reference to a client_state_prev file. If it's there, it would appear shortly after startup.
ID: 1911619 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911622 - Posted: 7 Jan 2018, 21:57:15 UTC

I believe is, I stop the Boinc. Move both files files to the desktop. Check nothing was left. And restart the Boinc again.

I see no reference for the old file in the Event Log .
ID: 1911622 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1911624 - Posted: 7 Jan 2018, 22:00:05 UTC - in response to Message 1911622.  

I believe is, I stop the Boinc. Move both files files to the desktop. Check nothing was left. And restart the Boinc again.

I see no reference for the old file in the Event Log .
Okay, since you moved both files out, BOINC wouldn't be able to find either. If you had just removed the current client_state file, BOINC would likely have just reverted to the client_state_prev file, which is essentially a backup.
ID: 1911624 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14656
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1911625 - Posted: 7 Jan 2018, 22:02:13 UTC - in response to Message 1911622.  
Last modified: 7 Jan 2018, 22:04:57 UTC

Did you preserve hostid and rpc_seqno, as the ... as he said? Auto-lookup may have worked, but best of luck.

Edit - looks like you got away with it - preserved stats. Host 8396902. That could have been showing zero everywhere.
ID: 1911625 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911628 - Posted: 7 Jan 2018, 22:04:07 UTC - in response to Message 1911625.  

Did you preserve hostid and rpc_seqno, as the ... as he said? Auto-lookup may have worked, but best of luck.

Yes i know about that. I recover all the info of my host. Not want to lose my nice #2 Place. LOL
ID: 1911628 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1911629 - Posted: 7 Jan 2018, 22:12:20 UTC - in response to Message 1911628.  

Thanks Juan. It will be helpful to know if you still have the same trouble with both BOINC 7.8.3 & the older 7.4.44.
ID: 1911629 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1911633 - Posted: 7 Jan 2018, 22:30:50 UTC - in response to Message 1911629.  
Last modified: 7 Jan 2018, 22:37:03 UTC

Hope that could take some time.
To replicate the problem in real life i will going to wait for the next unscheduled outage or server problem to dry my caches and
Not see the problem appearing after only blc WU where crunching.
To you and all others that could sound idiot but i suspect the "shutdown timing problem" appears when only arecibo vlars are crunching on the GPU & CPU at least all the times when i noticed it was after that happening, at the end of my cache all the WU are Arecibo Vlars and i crunch them on the CPU & GPU since they crunch very fast on my GPU's too.
Will keep working using the rescheduled to try to pass the weekly outage and if the issue happening again will post & try to change to the older boinc.
Anyway i suggest to the devs to look why the GPU apps works with the lockfile and the CPU no. And if possible find a way to correct that.
The hosts & GPUs are each day become faster and maybe today We two only have the problem but who knows the future.
From my side i'm happy now. Now i know why the issue happens and how to bypass and keep my host running.

Thanks Jeff, Richard, TBar, Brent, Keith and all others who help me with this issue.

<edit> Now it's time for few sundays beers. Cheers.
ID: 1911633 · Report as offensive
Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 14 · Next

Message boards : Number crunching : Postponed: Waiting to acquire lock


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.