Rescheduling - Please don't beat me - read first!

Message boards : Number crunching : Rescheduling - Please don't beat me - read first!
Message board moderation

To post messages, you must log in.

AuthorMessage
Ulrich Metzner
Volunteer tester
Avatar

Send message
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 1446963 - Posted: 26 Nov 2013, 10:14:48 UTC

Hello there,

i have the following problem: I've got a big bunch of astropulse workunits scheduled for intel GPU (Server sees them as CPU) which are way to much and the intel GPU will be not able to crunch all of them on time. I already stopped BOINC to fetch AP units and think now of rescheduling the surplus AP units from intel GPU to Nvidia GPU. Unfortunately i found no way to reschedule between GPUs, only from CPU to GPU and vice versa. Any advice here, because i don't want to trash the WUs thereby annoying my wingmen?
Aloha, Uli

ID: 1446963 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1446966 - Posted: 26 Nov 2013, 10:38:45 UTC

Personally I wouldn't do anything, just let the system work it out. Any that you can't complete in time will be sent out again to others.

Cheers.
ID: 1446966 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1446976 - Posted: 26 Nov 2013, 11:38:26 UTC - in response to Message 1446963.  
Last modified: 26 Nov 2013, 11:44:40 UTC

Unfortunately i found no way to reschedule between GPUs, only from CPU to GPU and vice versa


I think you can do this in 2 steps:
Intel GPU -> CPU
CPU -> Nvidia GPU

(by 'Preferred use:' - I think you'll need existing CPU AP app in app_info.xml
After adding CPU AP app to app_info.xml you have to restart BOINC so new info from app_info.xml is entered by BOINC to client_state.xml

First use 'Simulation mode, only create a new state file' to see/compare the changes in client_state.xml
)


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1446976 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1446990 - Posted: 26 Nov 2013, 12:43:38 UTC - in response to Message 1446966.  

Personally I wouldn't do anything, just let the system work it out. Any that you can't complete in time will be sent out again to others.

That's the best way to go, if you are sure you can't cruch them all, you can eventually abort some, so they can be send out sooner. But actually you don't need to do anything, BOINC is designed to handle such thing on it's own.
ID: 1446990 · Report as offensive
Ulrich Metzner
Volunteer tester
Avatar

Send message
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 1446998 - Posted: 26 Nov 2013, 14:22:35 UTC

Thank you all for your suggestions.
Greatly appreciated!
Aloha, Uli

ID: 1446998 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1447001 - Posted: 26 Nov 2013, 14:46:29 UTC - in response to Message 1446963.  
Last modified: 26 Nov 2013, 15:06:14 UTC

Hello there,

i have the following problem: I've got a big bunch of astropulse workunits scheduled for intel GPU (Server sees them as CPU) which are way to much and the intel GPU will be not able to crunch all of them on time. I already stopped BOINC to fetch AP units and think now of rescheduling the surplus AP units from intel GPU to Nvidia GPU. Unfortunately i found no way to reschedule between GPUs, only from CPU to GPU and vice versa. Any advice here, because i don't want to trash the WUs thereby annoying my wingmen?

If you aren't over your GPU limit of 100, you can set your preferences for Use NVIDIA GPU only and then remove the wanted files from the client_state.xml. They will be resent as lost tasks to the device your preferences indicate. Anything over the limit will be listed as Timed out - no response and sent to others.

First, set the preferences to Use Only NVIDIA GPU and then hit the Update button a few times until it is asking for NVIDIA. Select the AP files you want to transfer and Suspend them. Shut down BOINC and wait about 15 seconds. Open the client_state.xml with notepad and search for suspended. Remove the entire results entry for each Suspended file, everything between and including <result> ... </result>. Save the client_state.xml. Don't make any mistakes or the entire file will be rejected and everything resent. Launch BOINC, then hit the Update button until all Lost Tasks are resent in groups of 20. Then adjust your preferences to normal.
ID: 1447001 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1447236 - Posted: 27 Nov 2013, 3:12:09 UTC

Uli, I don't leave anything up to Boinc as it tends at times to over fetch and can't complete things on time.
I just suspend the currently running ones and let the others run up to a few percent. That way you can complete them and the server is not any wiser, as it thinks you started them on time.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1447236 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22200
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1447285 - Posted: 27 Nov 2013, 5:58:47 UTC

Its not starting on time but completing on time.

Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1447285 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1447292 - Posted: 27 Nov 2013, 6:22:35 UTC

Sorry Rob. but it has worked for me.
The server will only cancel not started on time. So if you get a few percent in, you are fine.
Rob did you do a test run like I did?
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1447292 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22200
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1447295 - Posted: 27 Nov 2013, 6:29:20 UTC

yes, I had a few on a retiring cruncher that timed out, not started on time, and they were aborted by the server.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1447295 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1447296 - Posted: 27 Nov 2013, 6:30:10 UTC

The only thing is that doing that is what winds up getting some w/u's stuck in limbo.

Cheers.
ID: 1447296 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1447300 - Posted: 27 Nov 2013, 6:32:55 UTC

They timed out, because they were not started in time.
Thus my fiddle.

Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1447300 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1447316 - Posted: 27 Nov 2013, 6:52:19 UTC - in response to Message 1447296.  

The only thing is that doing that is what winds up getting some w/u's stuck in limbo.

Cheers.

Sorry Wiggo, I only have one stuck.
It is part of a known problem.

I was born with black hair, then lost it all and returned as a blond with curls.
Now I am a Brunette with some gray.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1447316 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1447321 - Posted: 27 Nov 2013, 6:57:10 UTC


I was born with black hair, then lost it all and returned as a blond with curls.
Now I am a Brunette with some gray.

Myself I'm grey with a little brunette. ;-)

Cheers.
ID: 1447321 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1447350 - Posted: 27 Nov 2013, 9:18:25 UTC - in response to Message 1447316.  

The only thing is that doing that is what winds up getting some w/u's stuck in limbo.

Cheers.

Sorry Wiggo, I only have one stuck.
It is part of a known problem.

Yes, and since we know the problem, we also know what to do for it should not occur more often than really unavoidable: abort all WUs which we can't finish in time, i.e. before the replacement result is send out. Most of the resends will be crunched successfully, so crunching timed out WUs is not only causing a problem on the server side, but is just a plain waste of own resources and definitely nothing that should be recommended IMHO.
ID: 1447350 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1447352 - Posted: 27 Nov 2013, 9:29:57 UTC

I'm sorry Uli but as much as I look I just can't see your stuck 1 in your task list.

Can you give us a link to it?

Cheers.
ID: 1447352 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1447833 - Posted: 28 Nov 2013, 8:23:00 UTC

http://setiathome.berkeley.edu/workunit.php?wuid=1265039314
Happy to help Wiggo.
Just part of the known problem.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1447833 · Report as offensive
Ulrich Metzner
Volunteer tester
Avatar

Send message
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 1447837 - Posted: 28 Nov 2013, 9:23:40 UTC - in response to Message 1447001.  


If you aren't over your GPU limit of 100, you can set your preferences for Use NVIDIA GPU only and then remove the wanted files from the client_state.xml. (...)

Thanks a lot, that worked and the resent files are also correctly marked as Nvidia! :)
Aloha, Uli

ID: 1447837 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1447844 - Posted: 28 Nov 2013, 10:08:19 UTC - in response to Message 1447833.  

http://setiathome.berkeley.edu/workunit.php?wuid=1265039314
Happy to help Wiggo.
Just part of the known problem.

Ah yes, that was when that script was run on the servers.

We maybe waiting a while on those to be corrected Uli (hopefully at the same time the old V6 Enhanced database gets cleaned out).

Cheers
ID: 1447844 · Report as offensive

Message boards : Number crunching : Rescheduling - Please don't beat me - read first!


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.