New rescheduler

Message boards : Number crunching : New rescheduler
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1017548 - Posted: 19 Jul 2010, 12:02:12 UTC - in response to Message 1017529.  

Have changed the BOINC installation to not run as a service. The outage is coming soon and this system needs a lot of jobs ;-)

You could try moving work over from the GPU to the CPU.
Start with a small number, this should get you GPU work as well.


That's exactly what I am doing, but still not getting (requesting) any work :-(
ID: 1017548 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1017552 - Posted: 19 Jul 2010, 12:07:21 UTC - in response to Message 1017548.  

Have changed the BOINC installation to not run as a service. The outage is coming soon and this system needs a lot of jobs ;-)

You could try moving work over from the GPU to the CPU.
Start with a small number, this should get you GPU work as well.


That's exactly what I am doing, but still not getting (requesting) any work :-(

I just trashed my work when the client shut down and restarted itself automatically. But I have already got 550+ tasks again, more than enough.
Not requesting work is usually a different problem. There is plenty of work and the limits are gone.

TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1017552 · Report as offensive
Profile Questor Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 3 Sep 04
Posts: 471
Credit: 230,506,401
RAC: 157
United Kingdom
Message 1017556 - Posted: 19 Jul 2010, 12:19:38 UTC - in response to Message 1017548.  
Last modified: 19 Jul 2010, 12:21:11 UTC

Have changed the BOINC installation to not run as a service. The outage is coming soon and this system needs a lot of jobs ;-)

You could try moving work over from the GPU to the CPU.
Start with a small number, this should get you GPU work as well.


That's exactly what I am doing, but still not getting (requesting) any work :-(


Probably no new work because of the number of errors you are getting.
http://setiathome.berkeley.edu/results.php?hostid=4414471&offset=0&show_names=0&state=5
Shows a lot of -226 errors on both CPU and GPU (CUDA).

ERR_TOO_MANY_EXITS -226

An application has exited prematurely (unexpectedly) more than 99 times without generating a checkpoint, so giving up on that task.
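
To make the rule above concrete, here is a minimal sketch of how such a counter could work. The error code and the "more than 99 times" threshold come from the description above; the class name, the method names, and the idea that a checkpoint resets the counter are assumptions for illustration, not the BOINC client's actual code.

```python
ERR_TOO_MANY_EXITS = -226   # error code quoted above
MAX_PREMATURE_EXITS = 99    # "more than 99 times", per the description above


class TaskWatch:
    """Hypothetical per-task counter of premature exits."""

    def __init__(self):
        self.premature_exits = 0

    def on_checkpoint(self):
        # Assumption: writing a checkpoint counts as progress and resets the counter.
        self.premature_exits = 0

    def on_premature_exit(self):
        """Return ERR_TOO_MANY_EXITS once the task should be abandoned, else None."""
        self.premature_exits += 1
        if self.premature_exits > MAX_PREMATURE_EXITS:
            return ERR_TOO_MANY_EXITS
        return None


watch = TaskWatch()
results = [watch.on_premature_exit() for _ in range(100)]
print(results[98], results[99])
```

In this sketch the 99th exit is still tolerated and the 100th gives up on the task.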

I see you are using 6.10.58. Have you recently upgraded to this version and only now get these problems, and if so, from which version? Perhaps try downgrading?
GPU Users Group



ID: 1017556 · Report as offensive
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1017557 - Posted: 19 Jul 2010, 12:20:04 UTC - in response to Message 1017552.  

After a few hours (?) it just started getting new work!!! :-)
ID: 1017557 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1018376 - Posted: 22 Jul 2010, 7:10:36 UTC - in response to Message 1017557.  

V 1.1

Warning: Make sure the BOINC Manager is shut down before rescheduling. It restarts the BOINC client on some computers.

Changed: A slightly different way to start the BOINC client.
Fixed: When the CPU or GPU was out of work, it sometimes refused to reschedule.
TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1018376 · Report as offensive
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1018378 - Posted: 22 Jul 2010, 7:48:37 UTC - in response to Message 1017556.  


I got those errors after rescheduling, as I mentioned a few posts earlier.
All my CPU jobs errored out.

Running 6.10.58 on some systems now without any problem. Seems to be a good stable version.

Have changed the BOINC installation to not run as a service. The outage is coming soon and this system needs a lot of jobs ;-)

You could try moving work over from the GPU to the CPU.
Start with a small number, this should get you GPU work as well.




That's exactly what I am doing, but still not getting (requesting) any work :-(


Probably no new work because of the number of errors you are getting.
http://setiathome.berkeley.edu/results.php?hostid=4414471&offset=0&show_names=0&state=5
Shows a lot of -226 errors on both CPU and GPU (CUDA).

ERR_TOO_MANY_EXITS -226

An application has exited prematurely (unexpectedly) more than 99 times without generating a checkpoint, so giving up on that task.

I see you are using 6.10.58. Have you recently upgraded to this version and only now get these problems, and if so, from which version? Perhaps try downgrading?


ID: 1018378 · Report as offensive
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1018379 - Posted: 22 Jul 2010, 7:49:47 UTC - in response to Message 1018376.  

Will try this version after the current outage (and report back).
ID: 1018379 · Report as offensive
TheFreshPrince a.k.a. BlueTooth76
Joined: 4 Jun 99
Posts: 210
Credit: 10,315,944
RAC: 0
Netherlands
Message 1018509 - Posted: 22 Jul 2010, 21:04:12 UTC - in response to Message 1018379.  

Will try this version after the current outage (and report back).


Hey HansR!
Always nice to see another Dutch Power Cow on the forum :P
Rig name: "x6Crunchy"
OS: Win 7 x64
MB: Asus M4N98TD EVO
CPU: AMD X6 1055T 2.8(1,2v)
GPU: 2x Asus GTX560ti
Member of: Dutch Power Cows
ID: 1018509 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1018632 - Posted: 23 Jul 2010, 10:20:07 UTC - in response to Message 1018509.  

V 1.2

Added: A block on the BOINC client, to stop it from running while rescheduling. The BOINC Manager sometimes restarts the client.
Added: A warning in the log when the BOINC Manager is running.
Added: Rescheduling now uses the CPU and GPU ratio. The ratio is used by SETI to correct runtime estimates.
Changed: Rescheduling on the regular SETI installation is now permitted (with two different plan classes running).
Fixed: Several small bugs.
TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1018632 · Report as offensive
Profile Hellsheep
Volunteer tester

Joined: 12 Sep 08
Posts: 428
Credit: 784,780
RAC: 0
Australia
Message 1018888 - Posted: 23 Jul 2010, 21:46:47 UTC - in response to Message 1018632.  

V 1.2

Added: A block on the BOINC client, to stop it from running while rescheduling. The BOINC Manager sometimes restarts the client.
Added: A warning in the log when the BOINC Manager is running.
Added: Rescheduling now uses the CPU and GPU ratio. The ratio is used by SETI to correct runtime estimates.
Changed: Rescheduling on the regular SETI installation is now permitted (with two different plan classes running).
Fixed: Several small bugs.


What happens now that VLARs don't go to GPUs?
- Jarryd
ID: 1018888 · Report as offensive
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1019125 - Posted: 24 Jul 2010, 7:04:05 UTC - in response to Message 1018632.  

Version 1.2 breaks something: after rescheduling, the BOINC Manager sees no projects at all and no clients running.

Repairing the BOINC installation fixes the problem: everything is back (projects, clients running).

Running the rescheduler V 1.2 again: same problem.

A repair fixes this again.

Switched back to V 0.9: no problem.
ID: 1019125 · Report as offensive
Profile Gundolf Jahn

Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 1019126 - Posted: 24 Jul 2010, 7:51:53 UTC - in response to Message 1019125.  

Repairing the BOINC installation fixes the problem: everything is back (projects, clients running).

Perhaps an "Advanced/Select Computer..." would have done it too?

Or a "File/Exit" and restart of the manager?

Gruß,
Gundolf
ID: 1019126 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1019137 - Posted: 24 Jul 2010, 9:23:15 UTC - in response to Message 1019126.  

V 1.3

Fixed: Program crashes when SETI beta is present and regular SETI is out of work, after pressing Run, even when no rescheduling is required.
Fixed: Sometimes the BOINC client was blocked from running.

TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1019137 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1019138 - Posted: 24 Jul 2010, 9:25:56 UTC - in response to Message 1019125.  

Version 1.2 breaks something: after rescheduling, the BOINC Manager sees no projects at all and no clients running.

Repairing the BOINC installation fixes the problem: everything is back (projects, clients running).

Running the rescheduler V 1.2 again: same problem.

A repair fixes this again.

Switched back to V 0.9: no problem.

Too much protection prevented the BOINC client from running.
1.3 fixed that, but closing the rescheduler unlocks everything as well.
A repair shouldn't be necessary, as there is nothing wrong with the state file.

TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1019138 · Report as offensive
Profile [DPC] hansR Project Donor
Volunteer tester
Joined: 14 Jul 00
Posts: 47
Credit: 235,829,569
RAC: 8
Netherlands
Message 1019149 - Posted: 24 Jul 2010, 10:28:34 UTC - in response to Message 1019138.  

Neither the "select computer" nor the restart fixed it.
Closing the rescheduler might have solved it.

I can't try V 1.3 until Monday, because my remote login gets stuck if I try to download the rescheduler.



ID: 1019149 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1019452 - Posted: 25 Jul 2010, 13:21:07 UTC - in response to Message 1019149.  

V 1.4

Added: An "Other" tab, for rescheduling other tasks, such as Astropulse -> ATI GPU.

TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1019452 · Report as offensive
Terror Australis
Volunteer tester

Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1019465 - Posted: 25 Jul 2010, 14:45:44 UTC

Hi Fred
A couple of suggestions.
1) An option NOT to restart BOINC after a reschedule. Sometimes for maintenance purposes I use your app while the client is not running and do not want it to restart afterwards.

2) An easier way to move all the units to the CPU (similar in operation to Marius's slider bar). Once again this is for maintenance purposes.

There are times, e.g. when changing the CUDA app version or the video card, when you need to be able to do things this way to avoid trashing units.

You may have already added these, I'm still using V0.6. It's been hard to keep up with as things have been changing so rapidly. :-)

Thanks for your efforts on this.

T.A.
ID: 1019465 · Report as offensive
Profile BilBg
Volunteer tester
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1019466 - Posted: 25 Jul 2010, 14:46:30 UTC


What happens if these two options are conflicting?:

"If GPU tasks are less than" 40 "fill up to" 60 "Tasks": Moves tasks to the GPU when the current total is less than 40. Move enough tasks to end up with 60 tasks.
"If CPU tasks are less than" 20 "fill up to" 30 "Tasks": Moves tasks to the CPU when the current total is less than 20. Move enough tasks to end up with 30 tasks.

Will it loop e.g. if you have total of 62 tasks
GPU: 20 - CPU: 42 -> GPU fill to 60 =
GPU: 60 - CPU: 2 -> CPU fill to 30 =
GPU: 32 - CPU: 30 -> GPU fill to 60 =
GPU: 60 - CPU: 2 -> CPU fill to 30 =
..........


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1019466 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1019472 - Posted: 25 Jul 2010, 14:52:11 UTC - in response to Message 1019466.  


What happens if these two options are conflicting?:

"If GPU tasks are less than" 40 "fill up to" 60 "Tasks": Moves tasks to the GPU when the current total is less than 40. Move enough tasks to end up with 60 tasks.
"If CPU tasks are less than" 20 "fill up to" 30 "Tasks": Moves tasks to the CPU when the current total is less than 20. Move enough tasks to end up with 30 tasks.

Will it loop e.g. if you have total of 62 tasks
GPU: 20 - CPU: 42 -> GPU fill to 60 =
GPU: 60 - CPU: 2 -> CPU fill to 30 =
GPU: 32 - CPU: 30 -> GPU fill to 60 =
GPU: 60 - CPU: 2 -> CPU fill to 30 =
..........


No, it will not loop. The GPU takes first pick; what's left is for the CPU.
It takes the minimum on both the GPU and the CPU into account.
You can simply try it by pressing Test.
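
As a rough illustration of why a single GPU-first pass cannot loop, here is a minimal sketch using the numbers from the question. The function name, its parameters, and the exact way the minimum on the other side is honored are assumptions made for this example, not the rescheduler's actual code.

```python
def reschedule(gpu, cpu, gpu_min, gpu_fill, cpu_min, cpu_fill):
    """One pass only: GPU first, then CPU with what is left. Returns (gpu, cpu)."""
    # GPU takes first pick: top up towards gpu_fill when below gpu_min.
    if gpu < gpu_min:
        spare = max(0, cpu - cpu_min)        # assumption: never drain the CPU below its minimum
        moved = min(gpu_fill - gpu, spare)
        gpu, cpu = gpu + moved, cpu - moved
    # The CPU is served afterwards, from whatever the GPU can spare.
    if cpu < cpu_min:
        spare = max(0, gpu - gpu_min)        # assumption: never drain the GPU below its minimum
        moved = min(cpu_fill - cpu, spare)
        gpu, cpu = gpu - moved, cpu + moved
    return gpu, cpu


# 62 tasks in total, "less than 40 -> fill to 60" (GPU), "less than 20 -> fill to 30" (CPU)
print(reschedule(20, 42, 40, 60, 20, 30))   # prints (42, 20)
```

Under these assumptions the GPU gets 22 tasks rather than 40, because the CPU is not pushed below its own minimum of 20, and the pass ends there.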
TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1019472 · Report as offensive
Profile S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1019474 - Posted: 25 Jul 2010, 14:58:37 UTC - in response to Message 1019465.  

Hi Fred
A couple of suggestions.
1) An option NOT to restart BOINC after a reschedule. Sometimes for maintenance purposes I use your app while the client is not running and do not want it to restart afterwards.

2) An easier way to move all the units to the CPU (similar in operation to Marius's slider bar). Once again this is for maintenance purposes.

There are times, e.g. when changing the CUDA app version or the video card, when you need to be able to do things this way to avoid trashing units.

You may have already added these, I'm still using V0.6. It's been hard to keep up with as things have been changing so rapidly. :-)

Thanks for your efforts on this.

T.A.

1) I can do that. I will add a START and a STOP button and a check box to not start the client. You could also use the simulation mode and move over the new client state when needed.
2) There is a reason. The server keeps tabs on the CPU and GPU work, and with a slider you keep moving work from one to the other too easily.
The approach I took tries to reschedule as little as possible. It's less confusing for the server and doesn't mess up the correction ratios, which may be switched on at any time.

A reason to update: there are some bug fixes in the later versions.......

TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1019474 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.