ATI OpenCL MultiBeam 6.10 problem..
Author | Message |
---|---|
Richard Haselgrove Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
Aaaaaarrgghh! He's using v6.10.58, and that doesn't do ATI properly yet - no ati_compare. BOINC v6.12 is still in testing, but close to "release candidate" status - and probably in a better state than the ATI drivers and OpenCL application you're trying to use. Click the all versions link on the normal download page. |
Claggy Joined: 5 Jul 99 Posts: 4654 Credit: 47,537,079 RAC: 4 |
...btw Claggy, i did put the alternative cc_config.xml file you suggested above into the BOINC data directory after removing the one with the <use_all_gpus> directive in it. and that actually caused all tasks to error out, including the first one, which i wasn't having a problem with before. Which device number did you put in instead of 'n'? i think it should be 0, i initially said 1, ATI GPU 0: ATI Radeon HD 2300/2400/3200 (RV610) (CAL version 1.4.900, 341MB, 56 GFLOPS peak) The confusion crept in because with OpenCL there's only one device, and there the HD5870 is device 0: Stderr output Claggy |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
yes, i used device "0" in the cc_config.xml you suggested to me. it read exactly like this: <cc_config> anyways, let me try BOINC v6.12.18 when i get home and see if it works in conjunction with the cc_config.xml fix. by the way, i have lots of tasks in the queue - am i going to lose all of those when i update BOINC? can i just install v6.12.18 over v6.10.58, or do i have to treat it like i would drivers (uninstall the old version first, then install the new one)? |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
anyways, let me try BOINC v6.12.18 when i get home and see if it works in conjunction with the cc_config.xml fix. by the way, i have lots of tasks in the queue - am i going to lose all of those when i update BOINC? can i just install v6.12.18 over v6.10.58, or do i have to treat it like i would drivers (uninstall the old version first, then install the new one)? I suggest you use either v6.12.18, or a cc_config.xml file - not both at once. I've lost count of the number of times I've installed test versions of BOINC v6.12 - probably over 50, given the number in my download archive and the number of machines I test on. Not lost any tasks yet. But, having said that, take precautions if you feel the need. Disable network activity, shut down BOINC, and back up the entire "D:\Documents and Settings\All Users\Application Data\BOINC" folder structure, including all sub-folders. As you'll have gathered from your experiences to date, care and attention to detail are important when you venture into the realms of test and non-standard applications. Make sure you download the correct version (32-bit or 64-bit) of BOINC - the normal page detects the OS for you, on the test page you have to do it yourself. By "correct", I mean the same as you had before (32-bit it looks like - but check yourself) - that makes the upgrade easier. And read the installer screens - check in particular that that D:\... data drive has been picked up correctly. |
Claggy Joined: 5 Jul 99 Posts: 4654 Credit: 47,537,079 RAC: 4 |
You can install Boinc 6.12.18 straight over the top of 6.10.58, you shouldn't lose any Wu's, But to be safe, suspend all the Wu's first, then you can see which GPU is detected, before unsuspending a few to see if they run correctly, (only suspend the running tasks after you've suspended the unstarted tasks, Boinc 6.10.58 has a bug where if you suspend everything, as Boinc ripples through the tasks, some of them get marked as started, but the slot for that Wu doesn't get filled, so it errors out next time that Wu is attempted) Remember 6.12.18 is an Alpha build, it may still have unreported bugs in it, the only Bugs i've reported are that it doesn't close Collatz Cuda tasks down on exit, and errors out the next task if i suspend a running Cuda task, both issues are probably due to my 128Mb 8400M GS being very slow, and lacking in memory, Claggy Edit: or do as Richard suggests, |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
I suggest you use either v6.12.18, or a cc_config.xml file - not both at once. well as i stated previously, i had problems using the suggested cc_config.xml file w/ BOINC v6.10.58. so i suppose the next logical step is to give v6.12.18 a try without the help of a cc_config.xml file. if i still have problems, i'll add the following cc_config.xml file to the BOINC data directory and see if that fixes things: <cc_config> thanks again guys for the guidance. |
BilBg Joined: 27 May 07 Posts: 3720 Credit: 9,385,827 RAC: 0 |
I suggest you use either v6.12.18, or a cc_config.xml file - not both at once. How can a line like this: <ignore_ati_dev>n</ignore_ati_dev> "fix things"?? "n" has to be substituted by 0 or 1 or 2 ..., e.g.: <ignore_ati_dev>0</ignore_ati_dev> or <ignore_ati_dev>1</ignore_ati_dev> - ALF - "Find out what you don't do well ..... then don't do it!" :) |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
yes i know...that "n" should be a "0" since GPU_0 is my HD 3300 integrated video and GPU_1 is my HD 5870. i tried to edit my above post to reflect that, but by the time i realized it, it was too late...i can no longer edit that post. if you look up 5 or 6 posts, you'll see that i did in fact try that cc_config.xml file under BOINC v6.10.58 to no avail (and yes i remembered to replace n with 0 when i tried it). |
Gundolf Jahn Joined: 19 Sep 00 Posts: 3184 Credit: 446,358 RAC: 0 |
(and yes i remembered to replace n with 0 when i tried it) Did you also try to replace n with 1, in case the numbering in message 1085649 was the right one? Gruß, Gundolf |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
(and yes i remembered to replace n with 0 when i tried it) no i haven't, b/c the BOINC start-up dialogue told me that GPU_0 was in fact the HD 3300 onboard video (which is of course the one i would like to disable in BOINC). nevertheless, i suppose i should try the cc_config.xml file with the number n=1 before i update BOINC to v6.12.18 just to be sure...i'll do that first and let you know if anything positive comes of it. |
Raistmer Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
It's still in testing, but close to "release candidate" status - and probably in a better state than the ATI drivers and OpenCL application you're trying to use. I would not put all together. App status is not "release candidate", it's released :P |
Raistmer Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
And speaking more seriously, a "release candidate" could not mix such different cards and provide a single app plan for them, knowing that HD3xxx is only Brook/CAL capable. Do we have an OpenCL plan class already? W/o it I use ati13ati as a bandage... |
BilBg Joined: 27 May 07 Posts: 3720 Credit: 9,385,827 RAC: 0 |
I read all posts before posting mine. But the last of yours sounded like you want to say: "because I tried <ignore_ati_dev>0</ignore_ati_dev> (5 posts above) and it has no effect I will try <ignore_ati_dev>n</ignore_ati_dev> in hope it will (magically) fix things (maybe BOINC knows what to disable and will auto-choose the "n" device)" - ALF - "Find out what you don't do well ..... then don't do it!" :) |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
I read all posts before posting mine. lol sorry, i didn't mean to give you the impression that i literally thought the letter "n" would fix my problem. at any rate, i think the problem is solved, but i'm going to let a few WU's run to completion just to be sure. i went ahead and replaced n=0 with n=1 despite the fact that BOINC's start-up dialogue is telling me the GPU_0 is my HD 3300 onboard video. with the <ignore_ati_dev>1</ignore_ati_dev> directive in the cc_config.xml file, S@H MB GPU tasks are no longer erroring out. granted, my 5870 is still only crunching one task at a time, but the important thing is that when i resume a 2nd task while one is already crunching, it goes to "ready to start" status, rather than erroring out the instant i resume it. BilBg pointed out earlier in the thread how to run 2 tasks at once, but i'm going to hold off on that until i know everything is stable and without errors. |
BilBg Joined: 27 May 07 Posts: 3720 Credit: 9,385,827 RAC: 0 |
BilBg pointed out earlier in the thread how to run 2 tasks at once, but i'm going to hold off on that until i know everything is stable and without errors. Raistmer (as the main/only programmer of the ATI apps) can tell you exactly what to change. <count> is one of the things to change but "there is another" (Yoda) :) Do you use GPU-Z to monitor temperature and load of the GPUs? http://www.techpowerup.com/gpuz/ - ALF - "Find out what you don't do well ..... then don't do it!" :) |
Sunny129 Joined: 7 Nov 00 Posts: 190 Credit: 3,163,755 RAC: 0 |
Raistmer (as the main/only programmer of the ATI apps) can tell you exactly what to change. thanks for the tip Jedi Master ;) on that note, none of my S@H GPU tasks are currently "suspended" (i.e. they are all "ready to start" in the queue) while waiting for the 1 active task to finish before the next one begins, and i'm getting zero errors! yay! the only other issue i've encountered since adding the cc_config.xml is a Milkyway@Home issue. i figured it would be a good idea to test out the one other GPU app i use, and upon resuming my MW@H tasks, i noticed that my 5870 only crunches 1 MW@H task at a time now, when it used to crunch 2 at a time. it really doesn't bother me, as it used to take ~3:00 min. to crunch 2 MW@H tasks, and now it takes ~1:30 min. to crunch 1 MW@H task. so it seems that despite going from crunching 2 MW@H tasks at a time to crunching only 1 at a time, my 5870's MW@H crunching efficiency is still essentially the same as before. so like i said, it really doesn't bother me since the "average time per WU" hasn't changed, but i am still curious as to why that cc_config.xml file made a difference in the number of active simultaneous MW@H tasks. Do you use GPU-Z to monitor temperature and load of the GPUs? actually i'm using MSI Afterburner b/c it allows me to both monitor vitals AND manipulate them, whereas GPU-Z only allows me to monitor. sometimes i run them both simultaneously out of paranoia. but rest assured that i'm keeping temps under control. |
dnolan Joined: 30 Aug 01 Posts: 1228 Credit: 47,779,411 RAC: 32 |
To run 2 at a time, you would want to change [edit] in app_info.xml [\edit] <count>1</count> to <count>0.5</count> and change <cmdline></cmdline> to <cmdline>-instances_per_device 2</cmdline> or add the -instances_per_device 2 into whatever other commands you already have. -Dave |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
May I, yet again, plead that you - everyone - try to understand what is going on, and propose changes that are appropriate both to the hardware being used, and the skill level of the person using it? We all might learn something then. If setting <ignore_ati_dev>1</ignore_ati_dev> results in the device which BOINC itself describes as device 0 being ignored, and device 1 being used without error, then that's good to know. I'm not quite sure how we're going to write it up in the FAQ, though. But, what do the startup Event Log (now you're using v6.12) messages say? Which device is flagged for use, and which is being ignored? What does the stderr_txt of the successful tasks say about the device they ran on? |
Claggy Joined: 5 Jul 99 Posts: 4654 Credit: 47,537,079 RAC: 4 |
Richard, it's even better than that, Sunny129 is still using Boinc 6.10.58, the site is showing he's got a CAL ATI Radeon HD 2300/2400/3200 (RV610) (341MB) driver: 1.4.900, and he's crunching with OpenCL on his disabled HD5870 via app_info, What i want to know is why Wu's errored out when he disabled the HD3300, Claggy |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
Richard, it's even better than that... Oh, I give up on ATI. I'm going to bed ;-) |
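
The body of the cc_config.xml file quoted in the posts above did not survive, so here is a minimal sketch of what such a file might look like. It is not the poster's actual file. The `<options>` wrapper is an assumption based on the usual BOINC client configuration layout, and the device index is the value the thread ended up with on this particular host under v6.10.58. Check the BOINC startup messages on your own machine before copying it.

```xml
<!-- cc_config.xml, placed in the BOINC data directory.
     Sketch only: the <options> wrapper is assumed, not shown in the thread. -->
<cc_config>
  <options>
    <!-- Index of the ATI device BOINC should ignore.
         On the host discussed here, 1 was the value that stopped tasks
         erroring out, even though BOINC listed the onboard GPU as device 0.
         Replace it with the index that applies to your own hardware. -->
    <ignore_ati_dev>1</ignore_ati_dev>
  </options>
</cc_config>
```

As Richard suggests, try this or a newer BOINC version, not both at once, and restart the client after editing the file so the change is picked up.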
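
dnolan's two changes for running 2 tasks at a time could look roughly like this inside app_info.xml. This is a hedged fragment, not a complete file. The `<count>` and `<cmdline>` values come from his post, while the surrounding `<app_version>` and `<coproc>` elements and the `ATI` type are assumptions about how an existing anonymous-platform file is laid out. Leave everything else in your file exactly as it is.

```xml
<!-- Fragment of app_info.xml (sketch). Only the two values from dnolan's
     post are changed. The surrounding elements are assumed to exist already. -->
<app_version>
  <!-- ... existing app_name, version_num and other entries unchanged ... -->

  <!-- Tell the application to run two instances on the same GPU.
       If a cmdline already exists, append this switch to it. -->
  <cmdline>-instances_per_device 2</cmdline>

  <coproc>
    <type>ATI</type>
    <!-- Each task claims half a GPU, so BOINC schedules two at once. -->
    <count>0.5</count>
  </coproc>

  <!-- ... existing file_ref entries unchanged ... -->
</app_version>
```

Both values need to change together: `<count>` controls how many tasks BOINC starts per GPU, and the command-line switch tells the application how many instances to expect on each device.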