OpenCL apps are available for download on Lunatics |
![]() |
| log in |
Message boards : Number crunching : OpenCL apps are available for download on Lunatics
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 9 · Next
| Author | Message |
|---|---|
|
You cant compare HD5 app version against non HD5 app. | |
| ID: 1334313 · | |
|
For those having trouble running Cat 13.1. | |
| ID: 1334356 · | |
For those having trouble running Cat 13.1. I don't suppose it would be as easy as: start with clean install of 12.8, run 13.1 install but don't check the box for "APP SDK Runtime" ? IE keep the previous APP runtime? | |
| ID: 1334446 · | |
For those having trouble running Cat 13.1. Its not that hard at all. Took me 20 minutes. Some might want to keep 13.1. for new games. Edit: Its for those who already have 13.1 installed. ____________ | |
| ID: 1334451 · | |
|
1'st task done with 1761 | |
| ID: 1334453 · | |
|
I've noticed a warning message that's been bothering me since r1761: WARNING: can't open binary kernel file for oclFFT plan: C:\Documents and Settings\All Users\Application Data\BOINC/projects/setiathome.berkeley.edu\AP_clFFTplan_ATIRV770_32768_r1766.bin, continue with recompile... On two of my hosts, I get this message with every single work-unit (and, obviously, I haven't purged any binary caches). This is what I've been able to find, relating to this warning message:
I'm just wondering, is this going to be a problem for me? Are these hosts really going to spend CPU cycles recompiling for every single AP WU? | |
| ID: 1339804 · | |
|
Its not really a problem. | |
| ID: 1339855 · | |
I've noticed a warning message that's been bothering me since r1761: Stop boinc locate that file. Does it exist? rename it. start boinc file recreated? compare recreated one with saved one -are they identical? EDIT: Is this result from affected host? http://setiathome.berkeley.edu/result.php?resultid=2841704346 If yes better to stop with ATi AP on that host, it produces invalids. Perhaps low-end HD4xxx no more supported at all. ____________ News about SETI opt app releases: https://twitter.com/Raistmer | |
| ID: 1339857 · | |
|
I'll double-check when I get home, but I have restarted BOINC and the whole system at least a few times since r1761 was first released. I haven't changed the Catalyst drivers, but that's a good point: the affected hosts are using Catalyst 11.12, and none of my other hosts use that particular version. | |
| ID: 1339883 · | |
|
Phew, site is back on-line... | |
| ID: 1340084 · | |
Phew, site is back on-line... Cause "patching required" for oclFFT. It means that after kernel formation oclFFT setup code finds that it can't execute prepared kernels on particular device. And re-creates some of them. There were reports that after binary cache introduction app stopped to work on low-ATi devices (workgroup of 128). So, I disabled cache creation in such case. It will always though such warning but should each time create "right" kernels. But look slike last statement not the case. But not because of binary cache at last. App was compiled with SDK 2.5 so you need corresponding driver to run w/o issues. We can try SDK 2.4-compiled one insttead as debugging effort, but it will take some time to get. ____________ News about SETI opt app releases: https://twitter.com/Raistmer | |
| ID: 1340129 · | |
|
Ah, that makes sense. The 'patching' warning, that is, as both hosts where the .bin cache is not present also exhibit the patching warning. That's fine, as long as I know there's no major problem, I'm satisfied, thanks for the answer. APP SDK 2.5 support is supposed to start with Catalyst 11.7, so both hosts should be okay at Catalyst 11.12, anyway. | |
| ID: 1340149 · | |
Somewhat related, the invalid results you were observing on my HD 4670 earlier, I have a sneaking suspicion - though no valid proof yet - that the invalid results may be caused by CPU starvation. It's a very old Pentium 4 CPU on this host, so I don't have any CPU tasks running on it, but the AstroPulse NV tasks do cause a lot of CPU usage. I'm aware of the high-CPU usage issue in recent GeForce drivers... what's the most recent set of drivers that did not exhibit this high-CPU issue? After waiting so long for AMD to fix Catalyst's high-CPU usage problem, I'm disappointed Nvidia still hasn't fixed theirs... IIRC around 266.58 or so (maybe earlier, someone else can correct me) was the last nv OpenCL 1.0 driver after when they went to multithreaded drivers. These older ones use the blocking style synchronisation semantics required for that kindof coding to use low amounts of CPU. All nVidia OpenCL 1.1 drivers will be expecting the developer to implement GPU callback functionality ( ala. OpenGL/DirectX ) for low CPU usage, as blocking synchronisation is no longer used with multithreaded drivers at low levels (a form of 'deadlock prevention', moving to 'non-blocking' APIs ). ____________ "It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change." Charles Darwin | |
| ID: 1340152 · | |
|
Thanks for that. I haven't done any OpenCL development before, but I think I understand what's going on there. | |
| ID: 1340167 · | |
...After looking around, it seems OpenCL 1.1 was introduced with GeForce 280.26, with the release notes for GeForce 275.33 indicating OpenCL support is at 1.0. You're saying the CPU issue relates to the multi-threading drivers introduced with OpenCL 1.1 or did the switch to multi-threading happen before that?It's a while back now, but i recall a bit of overlap due to the usual transition bugs etc. Also, you say that OpenCL 1.1 is dependent on the developer to implement GPU call-back - is this the case for the current AstroPulse NV application?Not sure which direction Raistmer went in after OpenCL 1.0, but yes. I'm not aware if AMDs OpenCL 1.2 implementation works around this with Cuda like syncs at a higher level, or a change/ammendment to the opencl specification (or both*). [*Including callback functionailty derived from preexisting OpenGL, DirectX and Windows Core Audio technologies] ____________ "It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change." Charles Darwin | |
| ID: 1340169 · | |
|
Thanks for the information. I thought of one more point: I notice that running AP and MB simultaneously on the same NV GPU results in far lower overall CPU usage (on the GPU tasks) than running multiple AP WUs on a single NV card only. I notice this pattern across all hosts, where those hosts also run CPU-only tasks, on recent GeForce drivers. | |
| ID: 1340176 · | |
Does this match your experience of running tasks on NV GPUs? Technically no, because I don't run GPU AP. x41zc is pretty low CPU usage on its own here. ____________ "It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change." Charles Darwin | |
| ID: 1340185 · | |
Thanks for the information. I thought of one more point: I notice that running AP and MB simultaneously on the same NV GPU results in far lower overall CPU usage (on the GPU tasks) than running multiple AP WUs on a single NV card only. I notice this pattern across all hosts, where those hosts also run CPU-only tasks, on recent GeForce drivers. It can be explained if postulate that OpenCL sync uses busy-wait loop while CUDA uses interrupt-based signalling (or smth more modern signalling that I'm not aware of but not busy-wait loop). Then, if GPU busy with both OpenCL AP and CUDA MB kernels there is time for AP kernel to complete before driver thread starts to poll CPU constantly. Hence, lower CPU usage overall. Also, I see BIG drop in CPU usage for the same reason between idle and busy CPU (studied in details on APU, observed on Intel "APU" (Ivy Bridge) and on "CPU consuming" NV drivers on GTX 260). If CPU busy with some work it will take longer time to switch to polling driver thread. Hence, GPU may report completion on first iteration saving CPU cycles for useful work. Unfortunately, in this case CPU can easely become TOO busy to feed GPU unit with new work so, while CPU consumption will be quite slow, GPU load and overall host performance will drop considerably. Hence, freeing core for GPU feeding is some sort of compromise. ____________ News about SETI opt app releases: https://twitter.com/Raistmer | |
| ID: 1340257 · | |
Thanks for the information. I thought of one more point: I notice that running AP and MB simultaneously on the same NV GPU results in far lower overall CPU usage (on the GPU tasks) than running multiple AP WUs on a single NV card only. I notice this pattern across all hosts, where those hosts also run CPU-only tasks, on recent GeForce drivers. Which takes us back to a question which I think has been raised before. Is it possible, or is it not possible, for a developer to code for "GPU callback functionality ( ala. OpenGL/DirectX )" under the OpenCL 1.1 platform on NVidia hardware, using open-source GPL-compliant development environments? If so, could somebody put together a recommended toolset and programming guide for study? | |
| ID: 1340269 · | |
If so, could somebody put together a recommended toolset and programming guide for study? Like the nVidia OpenCL Ocean Demo (from the GPU compute SDK) ? ____________ "It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change." Charles Darwin | |
| ID: 1340271 · | |
Message boards : Number crunching : OpenCL apps are available for download on Lunatics
| Copyright © 2013 University of California |