Running SETI@home on an nVidia Fermi GPU
Author | Message |
---|---|
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Thanks Richard. According to the Reschedule 1.9 log, no tasks need to be moved: User testing for a reschedule CPU tasks: 0 (0 VLAR, 0 VHAR) GPU tasks: 0 (0 VLAR, 0 VHAR) No reschedule needed |
Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0 |
This is the angle range for that work unit: WU true angle range is : 0.012972. Task 1635808649 is definitely a VLAR. PROUD MEMBER OF Team Starfire World BOINC |
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
In that case I'm most surprised that ReSchedule 1.9 says that there are no VLARs/VHARs. Under settings I've set the paths as: BOINC data path C:\ProgramData\BOINC. I'm using Windows 7 Ultimate. Tasks are taking around 1 hour 22 minutes. I'm not sure why, but I can't move them to the CPU. |
Joined: 19 Sep 00 Posts: 3184 Credit: 446,358 RAC: 0 |
Perhaps Reschedule can't find them because they have the "wrong" plan_class (or whatever tag is appropriate). Regards, Gundolf Computers aren't everything in life. (Just kidding) SETI@home classic workunits 3,758 SETI@home classic CPU time 66,520 hours |
TheFreshPrince a.k.a. BlueTooth76 Joined: 4 Jun 99 Posts: 210 Credit: 10,315,944 RAC: 0 |
I got work for my Fermi today :) Now running 2 WUs on a GTX470 (607 @ 751 MHz and 1.000 V). I didn't get the red "app_info" messages, and I run the Lunatics apps :) It's actually an Asus ENGTX470, but I use the MSI Afterburner software for overclocking and fan regulation. It works perfectly with the Asus, and it's free to download from the MSI site :) |
TheFreshPrince a.k.a. BlueTooth76 Joined: 4 Jun 99 Posts: 210 Credit: 10,315,944 RAC: 0 |
I got work for my Fermi today :) I think I should have been more clear ;) From what I read my app_info.xml is correct. I use the Lunatics apps for the CPU only, Fermi is using the "standard" app. <app_info> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>AK_v8b_win_x64_SSSE3x.exe</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_intelx86</platform> <file_ref> <file_name>AK_v8b_win_x64_SSSE3x.exe</file_name> <main_program/> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>AK_v8b_win_x64_SSSE3x.exe</file_name> <main_program/> </file_ref> </app_version> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>libfftw3f-3-1-1a_upx.dll</name> <executable/> </file_info> <file_info> <name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</name> <executable/> </file_info> <file_info> <name>cudart32_30_14.dll</name> <executable/> </file_info> <file_info> <name>cufft32_30_14.dll</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <avg_ncpus>0.300000</avg_ncpus> <max_ncpus>0.300000</max_ncpus> <flops>57462450464</flops> <plan_class>cuda_fermi</plan_class> <file_ref> <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>libfftw3f-3-1-1a_upx.dll</file_name> </file_ref> <coproc> <type>CUDA</type> <count>0.5</count> </coproc> </app_version> </app_info> |
TheFreshPrince a.k.a. BlueTooth76 Joined: 4 Jun 99 Posts: 210 Credit: 10,315,944 RAC: 0 |
:P Sorry :P |
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Perhaps reschedule can't find them because they have the "wrong" plan_class (or whatever tag is appropriate). How can I tell whether this info has been set, and set correctly? |
Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0 |
Speedy, the rescheduler won't work with the Fermi. It only recognizes 6.08 and 6.09; it cannot handle the 6.10 Fermi plan_class. PROUD MEMBER OF Team Starfire World BOINC |
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
OK, thanks. I'm running them on my GPU; they are taking about 1 hour 24 minutes each. I have another 2 after this one completes; it's 18% done. So am I correct in saying there's no way to send tasks from a GTX 470 GPU to the CPU? |
Joined: 3 Sep 04 Posts: 471 Credit: 230,506,401 RAC: 157 |
Speedy, the rescheduler won't work with the Fermi. It only recognizes 6.08 and 6.09; it cannot handle the 6.10 Fermi plan_class. My Fermi machine had almost run out of GPU tasks but had plenty of suitable CPU tasks, so I did a bit of testing on this. The Reschedule tool seems to work OK with all tasks as long as you are rebranding from GPU to CPU. The problem occurs when you rebrand from CPU to GPU. The issue seems to be more about the plan_class entry than the version number: 608 tasks were plan_class cuda; 609 tasks are plan_class cuda_23 (although some people have left 609 as cuda); 610 tasks are plan_class cuda_fermi. When moving tasks from CPU to GPU, Reschedule has to add a plan_class entry, as none exists for CPU tasks. After taking a full backup and stopping network access, I ran Reschedule to move tasks from CPU to GPU and examined the client_state.xml file. There are 3 relevant sections in the file: file_info, workunit and result. Result has a version number and plan_class, workunit has a version number, and file_info has neither. All workunit entries that had been changed to GPU had a version number of 610, which is correct. All 'result' entries that had been changed to GPU had a plan_class entry of >cuda< rather than >cuda_fermi<. I did a case-sensitive search and replace in Notepad from >cuda< to >cuda_fermi< (case-sensitive because there are 2 other entries containing >CUDA< which should not change), restarted BOINC, and all worked OK: no lost workunits because of missing apps. I believe the same is true of 609 tasks, since >cuda_23< was adopted, so the same fix should apply, but I haven't actually tested this yet. Anyone continuing to use >cuda< rather than >cuda_23< or >cuda_fermi< "should" find that Reschedule works OK. This does of course mean manual intervention when using Reschedule; not ideal, but better than letting your GPU run dry. That is: 1. Stop BOINC (including the running applications). 2. Run Reschedule to move tasks from CPU to GPU. 3. Manually edit the client_state file to change plan_class where necessary. 4. Restart BOINC. N.B. If you run Reschedule while BOINC is running, it will automatically restart BOINC before you have a chance to edit the file, and all incorrect tasks will be dropped. I have only tested this on one machine, so use caution if you are attempting this, especially with the shortage of new tasks at present (you don't want to lose any precious tasks!), and definitely take a backup first so that all files can be restored to their pre-Reschedule state until you are confident that all is working OK. John. GPU Users Group |
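A minimal sketch of the manual edit in step 3, assuming a heavily abbreviated result entry. Only the version number and plan_class values come from the post above; the task name is a hypothetical placeholder, and a real entry in client_state.xml contains more child elements, which stay untouched.

```xml
<!-- Abbreviated, hypothetical result entry; not copied from a real
     client_state.xml. Other child elements are omitted. -->
<result>
    <!-- Placeholder task name, for illustration only. -->
    <name>EXAMPLE_TASK_NAME_0</name>
    <version_num>610</version_num>
    <!-- As left by Reschedule after a CPU to GPU move, per the post above:
         <plan_class>cuda</plan_class> -->
    <!-- After the manual, case-sensitive edit: -->
    <plan_class>cuda_fermi</plan_class>
</result>
```

Because the replacement is case-sensitive, entries containing the upper-case CUDA are left alone, as the post describes. Work on a backup copy first and make the edit only while BOINC is stopped.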
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Can I please have an example of what might need to be changed in the client_state file? |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14687 Credit: 200,643,578 RAC: 874 |
Speedy, the rescheduler won't work with the Fermi. It only recognizes 6.08 and 6.09; it cannot handle the 6.10 Fermi plan_class. If you would all just read a bit more of the thread - specifically, my conversation with MadMaC on 16 June. Questor (just now) is absolutely right, and has confirmed what we found then: ReScheduler puts the correct <version_num> into the file, but the wrong <plan_class>. Manually changing every <plan_class> to cuda_fermi obviously works, but the alternative is to change your app_info file so that BOINC knows how to handle the rescheduled tasks. We got into a slight muddle with x64 applications last time, so here's a slightly different suggestion. Open your app_info.xml file for editing. (Usual rules - plain text only, Notepad in ANSI mode is fine). Locate the <app_version> ... </app_version> containing your Fermi application. Duplicate the entire section (copy and paste), including the two bracketing tags <app_version> and </app_version>. In one copy, change the <plan_class> from cuda_fermi to cuda: leave the other alone. Repeat the above paragraph if you have any more app_version sections containing the Fermi application. Save your changes. And that's it. Rescheduler should work automatically again. |
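A sketch of what the duplicated Fermi section might look like after following the steps above. The file names are the ones posted earlier in this thread; the avg_ncpus, max_ncpus and count values are illustrative only, so keep whatever your own app_info.xml already contains (including any flops entry, which is omitted here).

```xml
<!-- Original Fermi section: left alone. -->
<app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>610</version_num>
    <avg_ncpus>0.300000</avg_ncpus>
    <max_ncpus>0.300000</max_ncpus>
    <plan_class>cuda_fermi</plan_class>
    <file_ref>
        <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name>
        <main_program/>
    </file_ref>
    <file_ref><file_name>cudart32_30_14.dll</file_name></file_ref>
    <file_ref><file_name>cufft32_30_14.dll</file_name></file_ref>
    <file_ref><file_name>libfftw3f-3-1-1a_upx.dll</file_name></file_ref>
    <coproc>
        <type>CUDA</type>
        <count>0.5</count>
    </coproc>
</app_version>
<!-- Pasted copy: identical except that plan_class is now cuda, so that
     tasks rebranded by the rescheduler can find an application. -->
<app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>610</version_num>
    <avg_ncpus>0.300000</avg_ncpus>
    <max_ncpus>0.300000</max_ncpus>
    <plan_class>cuda</plan_class>
    <file_ref>
        <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name>
        <main_program/>
    </file_ref>
    <file_ref><file_name>cudart32_30_14.dll</file_name></file_ref>
    <file_ref><file_name>cufft32_30_14.dll</file_name></file_ref>
    <file_ref><file_name>libfftw3f-3-1-1a_upx.dll</file_name></file_ref>
    <coproc>
        <type>CUDA</type>
        <count>0.5</count>
    </coproc>
</app_version>
```

Both sections go inside the existing <app_info> ... </app_info> element; nothing else in the file needs to change for this approach.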
Speedy Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Thanks Richard. I'm going to leave the client_state file alone, as I'm scared I'm going to crash my tasks. Thanks all the same. |
Joined: 20 Jun 99 Posts: 6659 Credit: 121,090,076 RAC: 0 |
Good morning! Based on careful reading of this thread, I tried to get my app_info file straightened out with the latest file names. I got 3 GPU units, but they errored out instantly, so I realize I need help. This is what I have so far. The CPU and AP portions work perfectly, as they are the result of the Lunatics installer, but clearly the Fermi portions are flawed somehow. <app_info> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>AK_v8b_win_x64_SSSE3x.exe</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_intelx86</platform> <file_ref> <file_name>AK_v8b_win_x64_SSSE3x.exe</file_name> <main_program/> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>AK_v8b_win_x64_SSSE3x.exe</file_name> <main_program/> </file_ref> </app_version> <app> <name>astropulse_v505</name> </app> <file_info> <name>ap_5.05r409_SSE.exe</name> <executable/> </file_info> <app_version> <app_name>astropulse_v505</app_name> <version_num>505</version_num> <platform>windows_intelx86</platform> <file_ref> <file_name>ap_5.05r409_SSE.exe</file_name> <main_program/> </file_ref> </app_version> <app_version> <app_name>astropulse_v505</app_name> <version_num>505</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>ap_5.05r409_SSE.exe</file_name> <main_program/> </file_ref> </app_version> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</name> <executable/> </file_info> <file_info> <name>cudart32_30_14.dll</name> <executable/> </file_info> <file_info> <name>cufft32_30_14.dll</name> <executable/> </file_info> <file_info> <name>libfftw3f-3-1-1a_upx.dll</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <avg_ncpus>0.100000</avg_ncpus> 
<max_ncpus>0.100000</max_ncpus> <platform>windows_intelx86_64</platform> <plan_class>cuda_fermi</plan_class> <file_ref> <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>libfftw3f-3-1-1a_upx.dll</file_name> </file_ref> <coproc> <type>CUDA</type> <count>1</count> </coproc> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <avg_ncpus>0.100000</avg_ncpus> <max_ncpus>0.100000</max_ncpus> <platform>windows_x86_64</platform> <plan_class>cuda_fermi</plan_class> <file_ref> <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>libfftw3f-3-1-1a_upx.dll</file_name> </file_ref> <coproc> <type>CUDA</type> <count>1</count> </coproc> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <avg_ncpus>0.100000</avg_ncpus> <max_ncpus>0.100000</max_ncpus> <platform>windows_intelx86_64</platform> <plan_class>cuda_fermi</plan_class> <file_ref> <file_name>setiathome_6.10_windows_intelx86__cuda_fermi.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_30_14.dll</file_name> </file_ref> <file_ref> <file_name>libfftw3f-3-1-1a_upx.dll</file_name> </file_ref> <coproc> <type>CUDA</type> <count>1</count> </coproc> </app_version> Thank you for any help. Steve Warning, addicted to SETI crunching! Crunching as a member of GPU Users Group. GPUUG Website |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14687 Credit: 200,643,578 RAC: 874 |
Good morning! Based on careful reading of this thread, I tried to get my app_info file straightened out with the latest file names. I got 3 GPU units, but they errored out instantly, so I realize I need help. This is what I have so far. The CPU and AP portions work perfectly, as they are the result of the Lunatics installer, but clearly the Fermi portions are flawed somehow. It would be better if you posted a representative sub-set of error messages, so we know what we're looking for. |
Joined: 20 Jun 99 Posts: 6659 Credit: 121,090,076 RAC: 0 |
Good morning! Based on careful reading of this thread, I tried to get my app_info file straightened out with the latest file names. I got 3 GPU units, but they errored out instantly, so I realize I need help. This is what I have so far. The CPU and AP portions work perfectly, as they are the result of the Lunatics installer, but clearly the Fermi portions are flawed somehow. At the time, BOINC hadn't reported yet, and all BOINC said was computation error. Here is a link to one of the failed units. http://setiathome.berkeley.edu/result.php?resultid=1638383755 Thank you! I really feel bad causing even one error. Steve Warning, addicted to SETI crunching! Crunching as a member of GPU Users Group. GPUUG Website |
Joined: 3 Sep 04 Posts: 471 Credit: 230,506,401 RAC: 157 |
Speedy, the rescheduler won't work with the Fermi. It only recognizes 6.08 and 6.09; it cannot handle the 6.10 Fermi plan_class. Found your MadMaC conversation now. It's hard keeping up sometimes - I don't know how you do it! So the rebranded tasks just get processed with the extra >cuda< section of app_info, and all original GPU tasks that were not rebranded are processed with the original >cuda_fermi< section. GPU Users Group |
Joined: 3 Sep 04 Posts: 471 Credit: 230,506,401 RAC: 157 |
Speedy, looking back at this post, are you still getting 0 tasks showing up when you run the Reschedule tool? If you actually have CPU/GPU tasks but it shows a 0 count, you may be suffering from a problem I had, where a slight (very difficult to spot) corruption in client_state.xml causes Reschedule to show 0 tasks even though BOINC works perfectly OK. GPU Users Group |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14687 Credit: 200,643,578 RAC: 874 |
Good morning! Based on careful reading of this thread, I tried to get my app_info file straightened out with the latest file names. I got 3 GPU units, but they errored out instantly, so I realize I need help. This is what I have so far. The CPU and AP portions work perfectly, as they are the result of the Lunatics installer, but clearly the Fermi portions are flawed somehow. IIRC, "Exit status -185 (0xffffffffffffff47)" may refer to not having the correct DLL files either linked via app_info or present in the project directory. But I'm 100 miles away from the nearest CUDA card this weekend, so it's hard to check. Or: is anyone else reading Steve's app_info as having three identical Fermi sections, all with <version_num>610</version_num> <platform>windows_intelx86_64</platform> <plan_class>cuda_fermi</plan_class>? Read back over my conversations with MadMaC, but I think I'd try that with at least one each of: <platform> windows_intelx86 <plan_class> cuda_fermi and <platform> windows_intelx86 <plan_class> cuda (in the original format, of course: I've just shown it like that to emphasise the changes). The DLL references look OK - just check the files themselves are still there. |
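A sketch of the two Fermi sections suggested above, written out in the original format. This is untested, in the same spirit as the suggestion itself; only the tags that differ are spelled out, and the remaining children are assumed to stay exactly as in Steve's posted file.

```xml
<!-- Fermi section for tasks that arrive as cuda_fermi. -->
<app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>610</version_num>
    <platform>windows_intelx86</platform>
    <plan_class>cuda_fermi</plan_class>
    <!-- avg_ncpus, max_ncpus, file_ref and coproc entries unchanged -->
</app_version>
<!-- Second copy for tasks rebranded as plain cuda. -->
<app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>610</version_num>
    <platform>windows_intelx86</platform>
    <plan_class>cuda</plan_class>
    <!-- avg_ncpus, max_ncpus, file_ref and coproc entries unchanged -->
</app_version>
```

The whole file also needs to end with a closing </app_info> tag, which does not appear in the listing as posted.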