Message boards :
Number crunching :
Lunatics Windows Installer v0.41 Release Notes
Message board moderation
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 17 · Next
Author | Message |
---|---|
Sutaru Tsureku Send message Joined: 6 Apr 07 Posts: 7105 Credit: 147,663,825 RAC: 5 |
OK, I will continue to let run the APv6 r1797 CPU app. At which file size of the related .wisdom file I could see the app found the best/fastest settings? -> r1797 CPU app reached max performance. Currently 3 kb. [EDIT#2: To now, AFAIK, two half (finished) and two complete WUs calculated with r1797 CPU app.] Thanks. [EDIT: BTW, no problems so far on my system. All is running smoothly.] * Best regards! :-) * Philip J. Fry, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. * |
JarrettH Send message Joined: 14 Nov 02 Posts: 97 Credit: 25,385,250 RAC: 95 |
No, we still leave the default to crunch 1 WU at a time, download this file, unzip it and place it into the C:\ProgramData\BOINC\projects\setiathome.berkeley.edu folder, then go to the advanced menu in boinc and select read config files. Juat wanted to confirm, but previously I was running 4 units at a time on my i3 2100 (2 cores with hyperthreading). With v7, which I have noticed takes an extra 45~ min to complete, should I limit to processing 2 at a time instead? I didn't catch whether your discussion was about GPU or CPU |
Darth Beaver Send message Joined: 20 Aug 99 Posts: 6728 Credit: 21,443,075 RAC: 3 |
I have question not shore if you have already been asked but anyway Thanks Raistmer and thanks again to the team for the work you have done |
Gatekeeper Send message Joined: 14 Jul 04 Posts: 887 Credit: 176,479,616 RAC: 0 |
No, we still leave the default to crunch 1 WU at a time, download this file, unzip it and place it into the C:\ProgramData\BOINC\projects\setiathome.berkeley.edu folder, then go to the advanced menu in boinc and select read config files. They're talking about GPU's. As long as your temps are under control, you can run 4 CPU units at a time (2XHT) |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
AKV8B_SSE3_AMD You don't actually need an SSE3 app - the SSE2 app runs at the same speed on AMD hardware (we tested one, and decided to take it out to save space - the download has doubled in size since last year as it is). Joe Segur has posted some technical details of the difference between SSE2 and SSE3 somewhere, and an explanation of why they don't help SETI - I'll try and find a link. Edit - it wasn't Joe, it was Jason, a few pages down this very thread. Message 1376357 |
Furlozza Send message Joined: 11 Sep 08 Posts: 14 Credit: 2,543,927 RAC: 0 |
Thanks to all at Lunatics involved in one way or another in THE installer, plus for throwing in the link to the app_config.xml file. G4-PC is now chewing through wus like they were going out of fashion (compared to standard apps *grin*) and with the 2 GPU wus at a time, winter will vanish... (hopefully) From Australia, thanks folks. |
shizaru Send message Joined: 14 Jun 04 Posts: 1130 Credit: 1,967,904 RAC: 0 |
A big THANX to all the Seti gurus!:) |
QSilver Send message Joined: 26 May 99 Posts: 232 Credit: 6,452,764 RAC: 0 |
I've been running the Lunatics apps for a while and, when I ran out of WUs over the weekend, I just waited it out until v0.41 arrived on Monday. The new installation went fine and it's been crunching well since then. A big Thank You to Lunatics and everyone involved. One quick question: Can the NV 4200M on this laptop handle 2 units at a time? |
Link Send message Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
While trying to install v7 on this computer one question came to my mind... no more SSE optimized apps? |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
While trying to install v7 on this computer one question came to my mind... no more SSE optimized apps? No, i`m afraid. SSE only has no benefit vs stock. With each crime and every kindness we birth our future. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
While trying to install v7 on this computer one question came to my mind... no more SSE optimized apps? Largely because the stock app itself has received some extra optimisation this time round. |
Link Send message Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
OK, than I guess it's time to upgrade this machine with the SSE2 capable hardware I have laying around since a year or two... |
bj Send message Joined: 11 Oct 00 Posts: 163 Credit: 50,429,507 RAC: 0 |
Quick question: Installed the Lunatics 0.41 on all my computers. On the Arkayn site; are they optimized app's or the same as the 0.41 installer but manual installation.? bj |
Josef W. Segur Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
... Actually, within 2 minutes after starting the first r1797 Astropulse task the contents of the .wisdom file were written and haven't really changed since. Each subsequent run will have loaded the file, seen that it contained info suitable for all the plans needed, and rewritten the file. That rewrite may change the order the codelets are listed, which makes a naive comparison think the files are different. The time saved by using the persistent wisdom rather than doing full planning for each new task is a constant on the order of a minute or so for modern CPUs. There's always some uncertainty whether the best/fastest plans have been found and the associated wisdom saved, partly because the best plan at any moment depends on whatever else is active in the multitasking environment. FFTW does quick tests of possible plans, starting with those most likely to perform well and then others which could be good too. But if the testing of the plan which would be best long-term happens to be slowed by a system background task or another application then the saved wisdom isn't fully optimal. Fortunately the second-best plan is likely to be so close in performance that there's no significant loss in speed. Nevertheless, for someone willing to put in extra effort to achieve best Astropulse CPU performance I'll suggest this procedure: 1. Select all tasks in BOINC Manager (or other Manager) and suspend them. 2. Shut BOINC down. 3. Rename the r1797_[stripped CPU name].wisdom file to preserve it but force creation of a new one. 4. Restart BOINC. 5. Unsuspend one task for AP6_win_x86_SSE_CPU_r1797.exe 6. Wait until it has created the new wisdom file, then unsuspend all the other tasks. If your preferences are set to NOT "Leave tasks in memory while suspended" steps 2 and 4 are not needed. That's a global computing preference where the web setting may be overridden by the local preferences, so using those steps is recommended unless you're very sure they aren't needed. The purpose is to do the planning with least interference from other crunching, and it would make sense to shut down all other applications too during the procedure. Even this doesn't guarantee the best wisdom, but in testing I've done it gives the best chance of getting optimal wisdom. Testing whether that has improved the wisdom is a matter of running enough tasks to get a decent comparison, paying attention to the amount of blanking, etc. Joe" |
Rockhount Send message Joined: 29 May 00 Posts: 34 Credit: 31,935,954 RAC: 29 |
I've a question about the available CUDA Versions. On one of my machine I could not change the nvidia driver version. There ist only the 257.59 available with CUDA 3.1 on a Quadro 4000 (Fermi). The new v7 support CUDA > 3.2 or 2.3 but with the warning not to try this on Kepler or Fermi cards. Is there any way or chance to use this card for v7 with cuda 3.1 and Fermi? The CPU (Xeon L5640) works most on einstein and the gpu should do the seti stuff, this worked fine till v7 came up. greetz from germany Regards from nothern Germany Roman SETI@home classic workunits 207,059 SETI@home classic CPU time 1,251,095 hours |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
I've a question about the available CUDA Versions. I'm surprised you say 'only 257.59' available for Quadro 4000. I found every recognised WHQL driver version listed, up to and including v320.00: http://www.nvidia.de/object/quadro-tesla-grid-win8-win7-winvista-64bit-320.00-whql-driver-de.html Your only listed NVidia host (6428818) describes the card as 'NVIDIA D12P1-30', and the driver as v267.24 already. |
BilBg Send message Joined: 27 May 07 Posts: 3720 Credit: 9,385,827 RAC: 0 |
Nevertheless, for someone willing to put in extra effort to achieve best Astropulse CPU performance I'll suggest this procedure: 1. For some apps this have to be done twice i.e.: - after a few minutes suspend the first running task - wait a few seconds and unsuspend another task of the same kind - wait a few minutes before "unsuspend all the other tasks" I tested with AKv8c_Bb_r1846_winx86_SSE2x.exe - the #1 task created the new wisdom file with size 14 KB, the #2 task increased the size to 29 KB (AP6_win_x86_SSE2_OpenCL_ATI_r1843.exe seems to (maybe) need only one test but it doesn't hurt to run a second task for a while before unsuspend all tasks) Each subsequent run will have loaded the file, seen that it contained info suitable for all the plans needed, and rewritten the file. That rewrite may change the order the codelets are listed, which makes a naive comparison think the files are different. I can confirm that #3 task do not change the values in the wisdom file (I made copy of 'wisdom' after each run/task). The file seems changed but if you sort it alphabetically you will see it is the same for #2 and #3 run tasks. (I used in CMD prompt: sort <r1846_AMDAthlontmIIX3455Processor.wisdom >>wisdomSorted) 2. I think that (Intel) SpeedStep and (AMD) Cool'n'Quiet / PowerNow! technologies can lead to unpredictable results (skew the tests) during this 'wisdom' files generation. So disable them (in the BIOS) before the procedure. Â - ALF - "Find out what you don't do well ..... then don't do it!" :) Â |
William Send message Joined: 14 Feb 13 Posts: 2037 Credit: 17,689,662 RAC: 0 |
Attention those failing to getting the Brook app to run. I've posted a revised aistub here. Please give it a try. We can't update the installer until we've consulted with Eric on this matter (there's a build in beta). A person who won't read has no advantage over one who can't read. (Mark Twain) |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
Nevertheless, for someone willing to put in extra effort to achieve best Astropulse CPU performance I'll suggest this procedure: It's Josef's CPU applications which do the double-wisdom jig. Raistmer's ATI applications do all they're going to do the first time. (Reasoning: the full wisdom generation for the CPU app takes some time, on what may be inherently a slow processor by modern/GPU standards. And until it completes, the task will display no progress in BOINC Manager. It is likely that the user will be around and anxious to check that it's working, immediately after installation. So we don't want a big startup delay the first time. But by the time the second task starts, it's likely that the user will have lost interest and wandered off to make a cup of tea. So the app can roll up its sleeves and really get to work on that wisdom, without disturbing anybody. At least that's how I read it.) |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
:) actually there is some tech reasoning too, not only sociological one :) first start will be most probably for all tasks. Later finish times will be scattered so delay in one task allows to avoid situation where all 4 or more CPu tasks on multicore host do same FFTW planning, i.e. quite artifical CPU load that will never meet after. Planning on even loaded CPU but loaded with usual work could be more adequate. SETI apps news We're not gonna fight them. We're gonna transcend them. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.