Message boards :
Number crunching :
High performance Linux clients at SETI
Message board moderation
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 20 · Next
Author | Message |
---|---|
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
Thanks TBar. We appreciate the work you put into developing the All-in-One package. +3 |
tazzduke Send message Joined: 15 Sep 07 Posts: 190 Credit: 28,269,068 RAC: 5 |
+4 |
Dr.Diesel Send message Joined: 14 May 99 Posts: 41 Credit: 123,695,755 RAC: 139 |
Many thanks TBar. |
Joe Januzzi Send message Joined: 13 Apr 03 Posts: 54 Credit: 307,134,110 RAC: 492 |
|
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14653 Credit: 200,643,578 RAC: 874 |
So, if you put a condition round the scroll command, likePleased to report that this patch has been accepted by the scrutineers and will be incorporated in the next client/manager release - probably 7.16, whenever that happens: there is still one showstopper bug which hasn't been fixed, and other items are still under discussion. |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
The New All-In-One is in place, at the same place, http://www.arkayn.us/lunatics/BOINC.7zA New version has been uploaded to the same location. New in this version is an upgrade to 0.98b1 featuring numerous improvements to compatibility and speed. Most notable is much better handling of the Arecibo files producing fewer inconclusive results. All Users should update to this version to reduce the number of repeated Validation attempts. The default versions are BOINC 7.14.2 requiring at least Ubuntu 15.04 or higher, versions for 14.04 & 12.04 are included if needed. These BOINC Apps have the fix which Stops the jumping Tasks & Transfers Pages. The default CUDA App is CUDA 9.0 which will work with the Repository drivers from 14.04 through 18.10. The CUDA 10.1 App is included for those able to update to a CUDA 10.1 driver. ReadMe files are included. This version also features much lower VRAM requirements meaning your 2 GB GPUs can be used for limited other uses while crunching, use NVIDIA XServer Settings or nvidia-smi -l to view VRAM usage. |
-= Vyper =- Send message Joined: 5 Sep 99 Posts: 1652 Credit: 1,065,191,981 RAC: 2,537 |
|
W3Perl Send message Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Thanks Petri, TBar and everyone involved in this masterpiece ! :) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Thanks Petri for such a nice application. Cheers. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Still waiting for tasks finished with the new app to report so I can inspect the stderr.txt. From the readme it looks like the default unroll is again equal to the number of SM's on the card. But I saw interesting things with unroll =1 on my beta test. So to answer a question, I think I will run the new app through the benchmark again with the default unroll parameter against the new app with unroll=1 to see what if any improvements are gained. [Edit] I see now that I was wrong. The default unroll is one now with new app. So no need to test in the benchmark. It still would be interesting to compile a version with the old unroll method to get an app to benchmark against this new version since TBar had included the lovely source code for the applications. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
yeah I saw the same. I loaded up the 418 drivers and new app on my 6-GPU system. but initial impressions were that it was ever so slightly slower than the beta version of v0.98 that petri had me testing (which used a normal unroll value equal to the SM/CU count). if there's any slowdown (vs 0.98 beta), it's probably due to features added to make it more compatible with older cards. that's not a bad thing, just might be a little slower on the newer cards if that's the case. I hope petri can chime in about something else though. Why the change from CUDA 10 to 10.1? was there any specific reason for this? From what i remember, performance was better with 410 drivers (vs 415), so i always stayed there. but having cuda 10.1 requires the 418+ drivers now, which may have the same slowdown in performance that 415 had. I'll do the same and run my suite of benchmarks to directly compare v0.98 beta (cuda10) to v0.98b1 (cuda10.1) with 418 drivers. Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
W3Perl Send message Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Unroll always set to 1 ? No way to change this value in the <cmdline> line ? GTX 1050 Ti have 4 Gb of RAM but sometimes you can fill them with Xorg/compiz (with two 2K monitor) and some apps like Mathematica. I used to change the unroll value on the fly according to the remaning free gpu ram. Does it mean the binary require a fixed amount of ram, whatever the number of sm is used ? If yes, it's a good news as I don't need anymore to monitor the free ram remaining ! Tell me please ! Thanks |
JohnDK Send message Joined: 28 May 00 Posts: 1222 Credit: 451,243,443 RAC: 1,127 |
From the readme 4) With this version -unroll autotune and Blocking Sync is set by default. Override Autotune with the cmdline -unroll N, Blocking Sync override is -nobs |
petri33 Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 |
Hi, unroll is default 1. You can set it with -unroll 2 or whatever your RAM and number of SM is if you want to squeeze out a second or so. Using value 1 makes it possibly a bit slower but more compatible with the official CPU version (1 means sequential find stage.) The pulse find algorithm search stage was completely rewritten . It does not need any buffer for temporary values. The scan is fully unrolled to all SM units but does not require any memory to store data. Earlier the process read the values sequentially and wrote sums back and read the values again and did addition and wrote again and again,,,. That needed a lot memory for each SM participating. Indexes: 01(2)3456789 56(7)89 34 89 1 6 4 9 now it reads them in 0538164927 and makes pairwise sums and sums of sums ... in one go. If something is found when scanning the original routine kicks in and reports the pulse. [/u] To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Yes, I forgot to mention in my beta test summary that the memory usage is one third of the previous application. Form 4800MB down to 1684MB. I would be very interested in your benchmark run Ian. I don't have the 0.98b1CUDA10 app to compare against. The speed may be less on the new 0.98b1CUDA10.1 with unroll =1 . . . . but I think you will be able to process more work per day because of the decrease in idle time. You are no longer having to set up memory and then read from it. Also, very good point Petri points out, the search algorithm is now sequential matching the search algorithm of the reference cpu app. So we should no longer get invalids because of the differences in pulse counts on early and late overflows. Since the benchmark is so easy to use with gpu apps running for such short times, it will be easy to run through a battery of tests incrementing the unroll count argument. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
thanks for the info Petri. can you comment on CUDA 10 vs 10.1? was it implemented just to be current? or was there another reason? Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
Yes, I forgot to mention in my beta test summary that the memory usage is one third of the previous application. Form 4800MB down to 1684MB. here you go. TL;DR the beta was pretty consistently a little bit faster by a few seconds/%, but not enough for me to revert. i think the larger change in speed came from the 410->418 drivers. but I'm not planning to revert on this system. i'll just leave it. it's still faster than v0.97 either way, which is a good thing :) KWSN-Linux-MBbench v3.0 cache-keeping edition Running on NVIDIA_RIG1 at Tue 16 Apr 2019 06:28:32 PM UTC ---------------------------------------------------------------- Starting benchmark run... ---------------------------------------------------------------- Suspending BOINC Listing wu-file(s) in /testWUs : 09ap19aa.1283.476.16.43.64.wu 09ap19aa.1283.476.16.43.66.wu 09ap19aa.1283.476.16.43.70.wu 09ap19aa.1283.476.16.43.73.wu 09ap19aa.1283.476.16.43.75.wu 11ap19aa.2325.25961.8.35.149.vlar.wu 11oc06aa.28195.15614.11.38.24.vlar.wu 11oc06aa.29615.15614.13.40.195.vlar.wu 12au10ae.18476.18484.11.38.41.vlar.wu 14oc18ad.1344.25021.3.30.59.wu 14oc18ad.22649.481.12.39.170.wu blc01_2bit_guppi_58406_00572_HIP85973_0031.13793.818.22.45.29.vlar.wu blc01_2bit_guppi_58406_00572_HIP85973_0031.14467.818.21.44.242.vlar.wu blc05_2bit_guppi_58227_05505_HIP52675_0013.6536.409.21.44.88.vlar.wu blc05_2bit_guppi_58227_08999_HIP53034_0023.16861.409.22.45.251.vlar.wu blc22_2bit_guppi_58340_32429_HIP112870_0012.26098.818.19.28.48.vlar.wu blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.103.vlar_0_r980328791_0.wu blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.91.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15278.0.22.45.169.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15474.818.22.45.25.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.14.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.25.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.27.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.36.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.15.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.33.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.15.vlar.wu blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.46.vlar.wu Listing executable(s) in /APPS : setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 Listing executable in /REF_APPS : setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ---------------------------------------------------------------- Current WU: 09ap19aa.1283.476.16.43.64.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 75.01 sec 19.99 sec 6.44 sec Elapsed Time: ....................... 75 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 74.22 sec 18.86 sec 6.36 sec Elapsed Time : ...................... 75 seconds Speed compared to default : ......... 100 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 09ap19aa.1283.476.16.43.64.wu ==================================================================== Current WU: 09ap19aa.1283.476.16.43.66.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 74.44 sec 17.91 sec 5.67 sec Elapsed Time: ....................... 74 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 73.62 sec 16.39 sec 5.33 sec Elapsed Time : ...................... 74 seconds Speed compared to default : ......... 100 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 09ap19aa.1283.476.16.43.66.wu ==================================================================== Current WU: 09ap19aa.1283.476.16.43.70.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 74.96 sec 16.65 sec 5.53 sec Elapsed Time: ....................... 75 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 73.99 sec 16.38 sec 5.25 sec Elapsed Time : ...................... 74 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 99.99% ---------------------------------------------------------------- Done with 09ap19aa.1283.476.16.43.70.wu ==================================================================== Current WU: 09ap19aa.1283.476.16.43.73.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 74.50 sec 20.91 sec 6.71 sec Elapsed Time: ....................... 75 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 74.21 sec 26.71 sec 8.43 sec Elapsed Time : ...................... 74 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 09ap19aa.1283.476.16.43.73.wu ==================================================================== Current WU: 09ap19aa.1283.476.16.43.75.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 74.46 sec 25.24 sec 8.38 sec Elapsed Time: ....................... 75 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 73.69 sec 25.22 sec 8.18 sec Elapsed Time : ...................... 73 seconds Speed compared to default : ......... 102 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 09ap19aa.1283.476.16.43.75.wu ==================================================================== Current WU: 11ap19aa.2325.25961.8.35.149.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 113.73 sec 31.29 sec 12.01 sec Elapsed Time: ....................... 113 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 111.60 sec 30.04 sec 10.99 sec Elapsed Time : ...................... 112 seconds Speed compared to default : ......... 100 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 11ap19aa.2325.25961.8.35.149.vlar.wu ==================================================================== Current WU: 11oc06aa.28195.15614.11.38.24.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 91.69 sec 27.48 sec 9.55 sec Elapsed Time: ....................... 92 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 90.95 sec 27.30 sec 8.65 sec Elapsed Time : ...................... 91 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 11oc06aa.28195.15614.11.38.24.vlar.wu ==================================================================== Current WU: 11oc06aa.29615.15614.13.40.195.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 93.67 sec 29.00 sec 9.63 sec Elapsed Time: ....................... 94 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 91.48 sec 28.68 sec 9.04 sec Elapsed Time : ...................... 91 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 11oc06aa.29615.15614.13.40.195.vlar.wu ==================================================================== Current WU: 12au10ae.18476.18484.11.38.41.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 113.08 sec 31.56 sec 11.47 sec Elapsed Time: ....................... 113 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 111.48 sec 30.44 sec 10.88 sec Elapsed Time : ...................... 112 seconds Speed compared to default : ......... 100 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 12au10ae.18476.18484.11.38.41.vlar.wu ==================================================================== Current WU: 14oc18ad.1344.25021.3.30.59.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 39.58 sec 13.22 sec 4.35 sec Elapsed Time: ....................... 40 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 38.56 sec 13.96 sec 4.50 sec Elapsed Time : ...................... 38 seconds Speed compared to default : ......... 105 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 14oc18ad.1344.25021.3.30.59.wu ==================================================================== Current WU: 14oc18ad.22649.481.12.39.170.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 38.54 sec 13.75 sec 4.31 sec Elapsed Time: ....................... 39 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 37.40 sec 13.27 sec 4.53 sec Elapsed Time : ...................... 37 seconds Speed compared to default : ......... 105 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with 14oc18ad.22649.481.12.39.170.wu ==================================================================== Current WU: blc01_2bit_guppi_58406_00572_HIP85973_0031.13793.818.22.45.29.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 61.40 sec 18.50 sec 7.07 sec Elapsed Time: ....................... 62 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 59.55 sec 17.13 sec 6.23 sec Elapsed Time : ...................... 60 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc01_2bit_guppi_58406_00572_HIP85973_0031.13793.818.22.45.29.vlar.wu ==================================================================== Current WU: blc01_2bit_guppi_58406_00572_HIP85973_0031.14467.818.21.44.242.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 61.14 sec 18.18 sec 6.87 sec Elapsed Time: ....................... 61 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 59.44 sec 17.83 sec 5.95 sec Elapsed Time : ...................... 59 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc01_2bit_guppi_58406_00572_HIP85973_0031.14467.818.21.44.242.vlar.wu ==================================================================== Current WU: blc05_2bit_guppi_58227_05505_HIP52675_0013.6536.409.21.44.88.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 64.95 sec 20.09 sec 7.82 sec Elapsed Time: ....................... 65 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 63.28 sec 18.79 sec 6.55 sec Elapsed Time : ...................... 64 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc05_2bit_guppi_58227_05505_HIP52675_0013.6536.409.21.44.88.vlar.wu ==================================================================== Current WU: blc05_2bit_guppi_58227_08999_HIP53034_0023.16861.409.22.45.251.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 64.46 sec 18.92 sec 7.78 sec Elapsed Time: ....................... 64 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 63.08 sec 17.52 sec 6.76 sec Elapsed Time : ...................... 63 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc05_2bit_guppi_58227_08999_HIP53034_0023.16861.409.22.45.251.vlar.wu ==================================================================== Current WU: blc22_2bit_guppi_58340_32429_HIP112870_0012.26098.818.19.28.48.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 94.98 sec 27.69 sec 10.28 sec Elapsed Time: ....................... 95 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 91.96 sec 24.74 sec 9.23 sec Elapsed Time : ...................... 92 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc22_2bit_guppi_58340_32429_HIP112870_0012.26098.818.19.28.48.vlar.wu ==================================================================== Current WU: blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.103.vlar_0_r980328791_0.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 1.99 sec 0.00 sec 0.00 sec Elapsed Time: ....................... 2 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 0.19 sec 0.00 sec 0.00 sec setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 failed to run on your system. Continuing with next (if any)... ---------------------------------------------------------------- Done with blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.103.vlar_0_r980328791_0.wu ==================================================================== Current WU: blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.91.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 94.92 sec 27.31 sec 9.94 sec Elapsed Time: ....................... 95 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 92.64 sec 24.80 sec 9.00 sec Elapsed Time : ...................... 93 seconds Speed compared to default : ......... 102 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc22_2bit_guppi_58340_32771_HIP112187_0013.26088.818.20.29.91.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15278.0.22.45.169.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.65 sec 24.50 sec 9.40 sec Elapsed Time: ....................... 84 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 80.60 sec 21.98 sec 8.18 sec Elapsed Time : ...................... 80 seconds Speed compared to default : ......... 105 % ----------------- Comparing results Result : Strongly similar, Q= 99.99% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15278.0.22.45.169.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15474.818.22.45.25.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.29 sec 24.03 sec 8.77 sec Elapsed Time: ....................... 83 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 81.34 sec 21.97 sec 8.41 sec Elapsed Time : ...................... 81 seconds Speed compared to default : ......... 102 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15474.818.22.45.25.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.14.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 85.09 sec 24.01 sec 9.44 sec Elapsed Time: ....................... 85 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 83.71 sec 22.22 sec 7.69 sec Elapsed Time : ...................... 83 seconds Speed compared to default : ......... 102 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.14.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.25.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.44 sec 20.40 sec 8.35 sec Elapsed Time: ....................... 84 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 80.98 sec 14.84 sec 5.32 sec Elapsed Time : ...................... 81 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.25.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.27.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 82.44 sec 17.41 sec 7.04 sec Elapsed Time: ....................... 82 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 79.77 sec 15.01 sec 6.00 sec Elapsed Time : ...................... 79 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.27.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.36.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.21 sec 17.78 sec 7.37 sec Elapsed Time: ....................... 84 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 79.54 sec 15.40 sec 5.63 sec Elapsed Time : ...................... 79 seconds Speed compared to default : ......... 106 % ----------------- Comparing results Result : Strongly similar, Q= 99.99% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15488.818.21.44.36.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.15.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 81.31 sec 16.86 sec 6.59 sec Elapsed Time: ....................... 82 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 78.98 sec 15.29 sec 5.59 sec Elapsed Time : ...................... 79 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.15.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.33.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 81.54 sec 17.06 sec 6.81 sec Elapsed Time: ....................... 81 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 78.98 sec 16.36 sec 5.83 sec Elapsed Time : ...................... 79 seconds Speed compared to default : ......... 102 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15503.818.22.45.33.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.15.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.64 sec 23.54 sec 8.82 sec Elapsed Time: ....................... 83 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 81.53 sec 22.28 sec 7.93 sec Elapsed Time : ...................... 82 seconds Speed compared to default : ......... 101 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.15.vlar.wu ==================================================================== Current WU: blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.46.vlar.wu ---------------------------------------------------------------- Running default app with command :... setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 ./setiathome_x41p_V0.98b1_x86_64-pc-linux-gnu_cuda101 83.60 sec 20.90 sec 7.92 sec Elapsed Time: ....................... 84 seconds ---------------------------------------------------------------- Running app with command : .......... setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 ./setiathome_x41p_v0.98b1_beta_x86_64-pc-linux-gnu_cuda100 81.56 sec 15.61 sec 5.88 sec Elapsed Time : ...................... 81 seconds Speed compared to default : ......... 103 % ----------------- Comparing results Result : Strongly similar, Q= 100.0% ---------------------------------------------------------------- Done with blc32_2bit_guppi_58405_78644_PSR_J2022+5154_0007.15826.818.21.44.46.vlar.wu ==================================================================== Hosts CPU data ... model name : Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz cpu cores : 4 cpu MHz : 4200.265 cache size : 8192 KB flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault invpcid_single pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp flush_l1d Done with Benchmark run! Removing temporary files! Resuming BOINC Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
petri33 Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 |
thanks for the info Petri. Just to be current and make sure the code compiles in the future too. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
ok, makes sense. thanks :) Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Thanks for the benchmark results with the two apps Ian. Question . . . .were you running with any parameters? Like -nobs and -pfb 32 for the CUDA10 app? It wasn't until I examined my first results of the beta app did I notice the -unroll 1 parameter and that my normal -pfb parameter was getting getting overwritten. I then edited out my -pfb 32 parameter since it was going to get overwritten anyway. I see that the older beta 0.98b1 app is consistently faster by 1 or 2 seconds. I'd like to hear from others running the new app how it performs on the lower grade cards like the 1060 3GB and and the 1050 cards. I don't know whether the lesser RAM requirement helps in speeding up the crunch time or not. Looks like I need to run through the hosts lists and look for those cards. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.