|
121)
留言板 :
Number crunching :
Linux CUDA 'Special' App finally available, featuring Low CPU use
(消息 1948187)
发表于:6 Aug 2018 作者: petri33
Post: I was just thinking about the source code at https://setisvn.ssl.berkeley.edu/trac/browser/branches/sah_v7_opt/Xbranch/client/alpha Looking at your MAC in FreeDC stats seems like you got a 30% boost a week ago. Your MAC will be at #8 in no time with 135-140 000 RAC. |
|
122)
留言板 :
Number crunching :
NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux
(消息 1948156)
发表于:6 Aug 2018 作者: petri33
Post: so memory overclock is more beneficial to the SETI work unit cruch times than gpu core overclock? The memory is the current limiter of performance. My recent software improvement (30-50%) in seti vlar calculations was achieved through reduced number of memory reads and writes. There is still a lot to do and gain. As of overcloking memory or graphics clocks all depends on how much temperature the card/cooling can handle and the benefit depends of the GPU generation and memory subsytem/architecture NVIDIA has chosen to implement. I can not overclock the processor during the summer but I can set the memory clocks at P2 to resemble the standard clocks at P0. Petri |
|
123)
留言板 :
Number crunching :
NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux
(消息 1948154)
发表于:6 Aug 2018 作者: petri33
Post: Hello kittyman, Yes P0 would be more productive, but there is no utility for Linux to make the card go to P0 permanently. As soon as a compute job begins the cards go to P2. When the job ends the cards go to P0 and crash with high overclocks. So I had to make an utility to keep the cards at P2 so that the card never jumps to P0. Petri |
|
124)
留言板 :
Number crunching :
NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux
(消息 1948076)
发表于:5 Aug 2018 作者: petri33
Post: Hi, It is hot here in Finland this summer. I have not set the gpu MHz any higher than the default so the offset is 0, I have raised the memory clock by 976 MHz on 1080 and 1000MHz on 1080Ti. The utility helps my GPUs to run stable. Running now ... 1080Ti: graphics 1835MHz and 11016 mem, 81 degrees Celsius, air cooled 1080 : graphics 1784MHz and 10002 mem, 82 degrees Celsius, air cooled I need to dust the cards on Tuesday. |
|
125)
留言板 :
Number crunching :
NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux
(消息 1948053)
发表于:5 Aug 2018 作者: petri33
Post: The executable allows higher mem and grapics clocks since the GPU is kept from going to P0 where the overclock values that are neede for P2 would be too high. High end NV GPUs are forced to use P2 with CUDA compute loads. When switching WUs the GPU goes briefly to P0 and crashes without this new tool. In the readme is a sample of how to compile the source code to an executable. It can be modified to build an executable for other cuda versions too. The executable for linux is the file called keepP2. It can be saved and run on any directory path before overclocking. You may have to say once "chmod ugo+x keepP2" to make it executable. If you have one GPU just run ./keepP2 & on any terminal window. If you have two GPUs run ./keepP2 device=0 & ./keepP2 device=1 & etc. It may need cuda 92 but i doubt that since ldd says: linux-vdso.so.1 => (0x00007fff5038d000) librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00007f5e3d888000) libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f5e3d66a000) libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f5e3d465000) libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f5e3d0e3000) libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f5e3cecc000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f5e3cb01000) /lib64/ld-linux-x86-64.so.2 (0x00005619ebe86000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f5e3c7f9000) |
|
126)
留言板 :
Number crunching :
NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux
(消息 1948006)
发表于:5 Aug 2018 作者: petri33
Post: Hi, Here : https://drive.google.com/open?id=1Jl1tlN5V27odSgzPAzCotED4Z9yUmkwn is an utility to keep yopur card at P2 state. I have problems overclocking since overclocking 1080, 1080Ti and VOLTA affects both P0 and P2 states. These cards can not be locked to P0 in linux. When CUDA compute workload finishes the card jumps to P0 and crashes. Now with this program running there is always a compute work load in the background. The program runs a simple GPU kernel 10 times per second and keeps the driver thinking something is being calculated and thus keeps the card at P2. The performance hit is negligible. Now you can overclock. To run this program in the background start it with ./keepP2 device=N & and replace N with your gpu id. I run ./keepP2 device=0 & ./keepP2 device=1 & ./keepP2 device=2 & ./keepP2 device=3 & to make all four cards to stay at P2. -- petri33 |
|
127)
留言板 :
Number crunching :
Linux CUDA 'Special' App finally available, featuring Low CPU use
(消息 1947839)
发表于:4 Aug 2018 作者: petri33
Post: @Petri, can you explain simply what the new tuning parameter in the latest 0.96 Beta does. I am referring to: The pfl sets a FftLength based limit for which implementation of pulse find algorithm to use. Now that I have found out the default value works well, ill probably remove the flag. It was for testing purposes. The 0.95 and 0.96 got a 30% speed gain from reorganizing the 'folding' part of the pulse find process. The reorganization reduced memory writes and reads by 50%. 0.96 is a bit slower than 0.95 but gives more accurate results with noise bombs. The 0.96 is still not ready. There is possibly lurking something bad with Arecibo shorties and some NV cards. |
|
128)
留言板 :
Number crunching :
SETI/BOINC Milestones [ v2.0 ] - XXIX
(消息 1947617)
发表于:2 Aug 2018 作者: petri33
Post: Ayyeee! #15 at Seti top users and with just one computer. About to break 270 mil total with this one. RAC and Credit: 348,381.78 269,657,402 EDIT: Average processing rate (Seti v8) 3,575.33 GFLOPS No need to celebrate. This is just an update. |
|
129)
留言板 :
Number crunching :
Intel 16 core/32T discusssion
(消息 1947612)
发表于:2 Aug 2018 作者: petri33
Post: Hi, *I'm sorry to hear about the death of your loved one Computer* and all the needed sentences of being sorry to be accepted as a qualified member of your community. How about trying to catch with the Linux world with your tech support person? Move your 10x0 cards or four of them to a new MOBO with an adequate (800W+) PSU and reorganize all the old GPUs to the windows machines. Then grab a one of the TBar supplied InstallAtOnce or otherwise named packages and enjoy a threefold RAC for that computer? The Linux is not so hard. Just set it up and let run. *Grumpy as my next door swede to the west* Petri Now for a ()___)___________)))~~ |
|
130)
留言板 :
Number crunching :
New monster. I'm doing something wrong
(消息 1947610)
发表于:2 Aug 2018 作者: petri33
Post: Can you run a cpu benchmark? Does it produce values typical for a TR? Does a memory benchmark produce proper values for the rated speed? What numbers do you get if you run the BOINC benchmark again? I would first confirm the cpu is good before looking at what might be wrong with the cpu app. +1 |
|
131)
留言板 :
Number crunching :
To Hyperthread or not to Hyperthread, that is the question
(消息 1947609)
发表于:2 Aug 2018 作者: petri33
Post: I don't know if it is the difference in SMT (or HT) as Intel calls it or simply the difference in architecture between Ryzen and Intel Broadwell-E. But the Ryzens kick butt on the cpu tasks compared to the i7-6850K that runs 300 Mhz faster. I always heard that Intel had superior math performance. Sure not seeing it on my example. I was an affiliate of AMD CPUs some ten-twenty years ago. If I'm going to build a new system, I'll still consider the AMD alternative a viable option. My recent build needed some serious backup from the MOBO supporting 4xGPU with maximum PCIEx lanes. Things may change. -- |
|
132)
留言板 :
Number crunching :
To Hyperthread or not to Hyperthread, that is the question
(消息 1947602)
发表于:2 Aug 2018 作者: petri33
Post: Windows scheduler has affinity property that allows to pin process to particular logical CPU. So it's possible to group CPU processes on one real core while GPU processes allocate to anotehr real CPU core. Thank You Raistmer, and the Linux has a similar system too. Please do some find on site pages and with google too. If then still some problem we're glad to help you. P.R. #!/bin/bash for (( ; ; )) do schedtool -a 1,2,3,4 `pidof setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8` schedtool -a 1,2,3,4 `pidof ap_7.01r2793_sse3_clGPU_x86_64` schedtool -a 1,2,3,4 `pidof axo` schedtool -a 6,7,8,9,10,11 `pidof MBv8_8.22r3712_avx2_x86_64-pc-linux-gnu` schedtool -a 5 `pidof compiz` sleep 2 done |
|
133)
留言板 :
Number crunching :
Linux CUDA 'Special' App finally available, featuring Low CPU use
(消息 1947319)
发表于:1 Aug 2018 作者: petri33
Post: Hi, the 0.96 is still not finished. Like TBar said there are problems with Arecibo shorties. When we get them ironed out we will come up with V1.00 . I've got two weeks summer vacation left and I hope I have time to fix the app. Petri |
|
134)
留言板 :
Number crunching :
What is most power efficient SETI compute platform, for scaling?
(消息 1947007)
发表于:30 Jul 2018 作者: petri33
Post: A single TITAN V does 105 tasks/hour. (blc guppi vlar is 34 seconds). It uses typically 134W and on occasion 160W. |
|
135)
留言板 :
Number crunching :
To Hyperthread or not to Hyperthread, that is the question
(消息 1946804)
发表于:29 Jul 2018 作者: petri33
Post: I have 6 real cores and 6 HT cores. I run 6 CPU tasks and 4 GPU tasks and leave 2 cores free. |
|
136)
留言板 :
Number crunching :
GPU FLOPS: Theory vs Reality
(消息 1946601)
发表于:27 Jul 2018 作者: petri33
Post: I updated the OS to level 14 at about 2014. Ever since I have updated (manually) the GPU drivers about once a year. I download a new development platform (CUDA) whenever it is released. Since 2014 that has happened about once a year. To babysit a machine? No - never done that. To babysit a highly overclocked development machine? Yes. I try to do that every day. p.s. On vacations when I'm away from my computer I call for my daughters to help me out if my computer has any problem. They understand reset button, BIOS and Linux boot-up, sudo, tsch, bash, !nnn, <tab> ... Both of my +18y daughters went to Helsinki (400+ km south from here) to study general language, automation, artificial intelligence and robotics and other studies based on their personal interests. My two other youngsters are still under 12y. It seems (I can hear) they do know how they are going to spend their near future... |
|
137)
留言板 :
Number crunching :
GPU FLOPS: Theory vs Reality
(消息 1946516)
发表于:27 Jul 2018 作者: petri33
Post: i'm considering trading up 2 of my systems from dual 1060's (3gb) to single 1080ti's If on Linux, then your 1080Ti will do guppi vlars in 48 seconds with latest software. http://setiathome.berkeley.edu/workunit.php?wuid=3069924624 |
|
138)
留言板 :
Number crunching :
I've Built a Couple OSX CUDA Apps...
(消息 1944700)
发表于:16 Jul 2018 作者: petri33
Post: I have it in the app_info, but, it's not showing up on any of the machines. Looking at the confsettings.cpp, what else needs changing besides the g_pfFftLimit; Your offline benchmark run needs it in comlineoptions.txt. Just the 512 to 64. I need to make the executable to print the value it is using. Can you please email me the Arecibo shortie that is overflowing and its correct result.sah file to compare against to so that I do not have to run it on CPU first. |
|
139)
留言板 :
Number crunching :
I've Built a Couple OSX CUDA Apps...
(消息 1944686)
发表于:16 Jul 2018 作者: petri33
Post: A few numbers that may be of Interest; Nice improvement! Could you try with a new flag: -pfl 64 It sets a splitting point to pulse finding which one of the internal functions to use. I find the -pfl 64 give faster run times compared to default (no flag set equals to -pfl 512). I'll change that in confsettings.cpp to 64 in the future. Petri |
|
140)
留言板 :
Number crunching :
The Early Death of a Hackintosh!
(消息 1944685)
发表于:16 Jul 2018 作者: petri33
Post: Hi, how about the thermal paste on your CPU drying and during the power outage cooling down and cracking? Petri |
©2020 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.