Posts by petri33

101) Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users (Message 1950750)
Posted 19 Aug 2018 by Profile petri33
Post:
`GLIBCXX_3.4.21' is required. See stderr from your fauled tasks.
102) Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users (Message 1950700)
Posted 19 Aug 2018 by Profile petri33
Post:
Sounds right. Tell us how it went.
103) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950669)
Posted 19 Aug 2018 by Profile petri33
Post:
Petri, how about posting your Current CUDA app_info.xml section so we can see what You are using? Similar to what I posted. I believe that's what Keith is looking for.


That was my app_info.xml and app_config.xml.
I have an ancient executable name there. I always copy my latest over the previous executable file.
I do not change my app_info unless I need to.

That is why I said this is an example app_info.xml. It works, but you have to copy any executable you use over the original cuda 65 bla bla executable.
104) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950636)
Posted 19 Aug 2018 by Profile petri33
Post:
This is a Linux compilation for sm_50, sm_52, sm 61 and sm_70. It may run with GTX750. I do not have one to test with. It is statically linked and I guess it will run with suitably recent 39X.xx driver. No dev pack installation required. Try and tell.

It is a bit slower on 10x0 than the previous one because the 750 and 9x0 have less shared memory than 10x0.
multiversion: https://drive.google.com/open?id=1rKaagzCyisp-IYHZrX4hwpfEKPoiKI-h
10x0&Volta: https://drive.google.com/open?id=1pe6-p5zn27tXFvvszyGCzo0OfCkkCqDt

Link to source was posted earlier in this thread.
105) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950625)
Posted 18 Aug 2018 by Profile petri33
Post:
in the app_info there are 'empty' parts
    <file_info>
      <name></name>
    </file_info>
    <file_info>
      <name></name>
    </file_info>

I do not know, but the may confuse boinc..
... and to the 7) the names may be old or something. Leftovers from trying to build a linux executable years ago. Definitely nothing new.

I hope you get your system to work.
Could someone else help, since it is bed time in this side of the world (2:30 AM).

Petri
106) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950619)
Posted 18 Aug 2018 by Profile petri33
Post:
Hi Petri, then color me very CONFUSED. I downloaded the file from your link. It is named setiV0.97.linux_x64_10x0. So that is what I changed the appname to setiV0.97.linux_x64_10x0 in the file info and app version defines.

I look at your app_info example and see no sign of the 0.97 app name setiV0.97.linux_x64_10x0. I see instead setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8.

So is the real name of the 0.97 app setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8?

No wonder I dumped all the work if the name definition is not the name of the real executable name.

Also should I remove the libcudart.so.9.0 and libcufft.so.9.0 file defines for the zi3v app? Or just leave them in there and they will cause no harm. I expect they have to be removed. Also I see the -nobs still in there. I though someone said that is no longer effective for the static linked apps?

More clarification please.



1) You changed the names in the app_config.xml. Good. Check for any typos, spaces, ... just in case.
2) Did the unzip set the file executable bit on? You can do that many ways. I use: chmod ugo+x setiV0.97.linux_x64_10x0
3) When the executable file got deleted at boinc restart I suppose you copied/unzippped it back.
4) The executable can have any name. I copy mine over the years old setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8 named file in the boinc/seti folder so I do not have to change app_info or anything. Just suspend. copy. resume. Most people, however, modify app_info.xml to match the executable.
5) You can delete the old lib files from boinc/seti folder. They can be left there as well.
6) -nobs speeds things up if you have spare cores. It makes the process (CPU) to wait in a spin loop for the GPU to respond. Without -nobs the process waits an event in 'sleeping' state and the CPU is free to run other jobs meanwhile.
107) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950610)
Posted 18 Aug 2018 by Profile petri33
Post:
Here is an example of an app_info.xml file.
The <file info> part must state the name of the executable so it does not get deleted when boinc starts.
The same name is in the <app version> part <file name> xxx </file name>
Sometimes it is easiest to suspend boinc. copy a new executable over the old one and keep the old file name. And resume computing.

<app_info>
  <app>
    <name>astropulse_v7</name>
  </app>
  
  <file_info>
    <name>ap_7.01r2793_sse3_clGPU_x86_64</name>
    <executable/>
  </file_info>
  
 
  <app_version>
    <app_name>astropulse_v7</app_name>
    <version_num>708</version_num>
    <platform>linux_x86_64</platform>
    <plan_class>opencl_nvidia_100</plan_class> 
    <cmdline> -verb -st -nog -unroll 80 -ffa_block 2304 -ffa_block_fetch 1152 -oclFFT_plan 256 16 256 </cmdline> 
    <coproc> 
      <type>NVIDIA</type> 
      <count>0.33</count> 
    </coproc>
    <file_ref>
      <file_name>ap_7.01r2793_sse3_clGPU_x86_64</file_name>
      <main_program/>
    </file_ref>
  </app_version>

  
  <app>
    <name>setiathome_v8</name>
  </app>
  
  <file_info>
    <name>MBv8_8.22r3712_avx2_x86_64-pc-linux-gnu</name>
    <namex>MBv8_8.05r3345_avx_linux64</namex>
    <executable/>
  </file_info>
  
  <file_info>
    <name>setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8</name>
    <executable/>
  </file_info>
  
  <app_version>
    <app_name>setiathome_v8</app_name>
    <version_num>800</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>1.000000</avg_ncpus>
    <max_ncpus>1.000000</max_ncpus>
    <file_ref>
      <file_name>MBv8_8.22r3712_avx2_x86_64-pc-linux-gnu</file_name>
      <main_program/>
    </file_ref>
  </app_version>

  <app_version>
    <app_name>setiathome_v8</app_name>
    <version_num>801</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>1.000000</avg_ncpus>
    <max_ncpus>1.000000</max_ncpus>
    <file_ref>
      <file_name>MBv8_8.22r3712_avx2_x86_64-pc-linux-gnu</file_name>
      <main_program/>
    </file_ref>
  </app_version>
  
  <app_version>
    <app_name>setiathome_v8</app_name>
    <version_num>804</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>1.000000</avg_ncpus>
    <max_ncpus>1.000000</max_ncpus>
    <file_ref>
      <file_name>MBv8_8.22r3712_avx2_x86_64-pc-linux-gnu</file_name>
      <main_program/>
    </file_ref>
  </app_version>   

  <app_version>
    <app_name>setiathome_v8</app_name>
    <version_num>808</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>0.1</avg_ncpus> 
    <max_ncpus>0.1</max_ncpus> 
    <plan_class>nvidia_gpu</plan_class> 
    <cmdline> -nobs -pfb 32 </cmdline> 
    <coproc> 
      <type>NVIDIA</type> 
      <count>1.00</count> 
    </coproc>
    <file_ref>
      <file_name>setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8</file_name>
      <main_program/>
    </file_ref>
  </app_version>

  <app_version>
    <app_name>setiathome_v8</app_name>
    <version_num>809</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>0.1</avg_ncpus> 
    <max_ncpus>0.1</max_ncpus> 
    <plan_class>opencl_nvidia_sah</plan_class> 
    <cmdline> -nobs -pfb 32  </cmdline> 
    <coproc> 
      <type>NVIDIA</type> 
      <count>1.00</count> 
    </coproc>
    <file_ref>
      <file_name>setiathome_x41zc_x86_64-pc-linux-gnu_cuda65_v8</file_name>
      <main_program/>
    </file_ref>
  </app_version>
  
</app_info>



An here is an example of a app_config.xml
<app_config>
  <xproject_max_concurrent>10</xproject_max_concurrent>
  
  <app>
    <name>astropulse_v7</name>
    <max_concurrent>8</max_concurrent>
    <gpu_versions>
      <gpu_usage>0.50</gpu_usage>
      <cpu_usage>0.125</cpu_usage>
    </gpu_versions>
  </app>

  <app>
    <name>setiathome_v8</name>
    <xmax_concurrent>10</xmax_concurrent>
    <gpu_versions>
      <gpu_usage>1.0</gpu_usage>
      <cpu_usage>0.1</cpu_usage>
    </gpu_versions>
  </app>
  
  <app_version>
    <app_name>astropulse_v7</app_name>
    <plan_class>opencl_nvidia_100</plan_class>
    <cmdline> -verb -st -nog -unroll 80 -sbs 2048 -ffa_block 2304 -ffa_block_fetch 1152 -oclFFT_plan 256 16 256 </cmdline>
  </app_version>

  <app_version>
    <app_name>setiathome_v8</app_name>
    <plan_class>opencl_nvidia_sah</plan_class>
    <cmdline> -pfb 32 -nobs -pfl 64 </cmdline>
  </app_version>

  <app_version>
    <app_name>setiathome_v8</app_name>
    <plan_class>nvidia_gpu</plan_class>
    <cmdline> -pfb 32 -nobs -pfl 64 </cmdline>
  </app_version>
  
 </app_config>
108) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950582)
Posted 18 Aug 2018 by Profile petri33
Post:
Try this on Linux and 10x0
executable https://drive.google.com/open?id=1pe6-p5zn27tXFvvszyGCzo0OfCkkCqDt
source https://drive.google.com/open?id=17Djj2E8Pxcd7k2WouYBskfPGrtxwjvFO
109) Message boards : Number crunching : all 14au18ac GPU tasks on one system running 3 seconds? (Message 1950458)
Posted 18 Aug 2018 by Profile petri33
Post:
It is a known problem. I'm working on it.
I did not have a WU in my test cases that would have revealed it.
Now I grabbed one.
I'll work on it tomorrow.
110) Message boards : Number crunching : Next Lunatics? (Message 1949088)
Posted 11 Aug 2018 by Profile petri33
Post:
Happy B-Day, Petri! :-)

Mine was June 30th, I'm 51 now... BUT, I'm in the "Forever 16 Club!" ;-)

1984 was a VERY good year. :-)


TL


And again. Thank you all! : TL04, Stargate, CliffH, betreger, big Z, Bill G, Stephen, ... and all.
111) Message boards : Number crunching : Next Lunatics? (Message 1949087)
Posted 11 Aug 2018 by Profile petri33
Post:
Happy B-Day Petri, you make me feel like an olde man at 73!

Well you are, I'll be 73 next month.

Not all that old, I am still crunching at 77.
Happy B-day Petri


I respect THAT! Thanks.
112) Message boards : Number crunching : Next Lunatics? (Message 1949085)
Posted 11 Aug 2018 by Profile petri33
Post:

I still measure time in years. Today is my 50th birthday.
Petri

Happy Birthday! All the best!!!


Thanks R.
113) Message boards : Number crunching : 27 Computation Errors? (Message 1949084)
Posted 11 Aug 2018 by Profile petri33
Post:
Is it possible that you are having a heat issue? That is something that will cause errors like this.
Hoping you are not OverClocking???


Heat issue. My guess too.
+1
114) Message boards : Number crunching : NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux (Message 1949083)
Posted 11 Aug 2018 by Profile petri33
Post:
I see I could have used Petri's new utility just now. I just trashed all 300 gpu tasks on one system because the Compute Cache got corrupted. I was too aggressive on a memory overclock I guess and it trashed the work and froze the system probably on the transition for P2 to P3. Backed off my memory overclock to what was stable previously. Lesson learned.


Well,
That was the purpose to make and release the utility.
Now I take my friend Partagas (from Cuba) to the hot tub.

The night has settled and the atmosphere certainly feels like the moment to inhale some real Cuban night air. It is clowdy so you may have a better chance to appreciate the Perseids.

From the feelings of a half of a Century ahead. Good Night!

--
Petri

p,s, I got some really good stuff from TBar. I'm running them Wus right now. Those are Bug Revealing!! Like a birthday present should be!. Thanks.
115) Message boards : Number crunching : Next Lunatics? (Message 1948826)
Posted 10 Aug 2018 by Profile petri33
Post:
Thanks every one.

To the shortening of years I an explanation found have: you compare a year to your lived life. Each year is less and less of the total lived. My year is now 1/50. Next year is a bit less. It is 1/51 etc.
116) Message boards : Number crunching : Next Lunatics? (Message 1948736)
Posted 10 Aug 2018 by Profile petri33
Post:
Well! Happy Birthday Petri! Just wanted to say how much we crunchers appreciate our volunteer app developers. That goes for Raistmer and TBar too and all the supporting volunteers who test and suggest improvements.


Thank you Keith,
and all other too who feel like congratulating. Let's keep the thread clean.

Thanks.
117) Message boards : Number crunching : Next Lunatics? (Message 1948734)
Posted 10 Aug 2018 by Profile petri33
Post:
If anything is found that part is done sequentially on one SM to ensure correctness.

Petri

Hi, Petri

Same I use in OpenCL. If this allow to pass test cases collected so far I would say bugfix applied. Of course sequential scan slows things down but it's not a bug as long as valid results are provided.

Regarding signal collecting on GPU - pulses are hard die at least for me :) If I recall right, pulses and triplets are the ones that not "on GPU" in Signals on GPU (SoG) build.
So I wish you have more time too! Would be glad to port CUDA solution to OpenCL if any ;)
Perhaps the key is reordering on GPU too as you say. In SoG I just gather signals and final reordering is done on CPU. And that's not quite appropriate for Pulses. Maybe smth else too but at time I actively worked on SoG Pulses resisted %).


And I messed up my software earlier today so if my inconclusives is climbing up, I'll revert back to the old. I'll check that in the morning. Now at 4:10+ am I'm going to sleep.
118) Message boards : Number crunching : Next Lunatics? (Message 1948733)
Posted 10 Aug 2018 by Profile petri33
Post:
If anything is found that part is done sequentially on one SM to ensure correctness.

Petri

Hi, Petri

Same I use in OpenCL. If this allow to pass test cases collected so far I would say bugfix applied. Of course sequential scan slows things down but it's not a bug as long as valid results are provided.

Regarding signal collecting on GPU - pulses are hard die at least for me :) If I recall right, pulses and triplets are the ones that not "on GPU" in Signals on GPU (SoG) build.
So I wish you have more time too! Would be glad to port CUDA solution to OpenCL if any ;)
Perhaps the key is reordering on GPU too as you say. In SoG I just gather signals and final reordering is done on CPU. And that's not quite appropriate for Pulses. Maybe smth else too but at time I actively worked on SoG Pulses resisted %).


I still measure time in years. Today is my 50th birthday.
I have a long time to develop and enhance any code and project I find interesting.
Today it is Seti CUDA and the next day it will be ... well the future is Open.

Petri
119) Message boards : Number crunching : Next Lunatics? (Message 1948571)
Posted 9 Aug 2018 by Profile petri33
Post:
But based on comments alluded by Stephen and what I remember, our Windows app developer Raistmer said it would be almost impossible to use the exiting CUDA9 Linux app code source for any Windows app simply because the mechanisms the Linux app uses isn't available in the Windows environment.

Ups, hardly I said so :)
What I could say it's no Windows build until sync bug would be resolved (is it?).
CUDA is quite OS-independent AFAIK though I dealt only with older versions.


Hi Raistmer,
sync-bug as you call it is still there but with less impact. I scan the pulses with all SM units running. If anything is found that part is done sequentially on one SM to ensure correctness.

I wish I had more time too. If I did I'd implement signal collecting/reordering on GPU.

Petri
120) Message boards : Number crunching : NVIDIA P0, P2 states and overclocking 1080, 1080Ti and VOLTA in Linux (Message 1948280)
Posted 7 Aug 2018 by Profile petri33
Post:
Just a quick, little bit out in the weeds question, what is the issue running P0 in Linux on the latest 10x0 series cards? Is it a driver issue, an OS issue, or a software utility issue which won't allow it to be set? And, what is needed for a solution, Nvidia, Linus, or the utility developer to get on it? And the likelihood of it happening any time soon? Don't know much about it, other than I also thought I heard that P0 is always the best way to go, obviously if you can set it, that is.


P0 would be best but NVIDIA has decided that consumer grade cards (the high end of them) will not be allowed to run at P0 with compute loads. Gaming (graphic load) is allowed.
It is a driver limit imposed by NVIDIA.

EDIT (addition): And even now with the ability to hold P2 during computational pauses the P2 with higer clocks may still have some RAM wait states/lower bus rate/reduced other timigs or something that makes the P2 inferior to real P0.


Previous 20 · Next 20


 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.