Posts by petri33

1) Message boards : Number crunching : SETI orphans (Message 2036035)
Posted 5 Mar 2020 by Profile petri33
Post:
I'll go to a project that has an open source code for Nvidia GPUS and an off-line test suite similar to Seti@home.

Hey! I'm sure your optimizations will be greatly welcomed!!

To follow my own interests: Go Einstein@Home? ;-)


Keep searchin',
Martin


Hi,
Einstein seems to have open source code and it uses 45-88% GPU on my system.
Seems like it could need some optimizing. But but,... if it is OpenCL then I'd have to turn it to CUDA first ...
2) Message boards : Number crunching : SETI orphans (Message 2035673)
Posted 4 Mar 2020 by Profile petri33
Post:
I'll go to a project that has an open source code for Nvidia GPUS and an off-line test suite similar to Seti@home.
3) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119) (Message 2034135)
Posted 27 Feb 2020 by Profile petri33
Post:
I'm still convinced that somehow, whether it be intent or just net result, the higher your RAC is the lower you are in the priority stack in terms of actually getting work during a recovery from outage. This is entirely too consistent to be the luck of the draw.


+1
4) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2029073)
Posted 24 Jan 2020 by Profile petri33
Post:
It could be my "Seti Toaster"

To win a SETI Toaster you need to crunch at least 1 Billion credits. LOL


. . I don't think my current toaster will last that long :(

Stephen

:)


I hope you have had time to collect them/those for a bad toaster day. It is a short resource.
5) Message boards : Number crunching : Don't know where it should go? Stick it here! (Message 2029072)
Posted 24 Jan 2020 by Profile petri33
Post:
Well the leaks are out about the NVIDIA RTX 3070/3080

Now to see how long before they are officially out on the web.

Cheers


I'd wait for the non RTX variation. 2180Ti or smthng.

The reason: RTX2080Ti is slower than Titan V by 5% and the power usage is 50% more.

The Titan V runs cooler and uses far less power on Seti workload. Why? My guess: It does not have RTX cores (that are not used on 20x0 but are still fed power to).

So If NVIDIA would release a consumer market version of 2080Ti without RTX cores its wattage would put it in favor of many popular cards when running Seti.

My other guess is that NVIDIA did deliberately not offer a SW based solution to shut down RTX cores with consumer market prized cards (RTX2080TI and Titan) so that they can sell "professionally targeted" 3x priced scientific oriented server farm cards (Quadro etc.) that use far less power when compared to 2080Ti.

--
6) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024913)
Posted 25 Dec 2019 by Profile petri33
Post:
OK, I've applied Retvari's workround on my Linux Mint host running special sauce and the spoofed client. There's good news and bad news.

Good news (1): It's running
Good news (2): It's picking up the command line. -nobs isn't specifically acknowledged, but I've got an -unroll in there too, and that's reported.

Bad news: It's not received the instruction to run on device 0 / device 1

The device number is being passed correctly in init_data.xml, but the special sauce app isn't looking in the right place. I see Petri's app_info.xml doesn't contain an API version specifier, and it worked before, so I assume it's only listening for a command line. That's ancient, and should be corrected. Petri needs to compile against a newer BOINC API library and make the appropriate coding adjustments.

But at least my machine is running 2-up on device 0 - so at about half normal speed - and some warmth is creeping back into my workroom.


Hi,
anyone with compilation ability can make the special app to read the gpu number from init_data.xml.

in main.cpp you can define BOINC_MAJOR_VERSION to 8 just before the version check
...
    // Patch for Cuda device selection, Care of Juha Sointusalo.
    // Deals with boinc api 7.5 onwards breaking change (mid major version),
    // from standard use of command line to field not present on preferred earlier versions.
    // init_data.xml has e.g. <gpu_device_num>0</gpu_device_num>
#define BOINC_MAJOR_VERSION 8 
#if (BOINC_MAJOR_VERSION >= 8) || ((BOINC_MAJOR_VERSION == 7) && (BOINC_MINOR_VERSION >= 5))
    if (app_init_data.gpu_device_num >= 0) {
      gCUDADevPref = app_init_data.gpu_device_num + 1;
      fprintf(stderr, "app_init.xml specified GPU %d\r\n", app_init_data.gpu_device_num);
    }
#endif
...


I tested this and it says now on stderr
app_init.xml specified GPU 0
7) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024761)
Posted 24 Dec 2019 by Profile petri33
Post:
Muchas gracias Juan.
That is exactly what I was looking for.
To save time for everyone who hasn't seen it, I will reprint Richard Hasselgrove's earlier post:

____________________________________________________________________________________
Posted: 23 Dec 2019, 16:55:49 UTC

For general information

I've had a provisional whisper back from the project. As of about an hour ago, the provisional plan is to go through normal maintenance and backup tomorrow (Tuesday), and then revert the database changes so we can go back to running the old server code.

My gloss on that is that it probably means a longer outage, and an even worse than usual recovery, but everyone should be able to enjoy the rest of the holiday in peace. Most especially, the staff can relax. Sounds like a good idea to me.
ID: 2024655 · Report as offensive \
Posted: 23 Dec 2019, 16:55:49 UTC

For general information

I've had a provisional whisper back from the project. As of about an hour ago, the provisional plan is to go through normal maintenance and backup tomorrow (Tuesday), and then revert the database changes so we can go back to running the old server code.

My gloss on that is that it probably means a longer outage, and an even worse than usual recovery, but everyone should be able to enjoy the rest of the holiday in peace. Most especially, the staff can relax. Sounds like a good idea to me.
____________________________________________________________________________________


p.s. Thank you, Richard!


My wishes with you!

p.s.
After another call to Santa I can verify with the the following:
I'm going to replace some GPUs and it seems so I can have time to tune them up too.

Nightey Nightey! [nati nati]
8) Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118) (Message 2024531)
Posted 23 Dec 2019 by Profile petri33
Post:
Any oldtimers here remember that one Christmas season that seti was very down... was it a month?? two??
Was that the 1 where it all came crashing down in early December and didn't get fixed until late January, or something like that?

It might actually be a good plan for those of us running Anonymous to set No New Tasks until this gets figured out, in order to quit hammering the servers for those running stock who could actually get some work.
Personally, Einstein is running fine here until it's squared away and I just can't see breaking what works here to work around a far-end issue ... Just a thought.
. . Or you could do what I have done with the machines with no work, turn them off and save on power bills.
Stephen :)
The lack of heat being produced here is very helpful ATM with the very hot and smokey conditions we're experiencing, but I'd imagine that those in the other half of the world would prefer to have the heat on. ;-)

Cheers.


Instead of burning my GPUs with electricity I'll burn some wood in the fireplace and some rubber with spikes on the icy roads with my V60 D6.
Now I have time to sleep and replace my Seasonic 1250-X with a 1600W bla bla and throw out those two 1080's and one 1080Ti to make room for my presents.

Those old ones I'd hope go to East. Not south.
9) Message boards : Number crunching : 20th Anniversary T-shirts (Message 2024521)
Posted 23 Dec 2019 by Profile petri33
Post:
Hallo,
Ich habe den von fünf Jahre seit - die 2015.
Gern hätte Ich ein neues wenn das möglich isst.
Ich habe eine neue Addresse. Bitte PM.
--
Petri
10) Message boards : Number crunching : Seeing Linux GPU temperatures and getting alerts when things go south (Message 2024517)
Posted 23 Dec 2019 by Profile petri33
Post:
I agree with Z, the hybrids are perfect for crunching. With them you could easily keep the temps within a safe range even on hot & high humidity places like the one i live.

The only problem i have with the hybrids is with their pumps, i had 2 of them fail about 1 per year. They are hard to find here but when changed all return to work fine.


:)

Here is not humid nor warm. Still my GPUs go south. Or because of that.
11) Message boards : Number crunching : Seeing Linux GPU temperatures and getting alerts when things go south (Message 2024512)
Posted 23 Dec 2019 by Profile petri33
Post:
Hi,

whenever I develop a new improved version of the software I run into temperature problems. I'm running on air and flying there in the vincinity of upper limits of cooling.

Some times it just happens that even though my system has been running OK for some hours a temperature catastrophe hits when I'm away from my computer. One of my GPUs goes south and does not recover. It will either run slow or rapidly destroy my work queue.

I could take a look at your solution.

--
Petri
12) Message boards : Number crunching : New CUDA 10.2 Linux App Available (Message 2022278)
Posted 8 Dec 2019 by Profile petri33
Post:
The App is in the All-In-One package at the same location http://www.arkayn.us/lunatics/BOINC.7z
Make sure to read the README_x41p_V0.98.txt file in BOINC/projects/setiathome.berkeley.edu/docs.
The 10.2 App will need driver 440.33 or Higher, Ubuntu 15.04 or Higher, and Pascal or Higher GPU.
The 440.33 driver is available in the CUDA 10.2 ToolKit, you don't need to install the ToolKit, just the driver.
In My experience, the easiest way to install the nVidia Downloaded driver on a fresh Ubuntu install is to first install the highest Repository nVidia driver from Additional Drivers. Then download the 10.2 ToolKit from nVidia, place it in your Home folder, and make sure the Execute bit is set. Next reboot into Recovery Mode and enable Networking. Switch to Root mode, cd to your home folder, then remove the Repository driver by running, apt remove --purge nvidia* and then run, apt autoremove. Next install the Driver by running the installer and setting the options to just install the driver, ./cuda_10.2.89_440.33.01_linux.run
Depending on your HDD it may take a while to read and launch the installer, it's not long if using an SSD. Once the installer is finished, reboot.
It seems to work better the first time if you use the driver installer from the ToolKit, after that, running the downloaded standalone driver should work without any trouble. If the driver fails to work, simply boot back to Recovery mode and remove the downloaded driver by running, nvidia-uninstall
Have fun.


Yea! Just read this. Would have saved me the trouble of building it!!


Here it is. You can build the latest and port it to any platform.
https://drive.google.com/open?id=1ydIV4gv86_OERo6zF2qP7LNKVTMsN9BQ
13) Message boards : Number crunching : GPU FLOPS: Theory vs Reality (Message 2021981)
Posted 7 Dec 2019 by Profile petri33
Post:
That would guide the Santa to deliver the right present to the best behaving children and adults who do not like to obey M$ or other rules.
Or better yet port your efforts over to Windows to make more use of all that hardware.
Or maybe we should get a fundraiser together to hire a Nvidia Windows Cuda expert to work on it against Petri's Linux efforts. ;-)

That port could be done, but AFAIK the optimization will not be high as with Linux due the way the Windows works.
Anyway the original code is available if anyone wants to play with it and try this porting.


Hi,

My guess is that you'd get near the same speed-up on Windows too or even ...

a) Autopulse would use half the time it uses now.
b) Pulsefind would benefit greatly.
c) Doing a lot of the signal finding asynchronously would cut a lot of processing time and release the stress from the CPU.
(
d) Keeping or re-engineering Raistmers great work with SOG might make the Windows version even faster than the current Linux one.
e) Some, if not all, optimizations could be implemented on OpenCL to get a united codebase for 3rd party GPUs too.
)

I highly recommend someone with a big heart, a lot of time and some enthusiasm and knowledge of Windows platform development (and some courage) to take a step forward. I'll help and explain the mysteries of some of my optimizations when I can.

Petri.
14) Message boards : Number crunching : GPU FLOPS: Theory vs Reality (Message 2021859)
Posted 5 Dec 2019 by Profile petri33
Post:
Hi,

Reality is now that the top 200 are almost all Linux machines running with the special app. GPU FLOPS are of no interest as long as they are wasted.

If someone is willing to run a curiosity check on those machines and make a rough estimate of performance and wattage I'd be pleased.
The wattage can be taken from the manufacturer data or from a questionnaire from actual users running "nvidia-smi -l"

That would guide the Santa to deliver the right present to the best behaving children and adults who do not like to obey M$ or other rules.


Petri
15) Message boards : Number crunching : High performance Linux clients at SETI (Message 1996672)
Posted 3 Jun 2019 by Profile petri33
Post:
I'm still waiting for mine.
Ha ha ha LOL. :-> Me too!~
My last early birthday present to myself was 2yrs ago and that was the 2x 1060's in that rig. ;-)

Cheers.


Maybe I could do some toast in my PC CD tray? What else?
16) Message boards : Number crunching : Welcome to the 20 Year Club! (Message 1996664)
Posted 3 Jun 2019 by Profile petri33
Post:
I'll post this first. Then I'll read the messages.

I love my 15 years T-Shirt I received some years ago - I still wear it at special days at my work: As a teacher of math, physics, chemistry and computer related stuff (programming, meaningful usage and detecting false news). Sometimes it raises questions.

Thanks to the lovely lady and a far away friend I've never met. I still use the T-shirt.

It will wear out some day. I'm still the same size.

--
Petri33.


Sandstorm (Darude) and some other songs were playing from my speakers when working at SSH at those days.
17) Message boards : Number crunching : Welcome to the 20 Year Club! (Message 1996662)
Posted 3 Jun 2019 by Profile petri33
Post:
I'll post this first. Then I'll read the messages.

I love my 15 years T-Shirt I received some years ago - I still wear it at special days at my work: As a teacher of math, physics, chemistry and computer related stuff (programming, meaningful usage and detecting false news). Sometimes it raises questions.

Thanks to the lovely lady and a far away friend I've never met. I still use the T-shirt.

It will wear out some day. I'm still the same size.

--
Petri33.
18) Message boards : Number crunching : High performance Linux clients at SETI (Message 1996645)
Posted 3 Jun 2019 by Profile petri33
Post:
I can't remember . . . . . Wiggo did you state you were going to hold off adding the -nobs parameter after your RAC stabilized?
Yep, other than changing the checkpoints to 6mins I'll see how the "out of the box" goes first before doing any changes.

I must have my baselines set 1st Keith, which is also why my 3570K rig is still on Win7 and waiting for its bits of my early birthday present to myself. ;-)

Cheers.


My Social Demograpghic wiew sees a 2.5 times more RAC.

A nice move. No harm done.

Some would say it must be a green act. A "Sociogreenleftwingattack" against a free will to compute and a plot to shut down the whole idea of computing at free will with out a freedom to choose a 'greener' alternative and an intent to use less power to achieve a toaster a lot faster.

I'm still waiting for mine.
19) Message boards : Number crunching : Don't know where it should go? Stick it here! (Message 1995831)
Posted 29 May 2019 by Profile petri33
Post:
Hi,

This is an issue with seti.

Consider a multiple of scenarios: Sum up 32768 values A) sequentially B) in sequences of length N using middle sums C) pairwise bottom up (recursive) level first.
a) when all are 'small'
b) when all are 'big'
c) when there are a lot more small values than 'big' ones
d) when there are a lot more big ones than 'small' ones
e) when there are first a lot of 'big' ones
f) when there are first a lot of 'small' ones
g) when the sum exceeds the computational representation to make any difference if anything small is added to.

1. Make a matrix of your answers.

Then there is a hunt for the greatest sum of the individual values sampled in "various ways*"

At some point the Sum is used to divide all values that were summed up.

2. What kind of errors you may encounter depending of the input?
3. How would you deal with it?
4. Which kind of an input would you define as 'hard to compute'?
5. Do those kind of input (noisy data) have any scientific value?
6. How about a 'flat' noisy as usual input with small variance?

--

Petri

*Various ways would be a lengthy lesson of how Seti computes what is a pulse.


For those that are curious, I found an interesting PDF in relation to GPU processing on Nvidia GPUs
Precision & Performance:Floating Point and IEEE 754 Compliance for NVIDIA GPUs

Mathematically, (A + B)+ C does equal A + (B + C).
Let rn(x) denote one rounding step on x. Performing these same computations in single precision floating
point arithmetic in round-to-nearest mode according to IEEE 754, we obtain:
            A + B = 21 × 1:1000000000000000000000110000 : : :
        rn(A + B) = 21 × 1:10000000000000000000010
            B + C = 23 × 1:0010000000000000000000100100 : : :
        rn(B + C) = 23 × 1:00100000000000000000001
        A + B + C = 23 × 1:0110000000000000000000101100 : : :
rn(rn(A + B) + C) = 23 × 1:01100000000000000000010
rn(A + rn(B + C)) = 23 × 1:01100000000000000000001

For reference, the exact, mathematical results are computed as well in the table above. Not only are the results computed according to IEEE 754 different from
the exact mathematical results, but also the results corresponding to the sum rn(rn(A + B) + C) and the sum
rn(A + rn(B + C)) are different from each other. In this case, rn(A + rn(B + C)) is closer to the correct mathematical result than rn(rn(A + B) + C).
This example highlights that seemingly identical computations can produce different results even if all basic
operations are computed in compliance with IEEE 754.

Here, the order in which operations are executed affects the accuracy of the result. The results are independent
of the host system. These same results would be obtained using any microprocessor, CPU or GPU, which
supports single precision floating point.


As we have shown in Section 3, the final values computed using IEEE 754 arithmetic can depend on implementation choices
such as whether to use fused multiplyadd or whether additions are organized in series or parallel. These differences affect computation on the CPU
and on the GPU.
One example of the differences can arise from differences between the number of concurrent threads involved in a computation.
On the GPU, a common design pattern is to have all threads in a block coordinate to do a parallel reduction on data within the block,
followed by a serial reduction of the results from each block. Changing the number of threads per block reorganizes the reduction; if the reduction is addition, then
the change rearranges parentheses in the long string of additions.


Computing results in a high precision and then comparing to results computed in a lower precision can be
helpful to see if the lower precision is adequate for a particular application. However, rounding high precision
results to a lower precision is not equivalent to performing the entire computation in lower precision. This can
sometimes be a problem when using x87 and comparing results against the GPU. The results of the CPU may
be computed to an unexpectedly high extended precision for some or all of the operations. The GPU result
will be computed using single or double precision only.
20) Message boards : Number crunching : High performance Linux clients at SETI (Message 1995830)
Posted 29 May 2019 by Profile petri33
Post:
Hi,

Thanks to those saying "a way to GO TBar & petri33 !!"

This is what Wiggo's machine is doing now:



You can deduce and add some to get the picture what kind of RAC it will produce.
--
Petri


Next 20


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.