Posts by Raistmer


log in
1) Message boards : Number crunching : Nvidia Driver kernal something stopped responding something recovered (Message 1799385)
Posted 2 hours ago by Profile Raistmer
strange thing, i changed the drivers, set pcie bus frequency lower, but
the SoG workunits, since Lunarics 0.45 BETA newest release.

Crunshing stop, and display drivers recovers permanently.

Known issue ?

http://setiathome.berkeley.edu/forum_thread.php?id=79760
2) Message boards : Number crunching : GPU stoped during crunching (Message 1799384)
Posted 2 hours ago by Profile Raistmer
http://setiathome.berkeley.edu/forum_thread.php?id=79760
3) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1799329)
Posted 10 hours ago by Profile Raistmer
faster version of timing extraction script:

$path="stderr.txt"; $results="times_iterations.txt"; open (RES, ">".$results); open (IN, $path); print RES "exec_time\titerations\n"; while (<IN>) { if(/Partial PulseFind_3(.*)Awaited (\d+) iterations/){ $iter=$2; } else{ if(/Kernel PULSE_PARTIAL execution time: (\d+\.\d+)/ || /Kernel PULSE_PARTIAL execution time: (\d+)/ ){ print RES $1."\t".$iter."\n"; }} }
4) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1799327)
Posted 10 hours ago by Profile Raistmer
here is the link on updated builds (r3480): https://cloud.mail.ru/public/4g4V/Gt278t2ku
5) Message boards : Number crunching : Profiling AMD/ATI OpenCL systems (Linux) (Message 1799326)
Posted 10 hours ago by Profile Raistmer

How does one profile this, to find out where the choke points are?

In terms of coding, I prefer FORTRAN, but I have done C, C++ and others.

SETI app written on C/C++/OpenCL/CUDA(for NV) so sorry no use of FORTRAN here.
Sources available here: https://setisvn.ssl.berkeley.edu/svn/branches/sah_v7_opt
6) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1799324)
Posted 10 hours ago by Profile Raistmer
Raistmer,

I'm starting to see these errors on some of the SoG work units

ERROR: Possible wrong computation state on GPU, host needs reboot or maintenance
GPU device sync requested... ...GPU device synched
09:07:29 (7932): called boinc_finish(-1)


Are these the same errors we saw on Beta?
http://setiathome.berkeley.edu/result.php?resultid=5010738737
http://setiathome.berkeley.edu/result.php?resultid=5010028516

Zalster

As Richard said this should go into this thread: http://setiathome.berkeley.edu/forum_thread.php?id=79760

Yes, it's false positive from autocorr sanity check we saw on beta.
New builds (will be r3480 and up) have this particular sanity check disabled.
7) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1799322)
Posted 11 hours ago by Profile Raistmer
There is new set of RC builds available here: https://cloud.mail.ru/public/8RM1/LMYTwvGYp
If you experience any issues with v8.12 please try new RC build instead.


Here's a WU done with the new app. http://setiathome.berkeley.edu/result.php?resultid=5010274940

The output on this task looks a lot different than before, can't even see which GPU was used. I see quite a lot of these.

On the other hand here's another task that the output looks normal: http://setiathome.berkeley.edu/result.php?resultid=5010142578

Thanks for report.
Looks like leftover from increased verbosity build. I'll do rebuild.
8) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1799246)
Posted 1 day ago by Profile Raistmer
There is new set of RC builds available here: https://cloud.mail.ru/public/8RM1/LMYTwvGYp
If you experience any issues with v8.12 please try new RC build instead.
9) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1799244)
Posted 1 day ago by Profile Raistmer

. . Can you offer any suggestions ?

Currently available tunings listed in ReadMe file.
Due to massive changes in PulseFind area new builds behavior will differ so new tunings could be needed in sleeping and pulsefind area (in particular, -use_sleep expected to be more powerful and to give less impact on performance while saving CPU cycles for NV app flavour).
10) Message boards : Number crunching : Is it possible to swap a guppi assigned to GPU with a Arecibo assigned to CPU? (Message 1799202)
Posted 1 day ago by Profile Raistmer
I would say there are quite many urban myths around credit area currently.
Local re-scheduling (that is, inside single host) much more preferable than "global re-scheduling" that constitutes task abortion en masse.
So, if robust re-scheduling tool will exist it would be good.
BTW, old Fred's re-scheduler most probably coul dbe configured for that too. It has configurable tab.
11) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1798982)
Posted 2 days ago by Profile Raistmer
Sleeping behavior greatly reworked.
I updated corresponding post about this option( http://lunatics.kwsn.info/index.php/topic,1808.msg60933.html#msg60933 ).
New builds to test usability of new approach to sleep will be awailable soon, stay tuned.
12) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1798887)
Posted 2 days ago by Profile Raistmer
. . I can change back to SoG with 0.45 installer Beta(3) but how do I make it use this app instead of the included r3472? And where do I add the script file?

If you have Perl it can be used for speedup data extraction from stderr.txt.
If not do it by hands.
here http://lunatics.kwsn.info/index.php?action=downloads;sa=view;down=497 I put some small Perl interpreter in pack.
13) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1798886)
Posted 2 days ago by Profile Raistmer
This one implies reboot indeed.
http://setiathome.berkeley.edu/result.php?resultid=4955880255
Others are due data currently processing. This sanity check will be removed in next version.
14) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1798440)
Posted 5 days ago by Profile Raistmer
Here is NV sibling of posted earlier HD5 ATi build.
https://cloud.mail.ru/public/HUAE/soM11FDVh

Please look this post for info what to do with it.

So far I found that on my C-60 changing -use_sleep_ex N from 1 to 4 including almost doesn't change real sleep time. It remains ~15ms.

How it will react on CPU load and priority change - to be explored.

P.S. here is small Perl script for relevant data extraction from stderr.txt:

$path="stderr.txt"; $results="times_iterations.txt"; open (RES, ">".$results); open (IN, $path); while (<IN>) { if(/Partial PulseFind_3(.*)Awaited (\d+) iterations/){ @iterations=(@iterations,$2); } if(/Kernel PULSE_PARTIAL execution time: (\d+\.\d+)/ || /Kernel PULSE_PARTIAL execution time: (\d+)/ ){ @exec_time=(@exec_time,$1); } } print RES "excec_time\titerations\n"; foreach $iter (@iterations){ print RES $exec_time[$i]."\t".$iterations[$i]."\n"; $i++; }
15) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1798439)
Posted 5 days ago by Profile Raistmer
They are downloadable, thanks.
16) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1798211)
Posted 5 days ago by Profile Raistmer
For the TDR registry, should I specify only the TdrDelay value or should I create all of those mentioned on the Microsoft page you linked? Currently I cannot find any of those in my registry.

You can experiment with them. From memory disabling TDR completely via first key didn't work as it should.
Better attempt to tune app first. It worked on GT720 so should be doable.
17) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1798173)
Posted 6 days ago by Profile Raistmer
So try to configure app to avoid driver restarts. Driver restart is abnormal situation that doesn't properly handled by NV runtime.

http://setiathome.berkeley.edu/forum_thread.php?id=79760&postid=1795582
https://msdn.microsoft.com/en-us/library/windows/hardware/ff569918%28v=vs.85%29.aspx?f=255&MSPPError=-2147217396
18) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1798145)
Posted 6 days ago by Profile Raistmer

- The SoG application does not terminate if you stop Boinc, you have to kill the process manually to recover the WU, otherwise it's locked up and cannot be accessed when you restart Boinc

BOINC restart was done after driver restart or in usual conditions?
19) Message boards : Number crunching : SETI@home v8.12 Windows GPU applications support thread (Message 1798144)
Posted 6 days ago by Profile Raistmer


This is what I have noticed about these hangups:
- They happen when driver was not responding and was restarted (but not at every restart)
don't know is the SoG application terminated in this situation.

It's the reason of hangup. Depending on runtime, app never receives error code for broken OpenCL context. In this case it will never return.



I have now reduced -period_iteration_num to 35 to see when the driver restarts go away.

Param should be increased not reduced. Default is 50. Task you list has 30. It can be the reason of driver restart. Try to set it let say to 100.
20) Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help (Message 1797965)
Posted 7 days ago by Profile Raistmer
It's "verbose" one so don't expect amazing performance from it but it can tell you smth new about your system.


If that makes it slow, would it not also affect the figures, it is supposed to report?

No. I doesn't. It make it slow overall due to added output overhead. But each kernel call executes on same speed as before so you get correct info about kernel execution times.


Next 20

Copyright © 2016 University of California