Setting up Linux to crunch CUDA90 and above for Windows users

Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 162 · Next

AuthorMessage
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1865671 - Posted: 5 May 2017, 8:28:50 UTC - in response to Message 1865662.  
Last modified: 5 May 2017, 8:45:00 UTC

You can get your copy of zi3t2b executable here
Look at it carefully. The name is different than your present App. You will need to copy the name and paste it in the two lines of the app_info, replacing the existing name with the new name.

To deal with Inconclusives you must convert them to Net from Gross. Generally Instant Overflows aren't counted as they are commonly different, and they are Not used by SETI since they are usually less than 10 seconds long and just a fraction of the file. Obviously Bad Wingmen are not counted as you don't have any control over how many Bad Wingpeople the Server decides to send you. I've had the Server hook me up with twenty or more tasks in a row from an Obvious Bad host. You need to deduct tasks from Bad Wingpeople and tasks where both Hosts finished in a few seconds with an Overflow. Then you can calculate your Inconclusive rate.
ID: 1865671 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1865695 - Posted: 5 May 2017, 12:05:10 UTC - in response to Message 1865671.  

You can get your copy of zi3t2b executable here
Look at it carefully. The name is different than your present App. You will need to copy the name and paste it in the two lines of the app_info, replacing the existing name with the new name.

. . Thanks I have downloaded it for later...

To deal with Inconclusives you must convert them to Net from Gross. Generally Instant Overflows aren't counted as they are commonly different, and they are Not used by SETI since they are usually less than 10 seconds long and just a fraction of the file. Obviously Bad Wingmen are not counted as you don't have any control over how many Bad Wingpeople the Server decides to send you. I've had the Server hook me up with twenty or more tasks in a row from an Obvious Bad host. You need to deduct tasks from Bad Wingpeople and tasks where both Hosts finished in a few seconds with an Overflow. Then you can calculate your Inconclusive rate.


. . Well I have looked through my 26 inconclusives and 6 of them are instant overflows, or thereabouts, and only one of them is zi3t2b, the others are hangovers from zi3k+ as I suspected. Since I have only processed a few hundred tasks with zi3t2b so far and have no clear idea of how many have been validated, I will have to wait for the hangovers to clear before I have useful info.

. . But only one out of several hundred is a good sign at this point.

Stephen

:)
ID: 1865695 · Report as offensive     Reply Quote
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1865730 - Posted: 5 May 2017, 15:58:11 UTC - in response to Message 1865633.  

I just got my cruncher back up and running after two days of installing a couple more 1070s. I also installed new EK water blocks on all four 1070s, a new radiator, pump, reservoir and cooling loop. After a successful 24 hour leak test, I'm back crunching again. I've been running all day without issue. At the moment, I'm running 1 WU per GPU with 1.25 CPU threads per WU as well as 5 CPU threads crunching WUs. CPU and GPU temperatures are all under 46 degrees after running for 12+ hours. I'm pretty happy with it.

I appreciate all the help!

scocam


Thanks for choosing the right OS and SW to test with. https://setiathome.berkeley.edu/show_host_detail.php?hostid=8257416
Let's all wait for the results!

Thanks!

Petri
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1865730 · Report as offensive     Reply Quote
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1865731 - Posted: 5 May 2017, 16:00:32 UTC - in response to Message 1865695.  

....'ve no clear idea of how many have been validated, I will have to wait for the hangovers to clear before I have useful info.

. . But only one out of several hundred is a good sign at this point.

Stephen

:)


Yesh,
Normally that ish a good ideah. And all the good sings are appreciated.

Petri :)
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1865731 · Report as offensive     Reply Quote
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1865732 - Posted: 5 May 2017, 16:07:00 UTC - in response to Message 1865671.  
Last modified: 5 May 2017, 16:07:22 UTC

Thank You TBar!

This is the most short and valid (shortest and best) explanation.

You can get your copy of zi3t2b executable here
Look at it carefully. The name is different than your present App. You will need to copy the name and paste it in the two lines of the app_info, replacing the existing name with the new name.

To deal with Inconclusives you must convert them to Net from Gross. Generally Instant Overflows aren't counted as they are commonly different, and they are Not used by SETI since they are usually less than 10 seconds long and just a fraction of the file. Obviously Bad Wingmen are not counted as you don't have any control over how many Bad Wingpeople the Server decides to send you. I've had the Server hook me up with twenty or more tasks in a row from an Obvious Bad host. You need to deduct tasks from Bad Wingpeople and tasks where both Hosts finished in a few seconds with an Overflow. Then you can calculate your Inconclusive rate.



Pci Evga Teraherz Ryzen Interedted 3.3x.
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1865732 · Report as offensive     Reply Quote
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1865736 - Posted: 5 May 2017, 16:27:35 UTC - in response to Message 1865730.  

Yeap those 4 x 1070's on Linux will definitely put that machine up in the top 4 computers at SETI.
I wonder how long it will take to level out ...
ID: 1865736 · Report as offensive     Reply Quote
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1865739 - Posted: 5 May 2017, 16:58:10 UTC - in response to Message 1865736.  

Yeap those 4 x 1070's on Linux will definitely put that machine up in the top 4 computers at SETI.
I wonder how long it will take to level out ...


This should tell after next Tuesday...
http://stats.free-dc.org/stats.php?page=hostbycpid&cpid=d9e29890eb1042129f24e63d82ccb65b

The machine is too 'new' now.
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1865739 · Report as offensive     Reply Quote
Profile scocam
Avatar

Send message
Joined: 28 Feb 17
Posts: 27
Credit: 15,120,999
RAC: 0
United States
Message 1865742 - Posted: 5 May 2017, 17:29:53 UTC - in response to Message 1865659.  

Thank you for taking a look into these errors, TBar! The errors seem to have calmed down as of this morning but the amount of inconclusives was startling at first though most seem to be genuinely inconclusive. I think it's that I'm crunching through so many WUs that the amount was a bit of a shock at first glance. But this cruncher, after only having been in commission for a few days, has almost caught-up to my Windows 10 machine that's been online for the 2 months!

I have noticed a few random restarts on this Ubuntu 14.04 machine since standing it up. I'll need to track down the source of these faults and get that under control to set my mind at ease. I think I'll tackle that during my Tuesday maintenance window. I've never had random restarts with Ubuntu before but I'm dual-booting this with a Windows 10 install which I've never done before. Both seem happy but Ubuntu has me wondering if it doesn't like playing nicely with Windows. I chose to dual-boot so I could do initial system monitoring and load GPU BIOS updates with Windows. Other than that, it just takes up space on the disk.


scocam
ID: 1865742 · Report as offensive     Reply Quote
Profile scocam
Avatar

Send message
Joined: 28 Feb 17
Posts: 27
Credit: 15,120,999
RAC: 0
United States
Message 1865743 - Posted: 5 May 2017, 17:36:55 UTC - in response to Message 1865730.  

Thanks, Petri. After getting a glimpse of what zi3k+ could do on my initial tests, this machine was purpose-built for crunching with zi3t2b. So far, it's a champ and I couldn't be happier with the performance. As I mentioned in a previous thread, after only a few days, this machine has almost caught up to my 2-month old machine. That's impressive! Thanks for all your hard work.

scocam
ID: 1865743 · Report as offensive     Reply Quote
Profile scocam
Avatar

Send message
Joined: 28 Feb 17
Posts: 27
Credit: 15,120,999
RAC: 0
United States
Message 1865745 - Posted: 5 May 2017, 17:41:41 UTC - in response to Message 1865736.  

Yeap those 4 x 1070's on Linux will definitely put that machine up in the top 4 computers at SETI.


I would be happy to make the top 20! Anything more than that would be amazing. Aside from being fun, I hope it's able to somehow help the seti community as well.


scocam
ID: 1865745 · Report as offensive     Reply Quote
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1865765 - Posted: 5 May 2017, 20:11:18 UTC - in response to Message 1865742.  
Last modified: 5 May 2017, 20:15:22 UTC

I have noticed a few random restarts on this Ubuntu 14.04 machine since standing it up. I'll need to track down the source of these faults and get that under control to set my mind at ease. I think I'll tackle that during my Tuesday maintenance window. I've never had random restarts with Ubuntu before but I'm dual-booting this with a Windows 10 install which I've never done before. Both seem happy but Ubuntu has me wondering if it doesn't like playing nicely with Windows. I chose to dual-boot so I could do initial system monitoring and load GPU BIOS updates with Windows. Other than that, it just takes up space on the disk.
scocam
The random restarts with a fresh system is not a good indication. I recently had a machine start giving restarts overnight. First it was only once a week, after a few months it was multiple times a day. A new install would seem to work for about a day, then the restarts would resume. I tried different processors and ones that should have worked wouldn't even boot the machine even though they worked in other boards. At the end it had problems with Windows as well as Ubuntu. I replaced the board with an eBay board that used my existing parts and that solved the problem. Now it only gets restarted when there is a Kernel update. Of course, your problem could be different, hopefully.
ID: 1865765 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1865790 - Posted: 5 May 2017, 22:01:11 UTC - in response to Message 1865736.  

Yeap those 4 x 1070's on Linux will definitely put that machine up in the top 4 computers at SETI.
I wonder how long it will take to level out ...

Major hardware/software changes (or a new system) and no server issues- 6-8 weeks.
Grant
Darwin NT
ID: 1865790 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1865818 - Posted: 6 May 2017, 0:33:47 UTC - in response to Message 1865731.  

....'ve no clear idea of how many have been validated, I will have to wait for the hangovers to clear before I have useful info.

. . But only one out of several hundred is a good sign at this point.

Stephen

:)


Yesh,
Normally that ish a good ideah. And all the good sings are appreciated.

Petri :)


. . Have one for me :)

Stephen

:)
ID: 1865818 · Report as offensive     Reply Quote
Profile scocam
Avatar

Send message
Joined: 28 Feb 17
Posts: 27
Credit: 15,120,999
RAC: 0
United States
Message 1866252 - Posted: 8 May 2017, 16:11:31 UTC
Last modified: 8 May 2017, 16:12:22 UTC

Wow! This new build has already broken into the top 40 computers on Seti after only (seriously) crunching for about a week.

Though to be completely transparent, this machine (https://setiathome.berkeley.edu/show_host_detail.php?hostid=8257416) had been online (off and on) for almost 2 weeks prior to the Created date shown in the Host Detail and had accumulated some rac with two gpus. After I rebuilt it with a new mobo, two additional GPUs and the latest secret sauce, I merged the computers. The current build started on May 3rd with ~600k credits so that equates to crunching through almost 900k credits in 5 days... not bad!


scocam
ID: 1866252 · Report as offensive     Reply Quote
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1866264 - Posted: 8 May 2017, 18:00:35 UTC - in response to Message 1866252.  
Last modified: 8 May 2017, 18:02:04 UTC

You're on your way to the my #3 spot on the list. And will give those 8 cards on Windows a good chase.
scocam
Brent @ #3
uswg01 @ #2
Petri @ #1
ID: 1866264 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1866322 - Posted: 9 May 2017, 0:47:57 UTC - in response to Message 1866252.  

Wow! This new build has already broken into the top 40 computers on Seti after only (seriously) crunching for about a week.
Though to be completely transparent, this machine (https://setiathome.berkeley.edu/show_host_detail.php?hostid=8257416) had been online (off and on) for almost 2 weeks prior to the Created date shown in the Host Detail and had accumulated some rac with two gpus. After I rebuilt it with a new mobo, two additional GPUs and the latest secret sauce, I merged the computers. The current build started on May 3rd with ~600k credits so that equates to crunching through almost 900k credits in 5 days... not bad!
scocam

. . Wait until it settles down :)

Stephen

:)
ID: 1866322 · Report as offensive     Reply Quote
Profile tazzduke
Volunteer tester

Send message
Joined: 15 Sep 07
Posts: 190
Credit: 28,269,068
RAC: 5
Australia
Message 1866324 - Posted: 9 May 2017, 1:03:30 UTC

Greetings

Well I took the plunge (again) and now have two crunchers up on Mint 18.1. One is crunching the other I just got to finalise a few things.
I have a 3rd to switch over when I have some time.

Also running the special cuda app as well.

Regards
ID: 1866324 · Report as offensive     Reply Quote
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1866335 - Posted: 9 May 2017, 2:00:35 UTC

At last got it working, runs fine it seems and at lot faster than SoG on windows.

Installed Mint 18.1 on this host https://setiathome.berkeley.edu/results.php?hostid=8261137

Also installed Psensor to check temp/load but it doesn't show any info on my Gtx1080/970, only CPU and some other stuff. Do I need to do something else to get it working?
ID: 1866335 · Report as offensive     Reply Quote
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1866344 - Posted: 9 May 2017, 3:36:07 UTC - in response to Message 1866335.  

hmm it should show up. Check PSensor preferences.

Also might try GKrellm, I like it better for CPU usage. Also shows numerical GPU temps/fans.
Also in terminal ... nvidia-smi -l

This is from memory, so might be wrong ... In terminal
sudo sensors ... so show info
sudo sensors -detect ... check system for additional sensors i.e. Not just defaults.
ID: 1866344 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1866376 - Posted: 9 May 2017, 7:23:49 UTC - in response to Message 1866324.  


Well I took the plunge (again) and now have two crunchers up on Mint 18.1. One is crunching the other I just got to finalise a few things.
I have a 3rd to switch over when I have some time.
Also running the special cuda app as well.

. . Hi Tazz, I am guessing you have gotten the bug pretty bad :)

:)
ID: 1866376 · Report as offensive     Reply Quote
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 162 · Next

Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.