Setting up Linux to crunch CUDA90 and above for Windows users

petri33
Volunteer tester
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1953300 - Posted: 2 Sep 2018, 0:04:56 UTC - in response to Message 1953232.  
Last modified: 2 Sep 2018, 0:06:28 UTC

"bad arg: -pfl
bad arg: 512"

The flag -pfl N is not supported any more.
Please do not try it with UPPERCASE. It is still there so that I can test it on offline benchmarks and bug hunting.

Same applies to -pfe.

Petri

Hey Petri,
With all these Overflowing BLC11 tasks is there something different we can do to find out why some overflows end up Invalid filled with Triplets?

If anyone can find an Overflow filled with Triplets before it becomes Invalid and disappears, you need to grab it. Just look for an Overflow and see if it's all Triplets.
Unfortunately, it seems those filled with Triplets are Arecibo Tasks....oh well.



Hi,

A noise bomb has so many signals that both the standard SW and the special SW find it hard to decide when to stop recording them.
The WU is processed from start to finish, and the collecting part of the software has to have a limit on what it stores in CPU RAM.
One round checks signals that have been time-delayed or -forwarded and divided into lengths of 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, ....
Each round could provide hundreds or thousands of "found" signals.
Imagine that to be done in parallel.
Quite hard to select the "correct ones" since everything is "Correct" but only 30 can be reported.
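
The point about the reporting limit can be illustrated with a small sketch. This is not the code of the standard or special application; `find_signals`, the hit counts, and the seeding are hypothetical stand-ins, and only the limit of 30 and the chunk lengths come from the post above. It shows that when every round returns far more hits than can be reported, the set that ends up reported depends on the order in which rounds are collected.

```python
import random

MAX_REPORTED = 30  # reporting limit mentioned in the post


def find_signals(chunk_len, rng):
    """Hypothetical stand-in for one search round; returns (chunk_len, index) hits."""
    count = rng.randint(100, 1000)  # a noise bomb yields hundreds or thousands per round
    return [(chunk_len, i) for i in range(count)]


def collect(round_order, seed=0):
    """Collect hits round by round and stop once the reporting limit is reached."""
    reported = []
    for chunk_len in round_order:
        # Seed per chunk length so a round finds the same hits in any order.
        hits = find_signals(chunk_len, random.Random(seed + chunk_len))
        for hit in hits:
            if len(reported) == MAX_REPORTED:
                return reported
            reported.append(hit)
    return reported


lengths = [8 << k for k in range(10)]  # 8, 16, ..., 4096
serial = collect(lengths)
reordered = collect(list(reversed(lengths)))

# Both runs report 30 valid hits, yet they share none of them.
print(len(serial), len(reordered), set(serial).isdisjoint(reordered))
```

Every hit in both lists is "correct" in the sense of the post, which is why two applications that collect in a different order can disagree on an overflow result.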

A noise bomb is a noisebomb.
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1953300
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953341 - Posted: 2 Sep 2018, 3:18:30 UTC - in response to Message 1953232.  


Hey Petri,
With all these Overflowing BLC11 tasks is there something different we can do to find out why some overflows end up Invalid filled with Triplets?

If anyone can find an Overflow filled with Triplets before it becomes Invalid and disappears, you need to grab it. Just look for an Overflow and see if it's all Triplets.
Unfortunately, it seems those filled with Triplets are Arecibo Tasks....oh well.


. . There's a batch of Arecibo tasks about to come through, there should be some overflows amongst them.

. . Stephen

:)
ID: 1953341
Keith Myers
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1953356 - Posted: 2 Sep 2018, 8:21:26 UTC - in response to Message 1953268.  

What do you mean by grabbing it or catching it before being invalidated? Can you reprocess it somehow? Or just copy the WU file for more investigation?

Once the task gets invalidated, it's purged and no longer available for download by a download fanout generator. The idea is to grab a suspicious WU while it is still available and reprocess it offline with various applications in the benchmark tester for comparison.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1953356
TBar
Volunteer tester
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1953414 - Posted: 2 Sep 2018, 15:38:23 UTC - in response to Message 1953229.  
Last modified: 2 Sep 2018, 16:01:52 UTC

As for the Pulses completely missing, as well as the Best Pulse missing, I tracked a similar problem down to a GPU not fitting squarely in the Slot. It was tilted slightly upwards. See if you keep getting the same problem with the same GPU.

Will check that, of course, but it's unlikely to be that because I use a cube case; the GPUs are in the vertical position and firmly fixed. The host crunches 1000s of WUs every day, and only a few (3 yesterday) are marked as invalid, all from different GPUs.
Well, it looks like that problem is back, and it's on the card that was having the most problems even though it's in a different slot. So, I don't think it's the slot, it was the Top slot before, now the card is in the bottom slot and having the same problem. Fortunately it's only happening on One machine of mine, but I did see the same problem on someone else's machine, so, that makes three of us now having the problem. I wonder why it disappeared for a couple of days, I had been seeing a few a day. I have seen it on all cards, but mostly it was on the card in the Top Slot.
No Pulses Recorded, even the Best Pulse is Blank meaning it didn't find any non-reportable pulses either, https://setiathome.berkeley.edu/results.php?hostid=6796475&&state=5
ID: 1953414
juan BFP
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1953417 - Posted: 2 Sep 2018, 16:29:24 UTC - in response to Message 1953414.  

As for the Pulses completely missing, as well as the Best Pulse missing, I tracked a similar problem down to a GPU not fitting squarely in the Slot. It was tilted slightly upwards. See if you keep getting the same problem with the same GPU.

Will check that, of course, but it's unlikely to be that because I use a cube case; the GPUs are in the vertical position and firmly fixed. The host crunches 1000s of WUs every day, and only a few (3 yesterday) are marked as invalid, all from different GPUs.
Well, it looks like that problem is back, and it's on the card that was having the most problems even though it's in a different slot. So, I don't think it's the slot, it was the Top slot before, now the card is in the bottom slot and having the same problem. Fortunately it's only happening on One machine of mine, but I did see the same problem on someone else's machine, so, that makes three of us now having the problem. I wonder why it disappeared for a couple of days, I had been seeing a few a day. I have seen it on all cards, but mostly it was on the card in the Top Slot.
No Pulses Recorded, even the Best Pulse is Blank meaning it didn't find any non-reportable pulses either, https://setiathome.berkeley.edu/results.php?hostid=6796475&&state=5

Weird at least.
After the one I posted (now cleared from the list) I did not notice any other with a similar problem.
It's above my pay grade, but is it possible to run the WU in the test program to see if that repeats?
ID: 1953417
TBar
Volunteer tester
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1953421 - Posted: 2 Sep 2018, 16:55:35 UTC - in response to Message 1953414.  
Last modified: 2 Sep 2018, 17:52:43 UTC

As for the Pulses completely missing, as well as the Best Pulse missing, I tracked a similar problem down to a GPU not fitting squarely in the Slot. It was tilted slightly upwards. See if you keep getting the same problem with the same GPU.
Will check that, of course, but it's unlikely to be that because I use a cube case; the GPUs are in the vertical position and firmly fixed. The host crunches 1000s of WUs every day, and only a few (3 yesterday) are marked as invalid, all from different GPUs.
Well, it looks like that problem is back, and it's on the card that was having the most problems even though it's in a different slot. So, I don't think it's the slot, it was the Top slot before, now the card is in the bottom slot and having the same problem. Fortunately it's only happening on One machine of mine, but I did see the same problem on someone else's machine, so, that makes three of us now having the problem. I wonder why it disappeared for a couple of days, I had been seeing a few a day. I have seen it on all cards, but mostly it was on the card in the Top Slot.
No Pulses Recorded, even the Best Pulse is Blank meaning it didn't find any non-reportable pulses either, https://setiathome.berkeley.edu/results.php?hostid=6796475&&state=5
Weird at least.
After the one I posted (now cleared from the list) I did not notice any other with a similar problem.
It's above my pay grade, but is it possible to run the WU in the test program to see if that repeats?
Oh, I've done everything possible, including swapping out cards with other machines. The problem was it was always just the one machine having the problem. If you ask Petri how many times I've mentioned this, his answer would be LOTS. Now that other people are having the same problem, perhaps he will look at it again. Another one invalid... Don't know why it disappeared, but it's back with a vengeance. I've found a few more from last night; it seems something happened between 6-7PM EDT, with nothing since around 7 last night. That's interesting, it seems the problems stopped just after making this post, https://setiathome.berkeley.edu/forum_thread.php?id=83306. Before that I was looking for an Overflow filled with Triplets. However, I have seen the missing Pulses on the two cards Not running the monitor, just not many times.
ID: 1953421
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1953903 - Posted: 5 Sep 2018, 21:40:44 UTC

Tbar,
I would swear someplace in the threads you posted a link to a "turnkey" of the very latest version of the CUDA90.

Since the discussion(s) have wandered over "several" threads I can't find it.

I just got the "Nvidia-396" installed and am wondering if I can get the latest compile that will work with that driver.

I think I read that there was a later driver from Nvidia but when I looked, all I found was another "390" type .run file.

Thank you for the redirects.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1953903
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953908 - Posted: 5 Sep 2018, 22:04:03 UTC - in response to Message 1953903.  

Tbar,
I would swear someplace in the threads you posted a link to a "turnkey" of the very latest version of the CUDA90.

Since the discussion(s) have wandered over "several" threads I can't find it.

I just got the "Nvidia-396" installed and am wondering if I can get the latest compile that will work with that driver.

I think I read that there was a later driver from Nvidia but when I looked, all I found was another "390" type .run file.

Thank you for the redirects.

Tom


. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)
ID: 1953908
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1953912 - Posted: 5 Sep 2018, 22:29:28 UTC - in response to Message 1953908.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)


Thank you Stephen. Now all I gotta do is copy it into the right folder and make sure all the execute permissions are set. Then I can join the "jetter setters"?

Tom
A proud member of the OFA (Old Farts Association).
ID: 1953912
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1953916 - Posted: 5 Sep 2018, 22:46:01 UTC - in response to Message 1953912.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)


Thank you Stephen. Now all I gotta do is copy it into the right folder and make sure all the execute permissions are set. Then I can join the "jetter setters"?

Tom


Got it set up and watched three "non-bombs" blow through in 2.5 minutes. My "time remaining" estimate shows 3 min 50 seconds.... I guess I did just join the "jetter setters" :)

Tom
A proud member of the OFA (Old Farts Association).
ID: 1953916
Keith Myers
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1953928 - Posted: 5 Sep 2018, 23:56:21 UTC - in response to Message 1953908.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)

That is an old link. There is a newer application b2 in various configurations.
Links to 0.97b2 special app for Maxwell/Pascal
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1953928
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953934 - Posted: 6 Sep 2018, 0:38:07 UTC - in response to Message 1953912.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)


Thank you Stephen. Now all I gotta do is copy it into the right folder and make sure all the execute permissions are set. Then I can join the "jetter setters"?

Tom


. . the "right" folder depends on which version of BOINC you are using. I believe you are running 7.8.3 as per TBar's post so it would be, if done correctly, /home/username/{Desktop/}BOINC/projects/setiathome.berkeley.edu. It is probably without the Desktop but may include it if you extracted the archive on your desktop.

Stephen

:)

But be careful with your app_info.xml editing. It may be easiest if you follow Petri's example and rename the new app to the same name as the one you are running and simply copy it over the current app file. That way you won't trash any currently cached tasks. I am not sure if it is a must, but you can edit app_info.xml to remove the references to the external libraries, as the new "static" app has them built in to give it the speed it attains. But again be careful: remove those references from both sections in which they appear.

Stephen

:)
ID: 1953934
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953935 - Posted: 6 Sep 2018, 0:39:43 UTC - in response to Message 1953928.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)

That is an old link. There is a newer application b2 in various configurations.
Links to 0.97b2 special app for Maxwell/Pascal


. . OK, that is latest version then with -pfe and -pfl neutralised?

. . It is hard to keep up :)

Stephen

? ?
ID: 1953935
Keith Myers
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1953940 - Posted: 6 Sep 2018, 1:05:03 UTC - in response to Message 1953934.  

. . It is in this thread but with the volume of messages it is several pages back.

https://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952208


Stephen

:)


Thank you Stephen. Now all I gotta do is copy it into the right folder and make sure all the execute permissions are set. Then I can join the "jetter setters"?

Tom


. . the "right" folder depends on which version of BOINC you are using. I believe you are running 7.8.3 as per TBar's post so it would be, if done correctly, /home/username/{Desktop/}BOINC/projects/setiathome.berkeley.edu. It is probably without the Desktop but may include it if you extracted the archive on your desktop.

Stephen

:)

But be careful with your app_info.xml editing. It may be easiest if you follow Petri's example and rename the new app to the same name as the one you are running and simply copy it over the current app file. That way you won't trash any currently cached tasks. I am not sure if it is a must, but you can edit app_info.xml to remove the references to the external libraries, as the new "static" app has them built in to give it the speed it attains. But again be careful: remove those references from both sections in which they appear.

Stephen

:)

Don't need to do any app_info editing. Just unpack the archive and copy the new 0.97b2 application and the provided app_info in the Seti projects folder. The only change from the previous 0.97 archive is the new b2 version and the updated app_info to account for the app name change. All the other applications, files and folders are the same.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1953940
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1953946 - Posted: 6 Sep 2018, 1:15:42 UTC - in response to Message 1953940.  

Don't need to do any app_info editing. Just unpack the archive and copy the new 0.97b2 application and the provided app_info in the Seti projects folder. The only change from the previous 0.97 archive is the new b2 version and the updated app_info to account for the app name change. All the other applications, files and folders are the same.


I just copied the whole thing, including the app_info.xml, over the top of the files in my /setiathome folder and checked to make sure everything was "executable". I set a couple of shared library files that said "none" to executable on general principles and boom, it fired right up.

Yes, I did have to reboot after I installed the "meta-396" drivers for the Nvidia cards.
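
For anyone who prefers to script the copy-and-permissions step described above, here is a minimal sketch. It assumes the archive has already been unpacked; the function names and the file names in the usage note are hypothetical, and the real project folder depends on your BOINC install.

```python
import os
import shutil
import stat
from pathlib import Path


def install_files(src_dir, project_dir):
    """Copy every regular file from src_dir into project_dir, overwriting existing ones."""
    src, dst = Path(src_dir), Path(project_dir)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for item in sorted(src.iterdir()):
        if item.is_file():
            copied.append(Path(shutil.copy2(item, dst / item.name)))
    return copied


def make_executable(path):
    """Add execute permission for user, group, and others (like chmod a+x)."""
    mode = os.stat(path).st_mode
    os.chmod(path, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)


def is_executable(path):
    """Return True if the owner execute bit is set."""
    return bool(os.stat(path).st_mode & stat.S_IXUSR)
```

A call might look like `install_files("unpacked_archive", "/path/to/BOINC/projects/setiathome.berkeley.edu")` followed by `make_executable(...)` on the application binary. Stop BOINC before overwriting files; app_info.xml itself does not need the execute bit.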

Tom
A proud member of the OFA (Old Farts Association).
ID: 1953946
Keith Myers
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1953955 - Posted: 6 Sep 2018, 1:24:33 UTC - in response to Message 1953946.  

Yes, very simple really.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1953955
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953956 - Posted: 6 Sep 2018, 1:29:17 UTC - in response to Message 1953940.  


Don't need to do any app_info editing. Just unpack the archive and copy the new 0.97b2 application and the provided app_info in the Seti projects folder. The only change from the previous 0.97 archive is the new b2 version and the updated app_info to account for the app name change. All the other applications, files and folders are the same.


. . I may have missed a detail but I believe Tom was running zi3v not 0.97b1. Anyway, the deed is done ... :)

Stephen

:)
ID: 1953956
Keith Myers
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1953962 - Posted: 6 Sep 2018, 1:53:24 UTC - in response to Message 1953956.  

Even if he was, it wouldn't have mattered. The archive has a complete app_info already configured for the new app.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1953962
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1953963 - Posted: 6 Sep 2018, 1:55:33 UTC - in response to Message 1953956.  


Don't need to do any app_info editing. Just unpack the archive and copy the new 0.97b2 application and the provided app_info in the Seti projects folder. The only change from the previous 0.97 archive is the new b2 version and the updated app_info to account for the app name change. All the other applications, files and folders are the same.


. . I may have missed a detail but I believe Tom was running zi3v not 0.97b1. Anyway, the deed is done ... :)

Stephen

:)


I think I was running CUDA90. Not sure if that is zi3v or not.

But it was completely painless as an upgrade. My favorite! :)

Tom
A proud member of the OFA (Old Farts Association).
ID: 1953963
Stephen "Heretic"
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1953985 - Posted: 6 Sep 2018, 3:13:01 UTC - in response to Message 1953962.  
Last modified: 6 Sep 2018, 3:14:48 UTC

Even if he was, it wouldn't have mattered. The archive has a complete app_info already configured for the new app.


. . Yes, but if he was running zi3v and the new app_info.xml did not have an app section to include tasks tagged for zi3v ... instant ghosts.. I have not seen the newest app_info.xml, does it have a section for zi3v?

. . OK, I have seen Tom's response so it would seem the new app_info has allowed for that :)

. . I must have a look at what is in it.
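
One way to "have a look" before swapping files is to compare the app versions declared in the old and new app_info.xml. The sketch below is only an illustration: it assumes the usual anonymous-platform layout with `<app_version>` blocks containing `<app_name>`, `<version_num>`, and `<plan_class>`, and the sample values in the usage note are made up rather than taken from the actual 0.97b2 archive.

```python
import xml.etree.ElementTree as ET


def app_versions(xml_text):
    """Return the set of (app_name, version_num, plan_class) declared in an app_info.xml."""
    root = ET.fromstring(xml_text)
    found = set()
    for av in root.iter("app_version"):
        found.add((
            av.findtext("app_name", "").strip(),
            av.findtext("version_num", "").strip(),
            av.findtext("plan_class", "").strip(),
        ))
    return found


def missing_versions(old_xml, new_xml):
    """App versions present in the old file but absent from the new one."""
    return app_versions(old_xml) - app_versions(new_xml)
```

Reading both files with `Path(...).read_text()` and passing them to `missing_versions` gives an empty set when the new file covers everything the old one did. A non-empty result, for example a hypothetical `("setiathome_v8", "801", "cuda90")`, would be a hint that cached tasks tagged for that version may no longer match anything after the swap.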

Stephen

?
ID: 1953985