Sutaru Tsureku 的帖子

141) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1733063)
发表于:9 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Mike, which problems you had with AMD Catalyst v15.7.1?
It looks like my PC made a self reboot with it.


How to uninstall the AMD Catalyst?
Windows 8.1: Programs and Features
Uninstall 'AMD Catalyst Install Manager'?

If I install the AMD Catalyst,
I should use the 'Express' or the 'Custom' installation?
Last time I made the 'Custom' installation and let all checked (IIRC, 6 entries, incl. Raptr).
(next time I'll uncheck Raptr, because it's just a gamer online thing)
Which is really needed and must be installed (for SETI crunching)?

If I would like to test AMD Catalyst v15.9.1 Beta, (someone tested this version already, with which OS/hardware?)
there is all in it or I need additional software?
Because the file names are different, with (normal) and without (beta version) 'with .NET 4.5'.


woohoo, IIRC, the R9 Fury X is available since mid 2015.
The v15.7.1 was released 2015/07/29.
The v15.7 was released 2015/07/08.
The v14.12 was released 2014/12/09, I guess it wouldn't work with my VGA cards.
142) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732986)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
What do you have in your cmdline.txt file for AP?

Thanks.
143) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732984)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
If I set -no_cpu_lock the above mentioned errors happens:

Messages in BOINC:
Task postponed: Suspicious spike results, host needs reboot or maintenance
...or...
Task postponed: Triplet data corruption, retry from checkpoint.

Maybe I get it a try again?
Can't remember if I tested it already with just 1 WU/GPU.
144) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732982)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Raistmer wrote:
Yep. First to do is to separate completely mixed factors now.
CPU affinity behavior, multiple tasks per GPU behavior, multiple GPU per se behavior.

So, affinity locked as default, 1 task per GPU as default, await at least 10 tasks completions ON EACH of 4 GPUs before touch anything again.

Well results of...

VGA card 0
http://setiathome.berkeley.edu/result.php?resultid=4434235420
http://setiathome.berkeley.edu/result.php?resultid=4434235462
http://setiathome.berkeley.edu/result.php?resultid=4434235216
http://setiathome.berkeley.edu/result.php?resultid=4434235217
http://setiathome.berkeley.edu/result.php?resultid=4434235228
http://setiathome.berkeley.edu/result.php?resultid=4434235263
http://setiathome.berkeley.edu/result.php?resultid=4434235331
http://setiathome.berkeley.edu/result.php?resultid=4434235344
http://setiathome.berkeley.edu/result.php?resultid=4434235374
http://setiathome.berkeley.edu/result.php?resultid=4434235390

VGA card 1
http://setiathome.berkeley.edu/result.php?resultid=4434235415
http://setiathome.berkeley.edu/result.php?resultid=4434234398
http://setiathome.berkeley.edu/result.php?resultid=4434235235
http://setiathome.berkeley.edu/result.php?resultid=4434234742
http://setiathome.berkeley.edu/result.php?resultid=4434235284
http://setiathome.berkeley.edu/result.php?resultid=4434235302
http://setiathome.berkeley.edu/result.php?resultid=4434235352
http://setiathome.berkeley.edu/result.php?resultid=4434234844
http://setiathome.berkeley.edu/result.php?resultid=4434235370
http://setiathome.berkeley.edu/result.php?resultid=4434235402

VGA card 2
http://setiathome.berkeley.edu/result.php?resultid=4434235396
http://setiathome.berkeley.edu/result.php?resultid=4434234918
http://setiathome.berkeley.edu/result.php?resultid=4434235183
http://setiathome.berkeley.edu/result.php?resultid=4434235441
http://setiathome.berkeley.edu/result.php?resultid=4434235443
http://setiathome.berkeley.edu/result.php?resultid=4434235460
http://setiathome.berkeley.edu/result.php?resultid=4434235297
http://setiathome.berkeley.edu/result.php?resultid=4434235070
http://setiathome.berkeley.edu/result.php?resultid=4434235332
http://setiathome.berkeley.edu/result.php?resultid=4434235334

VGA card 3
http://setiathome.berkeley.edu/result.php?resultid=4434235392
http://setiathome.berkeley.edu/result.php?resultid=4434235404
http://setiathome.berkeley.edu/result.php?resultid=4434235412
http://setiathome.berkeley.edu/result.php?resultid=4434235167
http://setiathome.berkeley.edu/result.php?resultid=4434235182
http://setiathome.berkeley.edu/result.php?resultid=4434235457
http://setiathome.berkeley.edu/result.php?resultid=4434235311
http://setiathome.berkeley.edu/result.php?resultid=4434235373
http://setiathome.berkeley.edu/result.php?resultid=4433115667
http://setiathome.berkeley.edu/result.php?resultid=4433115414

What should I do now?

Thanks.
145) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732892)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:

BOINC say the VGA cards support OpenCL v2.0.


-no_cpu_lock as default override
How it can be default if you provide option to override default settings???
Default means no options supplied.

From stderr:

Name: Fiji
Vendor: Advanced Micro Devices, Inc.
Driver version: 1800.8 (VM)
Version: OpenCL 1.2 AMD-APP (1800.8)

That can be issue...

OK, but then all 12 GPU apps will be fixed at CPU-thread#0.
Or I should let run just 1 MB WU/GPU?

Maybe I should test the above mentioned AMD beta driver?
146) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732882)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Task postponed: Triplet data corruption, retry from checkpoint.

So you need to check if AMD own OpenCL samples work well.

Leave app running in default regime w/o any additional settings.
Will it produce valid results in number being launched on all your GPUs?
Only when valid execution for all 4 GPUs with default settings will be proven firmly worth to try to speedup/improve things.


Hm, it depend from which side I look - 'default'... ;-)

I set '0.33' in app_info.xml and '-no_cpu_lock -hp' in cmdline.txt. This would be 'default'?


BOINC say the VGA cards support OpenCL v2.0.
147) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732871)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Raistmer wrote:
Also, any task result I looked into has many restarts. Try to be patient a little and not fiddle with settings. Allow few tasks to complete on their own, w/o restarts and re-sheduling between GPUs. Then provide links to their results on web page.

How much MB WUs/VGA card simultaneously? 3?

What in cmdline.txt file?

If I set -no_cpu_lock the above mentioned errors happens:
Task postponed: Suspicious spike results, host needs reboot or maintenance
...or...
Task postponed: Triplet data corruption, retry from checkpoint.
148) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732870)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Raistmer wrote:
Make sure your config supported by AMD OpenCL runtime.
Test few AMD OpenCL samples from their SDK. If some fail then interaction with AMD support required.

I'm new related AMD things...

I just installed the AMD Catalyst Software Suite (v15.7.1) [of 29.07.2015]...
There is also a 'v15.9.1 Beta' [of 30.09.2015] available.

Could you (or someone other) give me little bit more infos and URLs?

Thanks.
149) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732859)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Of the readme.txt file:
-no_cpu_lock : To disable affinity management (opposite to -cpu_lock option). For ATi version CPUlock affinity management enabled by default.
[In the meantime it was mentioned already.]

So I used '-no_cpu_lock' and then all CPU-threads were allowed/used.

BOINC showed inter alia:
Task postponed: Suspicious spike results, host needs reboot or maintenance
...or...
Task postponed: Triplet data corruption, retry from checkpoint.

So L2-Cache miss?
GPU app got support from CPU#0 and then from CPU#1?

Then I tried '-cpu_lock -total_GPU_instances_num 12 -hp'.
The result like above with -no_cpu_lock', all CPU-threads allowed/used.

First I tried '-cpu_lock -total_GPU_instances_num 3 -hp', the result was all GPU apps were fixed at CPU-thread#3.

-cpu_lock -instances_per_device 3 -hp
All GPU apps fixed at CPU-thread#0.


Either I use '-no_cpu_lock' and get the above mentioned errors,
or all GPU apps fixed at one CPU-thread, which will be overloaded (one whole thread and all others idle) and reduce very much GPU crunching.

What could I do?


I can't disable one CPU (socket) in BIOS.


1 WU/GPU (in whole 4 GPU apps):
12 CPU-threads = 30 % CPU
24 CPU-threads = 15 % CPU (HT on)
[HT on or off, it's the same CPU support, or?]

2 WUs/GPU (in whole 8 GPU apps):
12 CPU-threads = 45 % CPU

3 WUs/GPU (in whole 12 GPU apps):
12 CPU-threads = 50 % CPU


Until now there is no cc_config.xml file.
BOINC use all 4 VGA cards, it shows:
'0,04C + 0,33 AMD/ATI GPUs (d0)'
d0, d1, d2 and d3 (each 3 times)
150) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732824)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
The max until now was 2 MB WUs/VGA card.
I tried 1 MB WU/VGA card, all 4 GPU apps are still fixed at CPU-thread#0.

In Task-Manager it's named: 'CPU 0 (Knoten*: 0)'

'Knoten*: 1' is also there, I guess this is the second CPU (socket).

[* german]


(BTW. CrossFireX is disabled via AMD tool.)
151) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732816)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
petri33 wrote:
Hi,
Your results show affinity mask 1. That limits the number of CPUs to 1.

Hi,
what you mean?

The cmdline.txt file is empty.

You mean this?
- - - - - - - - - -
(In time with HT off, just 12 CPU-threads)
(...)
<stderr_txt>
Running on device number: 3
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Advanced Micro Devices, Inc.
BOINC assigns device 3
7 slot of 64 used for this instance
Info: BOINC provided OpenCL device ID used
Info: CPU affinity mask used: 1

Build features: SETI7 (...)
- - - - - - - - - -

I can't change this, or?

So it's an app problem?
152) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732815)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
I disabled HT in BIOS.
Windows see now 12 CPU-threads.
But the same like I wrote above.
All 8 GPU apps (2 MB WUs/GPU) are fixed at CPU-thread#0.

I need to manually choose the affiliation in Task-Manager: 'use all CPU-threads'.
Then nearly all CPU-threads do something, ~50% CPU is working (2 MB WUs/GPU= 8).


The CPUs do nothing, just GPU app support.


I'm very disappointed and angry - I collected the money, build this build, I see the finish line - and then it don't work.


I have no idea why all GPU apps are fixed at CPU-thread#0.


The motherboard have BIOS v5206.
ASUS have v5701 online for upgrade.
Could be the BIOS the problem?

Or maybe the motherboard drivers?
153) 留言板 : Number crunching : 4x AMD Radeon R9 Fury X (消息 1732805)
发表于:8 Oct 2015 作者: Profile Sutaru Tsureku
Post:
Maybe you remember my old thread here.
4x HD7990 didn't worked.

I installed four R9 Fury X VGA cards.

The last Windows 8.1 Pro x64 DVD didn't boot.
After two months I got a new DVD... - and yes, this disk didn't boot also. (The old Windows 8.1 x64 DVD from an other PC worked (the OS wanted to install).)
So Microsoft sell just now DVDs which don't boot?
So I burned an .ISO DVD - and finally I could install Windows.

Motherboard drivers installed.
All updates for Windows.
Newest AMD VGA card driver (v15.7.1) installed.

Installed BOINC.
Opti Lunatics (v0.43b) apps installed (just AP and MB for ATI GPU).

1 MB WU/GPU... - and it 'worked*'.
Then I tested 2 MB WUs/GPU... with '-cpu_lock -instances_per_device 2 -hp' and it 'worked*'.
In GPU-Z nearly no 'GPU Load'.
I opened Task-Manager and I saw all 8 MB GPU WUs were fixed at CPU-thread#0.
[* not really]

I deletd all in 'cmdline.txt' file.
Started BOINC again and all 8 MB WUs were fixed still at CPU-thread#0.

CPU-thread#0 was full loaded - the other 23 CPU-threads were idle (2x Xeon (each 6 Core/12 threads = 24 threads in whole)).

Why all GPU apps are fixed at CPU-thread#0?

Thanks.
154) 留言板 : Number crunching : Alternative Instead Of BOINC v7.6.9? (消息 1730270)
发表于:30 Sep 2015 作者: Profile Sutaru Tsureku
Post:
BOINC v7.4.42 could cause invalid results at Milkyway.

v7.6.9 have a fix, but isn't 'perfect' still - AFAIK. Or this version would be the 'best' solution of all BOINC versions currently?

Which older BOINC version don't cause invalid results at Milkyway?
It looks like this happens since v7.4.42, right?

v7.2.42 would be an alternative?

Thanks.
155) 留言板 : Number crunching : new computer (消息 1730012)
发表于:29 Sep 2015 作者: Profile Sutaru Tsureku
Post:
rob smith wrote:
The question is about running at "full bore", which I take as only using the laptop's own thermal management.
That really is a bit of string - if you are fortunate and the room is very cool, and the laptop has very good intrinsic cooling and thermal management it will last a lot longer than one that is in a very hot room and has very poor intrinsic cooling and thermal management.
Of course you can improve vastly the situation by using a PROPER external cooling pad - one which actually cools the incoming air to below ambient, or a utility like TThrotle.

Could you give examples of this cooling pads which cool down the laptops' incoming air?

Thanks.

- - - - - - - - - -

A friend played 1 1/2 years daily on his laptop and then he's gone up in smoke.

I thought about to buy a laptop (Intel Core i3-4005U incl. Intel HD Graphics 4400) for 24/7 SETI crunching...
I have no idea if the CPU and iGPU c/w-ould crunch simultaneously 24/7 under full load...

Currently I don't know if it would be a good idea to crunch 24/7 on a laptop.
156) 留言板 : Number crunching : Cache filling strategy BOINC 7.4.42 vs. 7.6.6 and/or other improvements? (消息 1719098)
发表于:27 Aug 2015 作者: Profile Sutaru Tsureku
Post:
So it would be better to wait until BOINC v7.6.8 will be the recommended version (already available for download - if you know where... ;-)?
157) 留言板 : Number crunching : Win8.1 x64 - Win Update NVidia Driver (消息 1713670)
发表于:15 Aug 2015 作者: Profile Sutaru Tsureku
Post:
I have this Motherboard/CPU combi since January '15. It's a desktop. ASRock Q1900DC-ITX board with onboard Intel Celeron J1900 CPU incl. iGPU (here screen connected).
I added the NV GT730 in May '15.

...at that time I downloaded the NVidia driver.exe from NVidia, but it don't install, because it say 'no NVidia device installed'.
In device manager I updated the driver of the new installed device, and after a few seconds the 347.52 driver was installed and the NV GT730 was shown.
Long story here.

So, if I'll uninstall the 347.52 driver for to upgrade with a newer driver it wouldn't work, because the driver.exe will say then again 'no NVidia device installed'.

I guess the NV driver.exe don't find a NV device, because it's no screen or VGA dummy plug connected to the GT730.

So in my case, it looks like I have just the option to upgrade the driver with the Windows Update.
158) 留言板 : Number crunching : Win8.1 x64 - Win Update NVidia Driver (消息 1713473)
发表于:15 Aug 2015 作者: Profile Sutaru Tsureku
Post:
I have Win8.1 x64.
Win Update notified me about a new driver:

»nVidia - Graphics Adapter WDDM1.1, Graphics Adapter WDDM1.2, Graphics Adapter WDDM1.3 - NVIDIA GeForce GT 730

Downloadgröße: 283,2 MB

Sie müssen ggf. den Computer neu starten, damit die Änderungen wirksam werden.

Updatetyp: Optional

nVidia Graphics Adapter WDDM1.1, Graphics Adapter WDDM1.2, Graphics Adapter WDDM1.3 software update released in August, 2015

Weitere Informationen:
http://sysdev.microsoft.com/support/default.aspx

Hilfe und Support:
http://support.microsoft.com/select/?target=hub
«

[If I follow the URLs, it don't make me smarter.]


The NV GT730 is just for SETI crunching (2nd GPU). No screen or VGA dummy plug connected.
If I download the driver.exe from NVidia, it don't install the driver, because it say 'no NVidia device installed'...

Over the device manager Win installed automatically the 347.52 driver.

So I must use the Win Update drivers...

But, which version is the above mentioned driver?
And, normally I must uninstall the NV driver first, before I install a new driver... - this above mentioned driver I can install over the current one?

BTW. The above mentioned driver (if we know which driver version it is) work well with the apps of Lunatics Installer v0.43b?

Thanks.
159) 留言板 : Number crunching : Monster GPU Cruncher Build (消息 1707483)
发表于:2 Aug 2015 作者: Profile Sutaru Tsureku
Post:
I'm little bit confused about the ASUS Z9PE-D8 WS board.

I understood it like this that PCIe slot #1 to #4 are connected to CPU#1 and PCIe slot #5 to #7 are connected to CPU#2.

hardwareluxx.de - they write:
»Insgesamt hat man somit 72 PCIe-Lanes in der neuen PCIe-3.0-Spezifikation zur Verfügung - das ist eine gehörige Bandbreite, die hier zustande kommt. Zusatzchips als Bridges oder ähnliches müssen allerdings nicht verwendet werden, da zwei CPUs zum Einsatz kommen: Die Slots 1-4 gehören zur CPU 1, die Slots 5-7 zur CPU 2. Entsprechend läuft die Kommunikation im Quad-SLI über den QPI-Bus und nicht direkt. Da im Dual-CPU-Betrieb zwei QPI-Links zur Verfügung stehen, sollte die Bandbreite aber vollkommen ausreichen. Sämtliche Onboard-Komponenten werden hingegen über den C602-Chipsatz angebunden.«

Google translator said:
»Overall, it has thus 72 PCIe lanes in the new PCIe 3.0 specification is available - this is a proper bandwidth that comes here. Additional chips as Bridges or similar but must not be used as two CPUs are used: The slots 1-4 are CPU 1, the slots 5-7 to the CPU 2. In accordance with the communication in Quad SLI runs over the QPI bus and not directly. Since two QPI links available in dual CPU operation available, the bandwidth should but perfectly sufficient. All onboard components are, however, connected via the C602 chipset.«

The board have two QPI, one each for one CPU.
I don't use SLI or CrossFireX cables.

So does this mean, that VGA card #1 (in PCIe slot #1) and VGA card #2 (in PCIe slot #3) communicate with CPU#1 and
VGA card #3 (in PCIe slot #5) and VGA card #4 (in PCIe slot #7) communicate with CPU#2 ... (I understood it like this) ...?

OK, what happens if I use -cpu_lock in cmdline...
What happens if the project app on VGA card #4 will be fixed on a CPU-thread of CPU#1 (they are not connected)?

Or the two QPI are connected in the northbridge and it could go the above mentioned?
Or the app don't get CPU support and go in a never ending waiting loop?

Thanks.
160) 留言板 : Number crunching : Panic Mode On (99) Server Problems? (消息 1706528)
发表于:30 Jul 2015 作者: Profile Sutaru Tsureku
Post:
I sent E-Mail to the admins.
Because 'not in DB' and '.vlar's to NV GPUs'.

I got E-Mail from Eric, he's looking to it.


前 20 · 后面 20


 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.