Lunatics Windows Installer v0.43 Release Notes

Message boards : Number crunching : Lunatics Windows Installer v0.43 Release Notes
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 11 · Next

AuthorMessage
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1590954 - Posted: 23 Oct 2014, 20:36:10 UTC - in response to Message 1590890.  

In task manager you only see system memory not GPU memory consumption.

That´s exactly why this issue is wierd, why the main memory usage rises to that kind of level on a determinate WU while all others remains low?

When the system memory usage rises to that large number the system runs out of memory very fast.


yep, exactly system (OS, CPU.., not GPU) memory increase expected with issue I described earlier.


I honestly dont think so.
I had 3 30/30 in a row 2 days ago and on my 4GB sys mem host it would have crashed.
Two were running at the same time with those sharp timings.
I never had this happen.

Like i said earlier i watched closely while benching Juans task i had 104K constantly during this bench and it is overflow task.
I think this is host dependend.


With each crime and every kindness we birth our future.
ID: 1590954 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1590964 - Posted: 23 Oct 2014, 20:49:09 UTC - in response to Message 1590958.  

In task manager you only see system memory not GPU memory consumption.

That´s exactly why this issue is wierd, why the main memory usage rises to that kind of level on a determinate WU while all others remains low?

When the system memory usage rises to that large number the system runs out of memory very fast.


yep, exactly system (OS, CPU.., not GPU) memory increase expected with issue I described earlier.


I honestly dont think so.
I had 3 30/30 in a row 2 days ago and on my 4GB sys mem host it would have crashed.
Two were running at the same time with those sharp timings.
I never had this happen.

Like i said earlier i watched closely while benching Juans task i had 104K constantly during this bench and it is overflow task.
I think this is host dependend.


Talking about timings, do you have any of these extra "-oclfft_plan ", and "-tune " things which would fit my HD7870? -use_sleep, is that something for me too?

For now, I only use: -unroll 12 -ffa_block 8192 -ffa_block_fetch 4096 -instances_per_device 2

Oh, and only if they do not lead to the release of "Magic Smoke" :-)


use_sleep is necessary only on nvidia cards to reduce CPU usage.
Your card is easy to handle.

-unroll 16 -oclfft_plan 256 16 256 -ffa_block 12288 -ffa_block_fetch 6144 -tune 1 64 4 1 -tune 2 64 4 1


With each crime and every kindness we birth our future.
ID: 1590964 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1590968 - Posted: 23 Oct 2014, 20:53:07 UTC - in response to Message 1590954.  


Like i said earlier i watched closely while benching Juans task i had 104K constantly during this bench and it is overflow task.
I think this is host dependend.


With exactly same command line as he provided?
I configured memory working set logger via perfmon (and learnt smth new , LoL) so now log into file current working set each 10 seconds (configurable).

Will run his workunit later with same command line (but with ATi app cause NV card can't accept same command line).
ID: 1590968 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1590972 - Posted: 23 Oct 2014, 20:54:33 UTC - in response to Message 1590968.  


Like i said earlier i watched closely while benching Juans task i had 104K constantly during this bench and it is overflow task.
I think this is host dependend.


With exactly same command line as he provided?
I configured memory working set logger via perfmon (and learnt smth new , LoL) so now log into file current working set each 10 seconds (configurable).

Will run his workunit later with same command line (but with ATi app cause NV card can't accept same command line).


Yes, sure.


With each crime and every kindness we birth our future.
ID: 1590972 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591324 - Posted: 24 Oct 2014, 12:15:16 UTC - in response to Message 1590972.  
Last modified: 24 Oct 2014, 12:16:14 UTC

I think this is host dependend.

I belive is not, why? it´s happening on a totaly diferent OS & GPU, the same happening on a 2x690 who runs Win Server. The only thing they have in common is they use the same old I5 CPU.

After few test i made here remotely, i could confirm the problem is aparently related to the size of the -ffa_block, a 4096 block makes some WU ussing 150MB, larger sizes huge increase the memory usage, a 8192 makes the WU uses about 250MB and so on, the 1GB apears when you use a 16k block size.

For some reason who i cant understand until now, my hosts simply apears to ignore the -oclfft_plan 256 16 256 switch.
ID: 1591324 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1591330 - Posted: 24 Oct 2014, 12:55:05 UTC - in response to Message 1591324.  

-oclfft_plan 256 16 256 switch.

yep, as I said earlier there is no confirmation from app in log (stderr) that switch was recognized.
ID: 1591330 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591332 - Posted: 24 Oct 2014, 12:58:36 UTC - in response to Message 1591324.  
Last modified: 24 Oct 2014, 13:02:05 UTC

I think this is host dependend.

I belive is not, why? it´s happening on a totaly diferent OS & GPU, the same happening on a 2x690 who runs Win Server. The only thing they have in common is they use the same old I5 CPU.

After few test i made here remotely, i could confirm the problem is aparently related to the size of the -ffa_block, a 4096 block makes some WU ussing 150MB, larger sizes huge increase the memory usage, a 8192 makes the WU uses about 250MB and so on, the 1GB apears when you use a 16k block size.

For some reason who i cant understand until now, my hosts simply apears to ignore the -oclfft_plan 256 16 256 switch.


-oclFFT_plan is case sensitive.
Make sure FFT is upper case or just snip it from the read me.
-use_sleep -unroll 16 -oclFFT_plan 256 16 256 -ffa_fetch 8192 -ffa_fetch_block 4096-tune 1 64 4 1 -tune 2 64 4 1

Make sure you get this message in stderr.

oclFFT plan class overrides requested: global radix 256; local radix 16; max workgroup size 256


With each crime and every kindness we birth our future.
ID: 1591332 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591336 - Posted: 24 Oct 2014, 13:08:03 UTC - in response to Message 1591332.  

-oclFFT_plan is case sensitive.

Maybe is a good ideia change that, all the other switches uses lower case letters.
ID: 1591336 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591338 - Posted: 24 Oct 2014, 13:11:32 UTC - in response to Message 1591336.  
Last modified: 24 Oct 2014, 13:14:42 UTC

-oclFFT_plan is case sensitive.

Maybe is a good ideia change that, all the other switches uses lower case letters.


Like Raistmer said its for advanced users.
Everybody can snip it out of the read me.

Sorry for the typo.

My mistake.


With each crime and every kindness we birth our future.
ID: 1591338 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591364 - Posted: 24 Oct 2014, 14:28:35 UTC

Try to use a smaller -ffa_block and you will see, the memory hogging WU will use a lot less memory (i like the name)
ID: 1591364 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591370 - Posted: 24 Oct 2014, 14:37:14 UTC - in response to Message 1591368.  

Try to use a smaller -ffa_block and you will see, the memory hogging WU will use a lot less memory (i like the name)


I'll drop back to my original (since v6) ffa_block and -ffa_block_fetch, as well as my unroll.


Unroll will be bigger then.
Watch out.


With each crime and every kindness we birth our future.
ID: 1591370 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591372 - Posted: 24 Oct 2014, 14:42:41 UTC - in response to Message 1591370.  
Last modified: 24 Oct 2014, 14:43:19 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.
ID: 1591372 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591373 - Posted: 24 Oct 2014, 14:44:00 UTC - in response to Message 1591372.  
Last modified: 24 Oct 2014, 14:45:39 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.


With each crime and every kindness we birth our future.
ID: 1591373 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591376 - Posted: 24 Oct 2014, 14:46:20 UTC - in response to Message 1591375.  

Try to use a smaller -ffa_block and you will see, the memory hogging WU will use a lot less memory (i like the name)


I'll drop back to my original (since v6) ffa_block and -ffa_block_fetch, as well as my unroll.


Unroll will be bigger then.
Watch out.


Hmm, is unroll 12 bigger than 16? Am I missing something here? Is it backwards?


I meant if you remove unroll in comand line.


With each crime and every kindness we birth our future.
ID: 1591376 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591379 - Posted: 24 Oct 2014, 14:48:31 UTC - in response to Message 1591377.  

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.


Could this be memory related? This old computer only have slow DDR2 memory, single channel.


Yes, possible.


With each crime and every kindness we birth our future.
ID: 1591379 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591381 - Posted: 24 Oct 2014, 14:50:41 UTC - in response to Message 1591373.  
Last modified: 24 Oct 2014, 14:52:36 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.

Did you try a large ffa_block? On my tests, unroll, ffa_fetch does not produce the problem, only large ffa_block on the range of 16K, smaller number produces only small memory usage.
ID: 1591381 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1591383 - Posted: 24 Oct 2014, 14:53:43 UTC - in response to Message 1591381.  
Last modified: 24 Oct 2014, 14:54:02 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.

Did you try a large ffa_block? On my tests, unroll, ffa_fetch does not produce the problem, only large ffa_block on the range of 16K, smaller number produces only small memory usage.


I did run your units with the settings you did.
No problem on 2 different hosts.


With each crime and every kindness we birth our future.
ID: 1591383 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1591390 - Posted: 24 Oct 2014, 15:11:28 UTC

Hm... Looks like no matter if I write in English or in Russian... nobody cares to read anyway :/

http://setiathome.berkeley.edu/forum_thread.php?id=75863&postid=1590653

No need to invent new entities w/o need.
ID: 1591390 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1591393 - Posted: 24 Oct 2014, 15:17:28 UTC - in response to Message 1591379.  
Last modified: 24 Oct 2014, 15:18:52 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.


Could this be memory related? This old computer only have slow DDR2 memory, single channel.


Yes, possible.

I have 3 ATI Host running the -oclFFT_plan with DDR2, only 1 card May be affected with higher Pulse counts. The Other 5 cards seem to be doing well. But, only 1 Host is running Windows...
ID: 1591393 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1591395 - Posted: 24 Oct 2014, 15:17:50 UTC - in response to Message 1591383.  
Last modified: 24 Oct 2014, 15:22:32 UTC

Interesting, Sten-Arne uses ATI, not NV like me, so aparently the problem is in the code itself and sure not host related.


It is since i dont get this high memory usage even with biggest unroll and ffa_fetch values.
Same on my sons 7850.

Did you try a large ffa_block? On my tests, unroll, ffa_fetch does not produce the problem, only large ffa_block on the range of 16K, smaller number produces only small memory usage.


I did run your units with the settings you did.
No problem on 2 different hosts.

Wierd... what i could tell you for sure, the same problem happening on few of my hosts with diferent CPU/GPU´s combinations, and not in all of the WU´s about one in 10 WU only.

Could be a CPU related problem? You use AMD CPU, I and Sten-Arne uses low end Intel CPU´s, who knows?

Let´s wait to see if Raistmer found something. For now with small ffa_block the memory hugging is relatively small (less than 150MB) so is not a headache since all my hosts has 8GB at least.

<edit> yes i read

I'll check if there is some additional memory leak or not


that´s exacly why i sugest wait to see if some aditional memory leak could be happening.
ID: 1591395 · Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 11 · Next

Message boards : Number crunching : Lunatics Windows Installer v0.43 Release Notes


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.