| Author |
Message |
|
|
|
Installed Lunatics on a second PC to crunch on a HD6970....
Getting tasks ok but the GPU is only running in bursts from zero to about 25%.
Can anyone help with this please?
____________
|
|
|
|
|
|
How many tasks per card?
Are you OCing at all that may be causing downclocks?
____________
Executive Director GPU Users Group Inc. -
brad@gpuug.org |
|
|
|
|
|
Only 1 task per card, I haven't changed anything in Lunatics.
No OCing, everything is at standard settings...
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
|
Thats the low GPU usage bug in the drivers.
Suspend the task a few seconds and resume again.
That should help.
____________
|
|
|
|
|
|
Is there a fix for this bug?
Do I have to suspend and resume every new task or every time I reboot?
Is it a bug in the Catalyst drivers? I have version 12.3
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
|
Yes, its a bug in catalyst drivers.
Sometimes suspending GPU cures for a few units in a row.
Sometims only for 1 unit.
You need to check every once in a while.
____________
|
|
|
|
|
|
Um... not exactly the way I want to crunch.
Who wants to babysit their PC, suspending and resuming every 15 mins...
Is this what you and everyone else does with their ATI card?
What version Catalyst did it start from and are we stuck with this crap forever?
____________
|
|
|
|
|
|
you won't get 100% usage out of an ATI 6970 running 1 WU.
Try running 3 at a time.
____________
Proud member of TSWB.
End terrorism by building a school
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
Um... not exactly the way I want to crunch.
Who wants to babysit their PC, suspending and resuming every 15 mins...
Is this what you and everyone else does with their ATI card?
What version Catalyst did it start from and are we stuck with this crap forever?
This bug started with 11.12. driver iirc.
I really hope this will be fixed when OpenCL 1.2 is fully implemented.
Like skildude mentioned you can get better performace out of your card running 2 or 3 units at the same time.
I´m running 2 atm on my 5850.
You need to modify your appinfo.
Edit
For 2 instances
<cmdline>-help -period_iterations_num 20 -instances_per_device 2</cmdline>
<count>0.5</count>
For 3 its of course count 0.33.
Dont forget to shut down Boinc before you change it.
____________
|
|
|
|
|
|
I suspended my home crunching with the warmer weather coming in. however I was using 12.1 on my 6870 without down clocking. I did get a task stuck now and then where the time continued to increment without any progress being done. I think there were 3 or 4 over 6 months or so.
____________
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the BP6/VP6 User Group today! |
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
|
Its not down clocking HAL.
Its just low GPU utilisation.
____________
|
|
|
|
|
|
Ok this is from my appinfo...
Mike, do you have to do the suspend/resume thing on your 5850 or does it run at 100% by itself?
Do I need to change the <count>1</count> to .50 and the -instances_per_device from 1 to 2, or just the count?
Also this is in there twice for seti enhanced, x2 for astropulse v6 and x4 for astropulse v505, do I have to change them all?
<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>610</version_num>
<platform>windows_x86_64</platform>
<avg_ncpus>0.05</avg_ncpus>
<max_ncpus>0.05</max_ncpus>
<plan_class>ati13ati</plan_class>
<cmdline>-period_iterations_num 20 -instances_per_device 1</cmdline>
<coproc>
<type>ATI</type>
<count>1</count>
</coproc>
<file_ref>
<file_name>MB6_win_x86_SSE3_OpenCL_ATi_HD5_r390.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>MultiBeam_Kernels_r390.cl</file_name>
<copy_file/>
</file_ref>
</app_version>
</app_info>
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
|
If you want to run 2 instances you need to change both instances_per_device to 2 and count to 0.5.
Thats different to cuda app because ATI is OpenCL.
I need to suspend the app also every once in while.
But running 2 or 3 instances is not so bad because mostly at least 1 unit gets enough utilisation.
I´d strongly suggest 3 instances on your 6970.
So number of instances 3 and count 0.33.
You can change it as well on the astropulse apps.
____________
|
|
|
|
|
|
Ok thanks Mike,
I have changed Lunatics to run 3 WU's at once, the GPU is running at about 90% and sometimes up to 98%.
When its doing 3 work units (seti@home enhanced 6.10), the GPU seems to be giving 1 work unit about 80% and the other 2 about 5% each.... one finishes in about 3 mins and the others take about 30 mins.
Is this normal?
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
|
Yes, thats normal.
But under this conditions you get better output.
____________
|
|
|
|
|
|
Well I just ran 3 x AstroPulse v6.
All started at the same time,
First: 53 mins
Second: 150 mins
Third: 260 mins
Doesn't like these as much, the GPU would only run at 65%...
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
Well I just ran 3 x AstroPulse v6.
All started at the same time,
First: 53 mins
Second: 150 mins
Third: 260 mins
Doesn't like these as much, the GPU would only run at 65%...
Those are not representative because 3 are overflow units.
They stopp working after 30 pulses were found.
At least you did 4 APs in 4 hours.
Not so shabby i would say.
Next step is to increase ffa_block and ffa_block_fetch.
Change comandline params in astropulse section.
<cmdline>-instances_per_device 3 -unroll 12 -ffa_block 8192 -ffa_block_fetch 4096</cmdline>
This will speed up your card further.
Keep in mind on astropulses low GPU usage is not always caused by the low GPU usage bug.
Go to your task manager and check how much CPU is in use by the AP app.
If its above 3% its a high blanked unit so low GPU usage is normal because blanking is calculated by the CPU only.
____________
|
|
|
|
|
|
My CPU's are normally always at 100% because I'm crunching Primegrid on them.
But I will suspend PG for a while and see what happens with the AstroPulse tasks.
This is what I currently have but I will change it:
<cmdline>-instances_per_device 3 -unroll 4 -ffa_block 2048 -ffa_block_fetch 1024 -sbs 128</cmdline>
What does that do to make the card faster?
____________
|
|
|
Mike Volunteer tester
 Send message
Joined: 17 Feb 01 Posts: 19511 Credit: 21,233,663 RAC: 26,837

|
My CPU's are normally always at 100% because I'm crunching Primegrid on them.
But I will suspend PG for a while and see what happens with the AstroPulse tasks.
This is what I currently have but I will change it:
<cmdline>-instances_per_device 3 -unroll 4 -ffa_block 2048 -ffa_block_fetch 1024 -sbs 128</cmdline>
What does that do to make the card faster?
I know what you are using.
Higher unroll will simply load more data at once in one kernel call.
More VRAM in use of course.
Higher ffa block will compute them faster.
Of course Raistmer can explain better for sure.
____________
|
|
|
|
|
|
Hey Mike, do I need to remove -sbs 128 from the <cmdline> ?
Also, I have Lunatics installed on another PC running a GTX 590.
That was about a year ago, is there a newer version I need to update to or is the old one ok?
And do you know what to change in the app_info to maximise the 590? I think its only running 2 WU's at a time at about 80%. Thanks
____________
|
|
|