Message boards :
Number crunching :
The GTX750(Ti) Thread
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next
Author | Message |
---|---|
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
The unroll might be to high for your host. Try the following. -use_sleep -unroll 6 -oclfft_plan 256 16 256 -ffa_block 2830 -ffa_block_fetch 2830 -tune 1 64 4 1 -tune 2 64 4 1 Finnish a few tasks so i can check it later. With each crime and every kindness we birth our future. |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
The unroll might be to high for your host. Ok, cmd-line options are in place, running <app> <name>astropulse_v7</name> <gpu_versions> <gpu_usage>1</gpu_usage> <cpu_usage>1</cpu_usage> </gpu_versions> </app> Let's see, using this host as I normally do. What I did understand from readme was that -unroll will increase applications GPU memory consumption. EDIT: First one to run with these parameters: http://setiathome.berkeley.edu/result.php?resultid=3879941524. Not looking fast, running 27 minutes and 32% done. "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
First i need to see how app behaves. We will increase unroll step by step. As soon this task finnished successfully increase unroll to 8. But only unroll please. With each crime and every kindness we birth our future. |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
First i need to see how app behaves. OK Mike, I will do it. And: Thank You. "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
First i need to see how app behaves. Task http://setiathome.berkeley.edu/result.php?resultid=3879941524 completed, now unroll 8 now with task http://setiathome.berkeley.edu/result.php?resultid=3880025588 "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
Highlander Send message Joined: 5 Oct 99 Posts: 167 Credit: 37,987,668 RAC: 16 |
WezH, how is your memory config? Single or Dual-Channel setup (aka 1x 2GB or 2x 1GB MemModule)? - Performance is not a simple linear function of the number of CPUs you throw at the problem. - |
JarrettH Send message Joined: 14 Nov 02 Posts: 97 Credit: 25,385,250 RAC: 95 |
Hey Mike, maybe you know the answer to this while we're on the topic of configuring GPUs. I asked what commands I should use for a GT 730 here: http://setiathome.berkeley.edu/forum_thread.php?id=75996&postid=1618100 Is processing one task optimal? My 550 Ti does two fastest. It's the newer 730 with specs as on far right: http://www.geforce.com/hardware/desktop-gpus/geforce-gt-730/specifications Thanks |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Hey Mike, maybe you know the answer to this while we're on the topic of configuring GPUs. I asked what commands I should use for a GT 730 here: I need to know how many CU`s the 730 has. One instance on this card should be faster. With each crime and every kindness we birth our future. |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
Hey Wez, Sorry been gone all day, my day off so almost never online. Saw Mike is here, best person to help you. Hope it all gets sorted out. Zalster |
JarrettH Send message Joined: 14 Nov 02 Posts: 97 Credit: 25,385,250 RAC: 95 |
Hey Mike, maybe you know the answer to this while we're on the topic of configuring GPUs. I asked what commands I should use for a GT 730 here: It has two CU |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Hey Mike, maybe you know the answer to this while we're on the topic of configuring GPUs. I asked what commands I should use for a GT 730 here: Something to start with. -use_sleep -unroll 4 -oclfft_plan 256 16 512. Maybe some further tuning possible. But i want to see a few results first. With each crime and every kindness we birth our future. |
JarrettH Send message Joined: 14 Nov 02 Posts: 97 Credit: 25,385,250 RAC: 95 |
I'll try that. It is going in the i3 2100 machine if you're wondering. What is -use_sleep for? |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
I'll try that. It is going in the i3 2100 machine if you're wondering. What is -use_sleep for? It reduces CPU usage. With each crime and every kindness we birth our future. |
JarrettH Send message Joined: 14 Nov 02 Posts: 97 Credit: 25,385,250 RAC: 95 |
Do you think my 550 Ti machine is doing ok as well? I've haven't tinkered with it in a while. One core on an E6600 is free and the 550 Ti crunches two tasks. I believe it's using this: -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -hp Are those settings still current? The -oclFFT_plan switch is new to me. Could that be added? And finally, does ffa_block and fetch not apply to the 730? Thanks for all your help, Mike. |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
First i need to see how app behaves. Unroll 8 has been working over night, but runtimes are still high. What I should try next? "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
WezH, how is your memory config? Single or Dual-Channel setup (aka 1x 2GB or 2x 1GB MemModule)? Dual-channel, 2*1GB "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
First i need to see how app behaves. Now increase unroll to 10. But only change unroll again. With each crime and every kindness we birth our future. |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Do you think my 550 Ti machine is doing ok as well? I've haven't tinkered with it in a while. One core on an E6600 is free and the 550 Ti crunches two tasks. It could do better. First you are suffering from high CPU usage, this requires -use_sleep switch. -oclfft_plan is a new method to optimze fft kernel planning. Read the read me`s. Try -use_sleep -unroll 10 -oclfft_plan 256 16 256 -ffa_block 2830 -ffa_block_fetch 2830 -tune 1 64 4 1 -tune 2 64 4 1. Maybe unroll 12 is possible on the 550Ti. On the 730 ffa_block values wont give much improvement. We can still add it after you finnished a couple units. With each crime and every kindness we birth our future. |
WezH Send message Joined: 19 Aug 99 Posts: 576 Credit: 67,033,957 RAC: 95 |
Now increase unroll to 10. So far so good, no freezes. http://setiathome.berkeley.edu/result.php?resultid=3880635022 http://setiathome.berkeley.edu/result.php?resultid=3880568416 http://setiathome.berkeley.edu/result.php?resultid=3880809911 So next step will be? "Please keep Your signature under four lines so Internet traffic doesn't go up too much" - In 1992 when I had my first e-mail address - |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Now increase unroll to 10. Now change to unroll 12. With each crime and every kindness we birth our future. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.