Open Beta test: SoG for NVidia, Lunatics v0.45 - Beta6 (RC again)
| Author | Message |
|---|---|
Brent Norman ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835
|
Hey Rasputin42, Are those blc5 tasks? I have noticed that those run more than 3 times slower, so they appear to be stalled, but they are still running. And GPU temps are really low when running them. |
|
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0
|
Hey Brent, They are not running slower, they do not run at all (after about the first 30%). Elapsed time keeps increasing, but the percentage stays exactly the same. |
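One way to tell a slow task from a stalled one is to sample the reported progress over a long enough window. The following is only a minimal sketch: the function name, the sample format, and the 10-minute window are all assumptions made for illustration, not part of BOINC or the SoG app. As noted later in the thread, the percentage BOINC displays is its own estimate, so a flat reading is a hint rather than proof.

```python
from typing import List, Tuple


def classify_progress(samples: List[Tuple[float, float]],
                      min_window_s: float = 600.0) -> str:
    """Classify a task from (elapsed_seconds, fraction_done) samples.

    The samples are assumed to be in time order. The 600 s default
    window is an arbitrary illustrative choice.
    """
    if len(samples) < 2:
        return "unknown"
    first_t, first_f = samples[0]
    last_t, last_f = samples[-1]
    # Too short an observation window cannot separate "slow" from "stalled".
    if last_t - first_t < min_window_s:
        return "unknown"
    # Any increase at all means the task is slow, not stuck.
    if last_f > first_f:
        return "progressing"
    return "stalled"


if __name__ == "__main__":
    # Elapsed time advances while the fraction stays at 30%.
    print(classify_progress([(0.0, 0.30), (300.0, 0.30), (900.0, 0.30)]))
```

A task that creeps from 30% to 31% over the same window would be reported as "progressing", which matches the slow-but-running case Brent describes.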
Mike Send message Joined: 17 Feb 01 Posts: 34680 Credit: 79,922,639 RAC: 80
|
The 720M is simply too slow to run multiple instances. LowPerformancePath is active, so -use_sleep is activated as well. Try only one instance. With each crime and every kindness we birth our future. |
rob smith ![]() Send message Joined: 7 Mar 03 Posts: 22920 Credit: 416,307,556 RAC: 380
|
With any "new" GPU installation it is worth running with only one task for a few days just to see what the thing will do in the base situation, then step it up to two for a few more days, finally if that is OK, push up to three. As Mike says I very much doubt that a GTX720M is up to running more than one task at a time. Bob Smith Member of Seti PIPPS (Pluto is a Planet Protest Society) Somewhere in the (un)known Universe? |
rob smith ![]() Send message Joined: 7 Mar 03 Posts: 22920 Credit: 416,307,556 RAC: 380
|
It sounds as though the system has gone into "thrash mode" - it is trying to run too many tasks at a time on the GPU and is failing, so BOINC thinks tasks are getting near their deadline, pushes them up the priority tree, and others get stuck in waiting mode. Trying to run too many tasks at a time will cause the GPU's task scheduling to struggle, particularly with the current crop of low angle tasks. You would probably do better to drop back to one task at a time on the GPU, even though you are used to running two or three higher angle tasks. Bob Smith Member of Seti PIPPS (Pluto is a Planet Protest Society) Somewhere in the (un)known Universe? |
|
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0
|
Well, they are all "fresh" tasks. That card runs 3 cuda50 tasks with no problem, but I was only running 2 at once. It runs one (SoG task) in about 26 min. As far as I know, if there is enough memory, it will run multiple instances, but less efficiently if you run too many. |
rob smith ![]() Send message Joined: 7 Mar 03 Posts: 22920 Credit: 416,307,556 RAC: 380
|
How many times must this be said - the critical thing with GPUs is not MEMORY, but the number of GPU "cores" and their management. There is probably enough memory to support half a dozen tasks, but if you try to run more than a couple of tasks (particularly SoG), the GPU's internal task manager will be seriously struggling long before you reach that number. Another thing to consider is that the current data from the servers is dominated by guppi (from the GBT), for which CUDA is not best suited. My GTX960 rig would quite happily run three "normal" Arecibo MB tasks, but when I tried running three guppi at once it started to sweat; it is much happier running two of them - that is quite a hit! Bob Smith Member of Seti PIPPS (Pluto is a Planet Protest Society) Somewhere in the (un)known Universe? |
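Whether running two or three tasks at once pays off comes down to simple throughput arithmetic. This is a rough sketch only: the helper names are made up for illustration, and the only figure taken from the thread is the roughly 26 minutes for a single SoG task. The per-task times for two concurrent tasks are hypothetical inputs you would measure yourself.

```python
def tasks_per_hour(concurrent: int, minutes_per_task: float) -> float:
    """Throughput when `concurrent` tasks each take `minutes_per_task`."""
    if concurrent < 1 or minutes_per_task <= 0:
        raise ValueError("need at least one task and a positive run time")
    return concurrent * 60.0 / minutes_per_task


def worth_running(concurrent: int, minutes_per_task: float,
                  single_minutes: float) -> bool:
    """True if running `concurrent` tasks beats one task at a time."""
    return (tasks_per_hour(concurrent, minutes_per_task)
            > tasks_per_hour(1, single_minutes))


if __name__ == "__main__":
    single = 26.0  # minutes for one SoG task, as reported in the thread
    # Hypothetical measurements for two concurrent tasks:
    for measured in (40.0, 60.0):
        verdict = "better" if worth_running(2, measured, single) else "worse"
        print(f"2 at once, {measured:.0f} min each: {verdict} than 1 at a time")
```

In other words, two at once only helps while each task finishes in less than twice the single-task time; beyond that break-even point the extra instance costs throughput.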
Stephen "Heretic" ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628
|
My main problem, even though I have the sleep command running, is that I have a gpu wu 'running HP' and a gpu wu in 'waiting to run', it's probably nothing, but I thought I'd mention it. . . Can you monitor the memory usage on your GPU card? If there is insufficient memory it can exit a task leaving it in the "Waiting to run" state. |
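For NVIDIA cards, one way to watch memory usage is to poll `nvidia-smi`. The sketch below assumes `nvidia-smi` is installed and on the PATH, and that the driver reports memory figures for the card (some older or mobile GPUs return a placeholder instead of a number, in which case the parser raises `ValueError`). The function names are illustrative only.

```python
import subprocess
from typing import List, Tuple

# Ask nvidia-smi for used and total memory, one CSV line per GPU, in MiB.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_memory_csv(text: str) -> List[Tuple[int, int]]:
    """Turn lines such as '1843, 2048' into (used_mib, total_mib) pairs."""
    rows = []
    for line in text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        rows.append((used, total))
    return rows


def read_gpu_memory() -> List[Tuple[int, int]]:
    """Run nvidia-smi once and return one (used, total) pair per GPU."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_memory_csv(result.stdout)


if __name__ == "__main__":
    for index, (used, total) in enumerate(read_gpu_memory()):
        print(f"GPU {index}: {used} / {total} MiB in use")
```

Running this every few seconds while two tasks are active should show whether usage gets close to the card's total before a task drops into "Waiting to run".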
Stephen "Heretic" ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628
|
. . . Hello Richard, . . It is doing so as we speak. But I was just wondering. :) |
Stephen "Heretic" ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628
|
Hey Rasputin42, . . SoG is running nicely on this machine, but blc5 tasks certainly take longer than blc6; they fooled me into thinking something was wrong at one point. I am wondering how blc7 will behave as I have a lot of them coming up. |
Stephen "Heretic" ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628
|
With any "new" GPU installation it is worth running with only one task for a few days just to see what the thing will do in the base situation, then step it up to two for a few more days, finally if that is OK, push up to three. As Mike says I very much doubt that a GTX720M is up to running more than one task at a time. . . My GT730 is only barely capable of running doubles, and comes to grief when a Guppie swims by. |
Stephen "Heretic" ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628
|
How many times must this be said - the critical thing with GPUs is not MEMORY, but the number of GPU "cores" and their management. . . FWIW, I received the suggestion to run -sbs 512 on my GTX950 to persuade Guppies to play nice. But that wanted more than the 2048MB on the card, and the result was exactly this: it dropped out of WUs and left them in the "waiting to run" state. . . The max it will support is -sbs 384, so it may be a lack of sufficient memory; worth checking out. |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242
|
Wait for something that works better, I get the same thing, only time to completion becomes days, instead of minutes, I tried 3, then 6, then 33, same result, I can't figure this out, I just went back to cuda42, I give up. Zoom, I would suggest posting your entire commandline rather than just snips of it. It's hard to tell what the computer sees when you only post a portion of it. |
|
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0
|
-use_sleep -instances_per_device N: 2 I think, that should be: -use_sleep -instances_per_device 2 |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242
|
-use_sleep -instances_per_device N: 2 Rasputin is correct. Zoom, try this: -use_sleep -sbs 512 -total_GPU_instances_num 2 -instances_per_device 2 Edit: only 1 GPU in your machine, correct? |
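The "N: 2" slip is easy to catch mechanically before restarting BOINC. Below is a small sketch of such a check. It only knows the options mentioned in this thread (spelled as in Rasputin42's correction), and which of them take an integer is inferred from how they are used here, so treat both sets as assumptions rather than the app's full option list.

```python
from typing import List

# Options seen in this thread; which ones take a value is inferred from usage.
VALUE_FLAGS = {"-sbs", "-instances_per_device", "-total_GPU_instances_num"}
SWITCH_FLAGS = {"-use_sleep"}


def check_cmdline(cmdline: str) -> List[str]:
    """Return a list of problems found in a command line (empty if none)."""
    problems = []
    tokens = cmdline.split()
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok in SWITCH_FLAGS:
            i += 1
        elif tok in VALUE_FLAGS:
            # A value flag must be followed directly by a plain integer.
            if i + 1 < len(tokens) and tokens[i + 1].isdigit():
                i += 2
            else:
                problems.append(f"{tok} should be followed by an integer")
                i += 1
        else:
            problems.append(f"unexpected token: {tok}")
            i += 1
    return problems


if __name__ == "__main__":
    for line in ("-use_sleep -instances_per_device N: 2",
                 "-use_sleep -instances_per_device 2"):
        print(line, "->", check_cmdline(line) or "ok")
```

Copying the placeholder "N:" from a readme into the command line is reported as three problems (the missing integer plus two stray tokens), while the corrected line passes.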
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242
|
You can always decrease the -sbs to 256 if you want and see if that works better |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121
|
I have noticed, that when running 2 instances of sog, it runs both tasks for a while and then one makes no progress any more. It finishes the other and starts a new one which it continues to process. The first one is still making no progress, but the elapsed time keeps going. If I suspend all other tasks, it finally makes progress again and eventually finishes. Link to such task result? |
|
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0
|
I have noticed, that when running 2 instances of sog, it runs both tasks for a while and then one makes no progress any more. It finishes the other and starts a new one which it continues to process. The first one is still making no progress, but the elapsed time keeps going. If I suspend all other tasks, it finally makes progress again and eventually finishes. I will have to set it up again. Currently doing cuda50. |
Richard Haselgrove ![]() Send message Joined: 4 Jul 99 Posts: 14690 Credit: 200,643,578 RAC: 874
|
I have noticed, that when running 2 instances of sog, it runs both tasks for a while and then one makes no progress any more. It finishes the other and starts a new one which it continues to process. The first one is still making no progress, but the elapsed time keeps going. If I suspend all other tasks, it finally makes progress again and eventually finishes. Stephen's message 1794739 earlier, about VRAM overcommit, may be relevant. Link to result would be helpful - we can check if it's a BOINC client which supports temporary exit. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121
|
I have noticed, that when running 2 instances of sog, it runs both tasks for a while and then one makes no progress any more. It finishes the other and starts a new one which it continues to process. The first one is still making no progress, but the elapsed time keeps going. If I suspend all other tasks, it finally makes progress again and eventually finishes. No, it's not. A habit of posting at least a link to the result in question should be developed. The app prints how much memory it uses for a particular task in its stderr. Regarding SoG task progress: if the task is VHAR and processing enters the SoG-only phase, the app will enqueue all the work to be done for that task to the GPU and then await task completion. What BOINC shows is irrelevant (as we know, it shows its own guesses). Because there is no lack of work for the GPU to do even from a single task, there is nothing bad if the runtime decides to finish the kernel sequence from one task before switching to another (even if that is a real effect, which I strongly doubt). After all, there is no such thing as pre-emptive context switching on GPUs so far, AFAIK. |