.vlar WUs to NVIDIA GPUs (Problem Solved)

Message boards : Number crunching : .vlar WUs to NVIDIA GPUs (Problem Solved)
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 5 · Next

AuthorMessage
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1268847 - Posted: 8 Aug 2012, 1:12:04 UTC

I noticed my BOINC got .vlar WUs for my NVIDIA GPU (after the weekly server maintenance).
Normally this shouldn't happen.

I guess something wrong happened during the weekly server maintenance.

I wrote an EMail to the admins that they are informed about this.


* Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1268847 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1268863 - Posted: 8 Aug 2012, 2:21:28 UTC - in response to Message 1268847.  
Last modified: 8 Aug 2012, 2:47:16 UTC

I noticed my BOINC got .vlar WUs for my NVIDIA GPU (after the weekly server maintenance).
Normally this shouldn't happen.

I guess something wrong happened during the weekly server maintenance.

I wrote an EMail to the admins that they are informed about this.


I am getting lots of .vlar's but none are going to my NVIDIA GPU, they are all for the CPU (when I am able to connect to the server)

[edit] well scratch that, my number two computer just got a schedule of a bunch of .vlar's some of which are scheduled for the NVIDIA card. As before, they just do not download....... [/edit]

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1268863 · Report as offensive
musicplayer

Send message
Joined: 17 May 10
Posts: 2430
Credit: 926,046
RAC: 0
Message 1268876 - Posted: 8 Aug 2012, 3:35:22 UTC
Last modified: 8 Aug 2012, 4:16:37 UTC

Why not give it a try? Possibly it may be working out.

Checking it out, two of the .vlar tasks I had in my task list completed successfully using CUDA and was uploaded.

There may be some interesting things found at times in these type of tasks. If it could be done in a faster way, results possibly could be obtained.

But, of course, the error rate could be high and definitely success is not always guaranteed doing it this way.
ID: 1268876 · Report as offensive
Profile Zeus Fab3r
Avatar

Send message
Joined: 17 Jan 01
Posts: 649
Credit: 275,335,635
RAC: 597
Serbia
Message 1268879 - Posted: 8 Aug 2012, 3:38:42 UTC

Just got 44 new gpu tasks, of which 30 are vlars.

08-Aug-12 05:30:44	SETI@home	Scheduler request completed: got 44 new tasks
08-Aug-12 05:30:44	SETI@home	[sched_op_debug] estimated total CPU job duration: 0 seconds
08-Aug-12 05:30:44	SETI@home	[sched_op_debug] estimated total NVIDIA GPU job duration: 24075 seconds
08-Aug-12 05:30:46	SETI@home	Started download of 07my10ac.24787.20521.15.10.123.vlar
08-Aug-12 05:30:46	SETI@home	Started download of 07my10ac.24619.20521.14.10.124.vlar
08-Aug-12 05:30:46	SETI@home	Started download of 07my10ac.24787.20521.15.10.125.vlar
.
.


Who the hell is General Failure and why is he reading my harddisk?¿
ID: 1268879 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1268882 - Posted: 8 Aug 2012, 3:59:16 UTC - in response to Message 1268876.  

Why not give it a try? Possibly it may be working out.

Checking it out, two of the .vlar tasks I had in my task list completed successfully using CUDA and was uploaded.

There may be some interesting thing found at times in these type of tasks. If it could be done in a faster way, results possibly could be obtained.

But, of course, the error rate could be high and definitely success is not always guaranteed doing it this way.


It will be a good test of x41z.

ID: 1268882 · Report as offensive
Profile BMaytum
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 104
Credit: 4,382,041
RAC: 2
United States
Message 1268919 - Posted: 8 Aug 2012, 5:50:34 UTC - in response to Message 1268876.  
Last modified: 8 Aug 2012, 6:08:05 UTC

Why not give it a try? Possibly it may be working out.

Checking it out, two of the .vlar tasks I had in my task list completed successfully using CUDA and was uploaded.

There may be some interesting things found at times in these type of tasks. If it could be done in a faster way, results possibly could be obtained.

But, of course, the error rate could be high and definitely success is not always guaranteed doing it this way.


I got some VLARS sent to my GPU too today (7Aug 2012) after the weekly scheduled outage. Two of them completed successfuly but took over 8,100 elapsed seconds; two of them terminated with Time Limit Exceeded at 10,161 seconds. Four more were in progress, with over 1hr:40mins elapsed and about 1h:05m estimated remaining but since Estimated remaining was continually Increasing (thus the elapsed + estimated would eventually exceed 10,161 sec ~2hr:48min), I manually aborted them. I'm running a GTX580 with v301.42 nVidia drivers, with count= 0.25 (thus GPU can process 4 concurrent workunits). Using BOINC v 7.0.28, and Lunatics optimized MB applications.

When I later got another cache-load of 11 WUs, I manually aborted the 4 GPU VLARs in that batch immediately, given my less-than stellar experince with the prior 6 VLARs on my GPU as noted just above.
Sabertooth Z77, i7-3770K@4.2GHz, GTX680, W8.1Pro x64
P5N32-E SLI, C2D E8400@3Ghz, GTX580, Win7SP1Pro x64 & PCLinuxOS2015 x64
ID: 1268919 · Report as offensive
Kamu

Send message
Joined: 19 Jan 02
Posts: 56
Credit: 11,009,499
RAC: 0
Finland
Message 1268937 - Posted: 8 Aug 2012, 6:30:04 UTC

Yep, about 300 Nvidia vlar wu's here. : )

I'm wondering how we'll get credits with these. It's gonna take while to crunch those 300.

-Kimmo-

Computers: obelix
ID: 1268937 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1268941 - Posted: 8 Aug 2012, 6:52:15 UTC


Just had a look at my systems. A few VLARs on one NVidia card, several dozen on the other.
Grant
Darwin NT
ID: 1268941 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1268986 - Posted: 8 Aug 2012, 10:35:30 UTC - in response to Message 1268876.  
Last modified: 8 Aug 2012, 10:53:51 UTC

Why not give it a try? Possibly it may be working out.

Checking it out, two of the .vlar tasks I had in my task list completed successfully using CUDA and was uploaded.

There may be some interesting things found at times in these type of tasks. If it could be done in a faster way, results possibly could be obtained.

But, of course, the error rate could be high and definitely success is not always guaranteed doing it this way.


This one time you are getting the polite version of my reply, because other people, especially those fairly new to the project, may not know why a 'no VLAR to NVidia GPU' policy and code was established:

VLAR run incredibly slow with stock (6.08/6.09/6.10) apps. They can run so badly that the whole system freezes or outright crashes.
That was true with drivers two years ago - I don't think anybody ever established whether new drivers cope better and I'm pretty sure nobody wants to really try.

So at some point code was introduced into the scheduler to mark tasks below a certain AR as VLAR and not to send them to NVidia GPUs.

BTW, for the past few releases and RC x41z, optimised apps have not had that problem (system freeze) - but as has been shown in thread VLAR still run slower (a lot slower/too slow) - iirc at the other end of the processing spectrum from VHAR/shorties there just might be more to precess.
Since we hope v7 MB is still on the agenda and thus the GPU app will eventually become x41z, getting rid of a whole host of frist generation app problems including -12. We hadn't made up our collective mind yet if we had a case to lift the policy - you'd need production hosts testing real life performance on VLAR for a good statistical data basis.

ATM I'd rather have the restriction back - better to err on the side of caution.

NB If you have receieved VLAR and you start getting -177/-197 best to run Fred's rescheduler to extend the time limit, provided you are happy to let them process at such a slow pace.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1268986 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1268989 - Posted: 8 Aug 2012, 11:27:41 UTC - in response to Message 1268986.  
Last modified: 8 Aug 2012, 11:40:55 UTC

Just kiddin (with a little of true): Help us LadyL you and your team are our only hope!

As i remeembr, A single Vlar takes more than an hour to process on a 580 against the 12 min of a "normal" WU...
ID: 1268989 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1269004 - Posted: 8 Aug 2012, 11:52:12 UTC

IIRC the no vlar is a switch in the sheduler, that might have become lost when Eric upgraded - we've had that happen before.

Richard is working on 'how to get the server to resend the VLAR to the CPU' instructions, which he will post once he's confirmed the procedure works reliably.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1269004 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1269021 - Posted: 8 Aug 2012, 12:22:10 UTC - in response to Message 1269004.  

Richard is working on 'how to get the server to resend the VLAR to the CPU' instructions, which he will post once he's confirmed the procedure works reliably.

OK - as the Lady says...

First, these instructions are a first draft, and pretty telegraphic. They assume you're already familiar with the terminology, you know where to find the various BOINC files, and you know the rules for making changes to them. That's what we used to call ADVANCED USERS ONLY.

That's the only warning you're going to get. Read the instructions through carefully: check that you understand every point, and how to do it. If you're at all uncomfortable, don't even start. You're on your own from here.

  • Ensure you have a CPU application active for MB tasks
  • Unset 'Use NV GPU' (web preferences)
  • Set 'Use CPU' (web preferences)
  • Set 'No new tasks' (BOINC Manager)
  • Update project (BOINC Manager - if needed, some versions will report work immediately when NNT is set)
  • Suspend networking
  • Stop BOINC
  • Make backup copy of all .vlar datafiles
  • Edit client_state.xml: remove all '<result>' blocks for .vlar tasks
  • Restart BOINC
  • Restore all .vlar datafiles
  • Resume networking
  • Allow new work
  • Wait until all VLAR work has been resent to CPU
  • Set 'Use NV GPU' (web preferences)
  • Rinse and repeat

ID: 1269021 · Report as offensive
Kamu

Send message
Joined: 19 Jan 02
Posts: 56
Credit: 11,009,499
RAC: 0
Finland
Message 1269031 - Posted: 8 Aug 2012, 13:01:32 UTC

Thank You Richard!

It looks like everything went ok, resent, 287 VLARs.

-Kimmo-

Computers: obelix
ID: 1269031 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1269039 - Posted: 8 Aug 2012, 13:31:46 UTC - in response to Message 1268847.  
Last modified: 8 Aug 2012, 13:35:59 UTC

Sutaru Tsureku wrote:
(...)
I wrote an EMail to the admins that they are informed about this.


Just for info (because I got request)...

I EMailed Dave, Eric, Matt and Jeff.

I got response from Eric, that he will inform/ask Jeff.

So they are aware of the problem and will solve it so fast as possible.


* Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1269039 · Report as offensive
Profile skildude
Avatar

Send message
Joined: 4 Oct 00
Posts: 9541
Credit: 50,759,529
RAC: 60
Yemen
Message 1269050 - Posted: 8 Aug 2012, 13:48:03 UTC - in response to Message 1269039.  

FOr now I'd suggest getting the BOINC rescheduler and have those VLAR's rescheduled to your CPU. You'll still have the WU's onboard and they won't have to be resent by S@H when they fail on your Nvidia card.


In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope
ID: 1269050 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1269054 - Posted: 8 Aug 2012, 14:00:34 UTC - in response to Message 1269050.  

FOr now I'd suggest getting the BOINC rescheduler and have those VLAR's rescheduled to your CPU. You'll still have the WU's onboard and they won't have to be resent by S@H when they fail on your Nvidia card.

But you are likely to mess up the server's averaging and credit-granting records. I can't be bothered to work out whether you're likely to request too much credit for yourself (only to be dragged back down by your wingmate), or to request too little and drag your wingmate down with you.

The purpose of my 'resend' recipe was to get the server records updated to show the tasks allocated to CPU - that way, runtime and credit should be accurate.
ID: 1269054 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1269058 - Posted: 8 Aug 2012, 14:23:00 UTC

I noticed that Eric (or someone) added a "Use ATI GPU" preference to the project preference page, probably for the new AP - for - ATI application. I don't have an ATI GPU, but it was set to on. I turned it off and haven't got a VLAR for Nvidia on the last 5 successful gpu work requests.

Can't be certain there's cause and effect here, but if you don't have an ATI card, you might as well turn it off.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1269058 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1269060 - Posted: 8 Aug 2012, 14:28:21 UTC - in response to Message 1269058.  

I noticed that Eric (or someone) added a "Use ATI GPU" preference to the project preference page ...

It's in the standard BOINC web code, but it's hidden (because presumed useless) until there's an ATI application available for stock download. It will have appeared automatically when Eric added the ATI AP app last night (see news), but it's a good point about checking that the default values on your account are right for you.
ID: 1269060 · Report as offensive
Kevin Olley

Send message
Joined: 3 Aug 99
Posts: 906
Credit: 261,085,289
RAC: 572
United Kingdom
Message 1269117 - Posted: 8 Aug 2012, 16:03:07 UTC - in response to Message 1268986.  

This one time you are getting the polite version of my reply, because other people, especially those fairly new to the project, may not know why a 'no VLAR to NVidia GPU' policy and code was established:

VLAR run incredibly slow with stock (6.08/6.09/6.10) apps. They can run so badly that the whole system freezes or outright crashes.
That was true with drivers two years ago - I don't think anybody ever established whether new drivers cope better and I'm pretty sure nobody wants to really try.



I did, My GPU's (470's) will run VLAR's at 1 WU per card without any problems but is slow, 2 WU per card causes lag and 3 WU per card causes major problems.

I tried it when we had download problems a while ago.


Kevin


ID: 1269117 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1269192 - Posted: 8 Aug 2012, 18:35:12 UTC

Hmmm. My recipe worked very nicely while I only had a few VLARs, and they'd all arrived in a neat contiguous block. But now I've got a boatload more, and they're all dotted around individually in ones and twos.

Does anyone know of a nice automated way of finding/deleting a block like this?

<result>
<name>20fe12ag.23781.23954.10.10.174.vlar_2</name>
<final_cpu_time>0.000000</final_cpu_time>
<final_elapsed_time>0.000000</final_elapsed_time>
<exit_status>0</exit_status>
<state>2</state>
<platform>windows_intelx86</platform>
<version_num>610</version_num>
<plan_class>cuda_fermi</plan_class>
<wu_name>20fe12ag.23781.23954.10.10.174.vlar</wu_name>
<report_deadline>1348464226.000000</report_deadline>
<received_time>1344446227.080149</received_time>
<file_ref>
<file_name>20fe12ag.23781.23954.10.10.174.vlar_2_0</file_name>
<open_name>result.sah</open_name>
</file_ref>
</result>

- in other words, everything from <result> to </result> inclusive, but ONLY when they contain both vlar and version 610 - I don't want to delete the ones I've already had resent...
ID: 1269192 · Report as offensive
1 · 2 · 3 · 4 . . . 5 · Next

Message boards : Number crunching : .vlar WUs to NVIDIA GPUs (Problem Solved)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.