app_info for AP500, AP503, MB603 and MB608


log in

Advanced search

Message boards : Number crunching : app_info for AP500, AP503, MB603 and MB608

1 · 2 · 3 · 4 . . . 12 · Next
Author Message
Profile MarkJ
Volunteer tester
Avatar
Send message
Joined: 17 Feb 08
Posts: 936
Credit: 19,596,974
RAC: 17,729
Australia
Message 876502 - Posted: 17 Mar 2009, 11:20:33 UTC

I was going to post this under another thread in reply to someone's question but the thread got locked just as I was about to post. Below are some instructions I typed up about how to have a combined app_info. Please take note of the disclaimer.

MarkJ


Setting up Seti@home to run cuda and non-cuda apps

You will need a copy of BOINC 6.6.15 (or later). This won't work with earlier versions of BOINC.


Disclaimer:
BOINC 6.6.15 is a Development version and may not function correctly. If you are not comfortable with editing an app_info, using optimised apps and running development versions of BOINC then this is not for you.

BOINC is very unforgiving of an incorrect app_info and usually will delete all tasks if you get it wrong.

Do NOT use Internet Explorer to edit the xml files, it will stuff up your app_info. Use Notepad or another text editor.

Thanks go to Richard Haselgrove and Claggy for ideas in building this app_info.xml file.


Notes:
Upgrade your BOINC client first and get it working before changing anything else.

The app_info.xml below is based on a 32 bit Windows platform. If you are running on another platform you may need to add/amend the <platform_name> tags.

My computers support the SSSE3 instruction set. SSE2, SSE3 or SSE4.1 may be more appropiate for you. You will need to amend the program names as appropiate.

I've assumed that you have your cuda-capable card up and running and have the necessary nvidia drivers (minimum version is 180.48).

Programs needed:
a) Optimised multibeam and optimised Astropulse

AK_v8_win_SSSE3x.exe
ap_5.00r103_SSE3.exe
ap_5.03r112_SSE3.exe

b) Cuda multibeam and support libraries
setiathome_6.08_windows_intelx86__cuda.exe
cudart.dll
cufft.dll
libfftw3f-3-1-1a_upx.dll


1. Download and install BOINC 6.6.15. Get this working before changing anything else.

2. Empty your cache of Seti@home work. This is best achieved by setting the project to No new work and letting it finish off its tasks. Make sure they are all uploaded and reported, there should be none on your tasks list.

3. Download the optimised multibeam and astropulse apps (usually from the lunatics web site) if you don't already have them.

4. Download the cuda multibeam app (from the Seti web site) if you don't already have them. If you run the stock cuda multibeam app then you should already have these in your projects\Setiathome.berkeley.edu folder.

5. Disable network communications in BOINC. Shutdown BOINC. Make sure its shutdown.

6. Browse your client_state.xml file (its in the BOINC data directory) and look for the entry <p_flops>. We need to use this number. Do NOT change this file.

7. Browse the BOINC log file to get the estimated speed of your GPU. This is usually given at the top of the log in Gflops. My 9800GT was estimated at 60Gflops.

8. For each of the apps multiply the p_flops value by the factor below and put this into the appropiate flops entry in the app_info given below. For multibeam 608 you need the estimated Gflops.

Astropulse 500 flops = p_flops x 2.25
Astropulse 503 flops = p_flops x 2.6
Multibeam 603 flops = p_flops x 1.75
Multibeam 608 flops = Est.Gflops x 0.2 (eg 60,000,000,000 x 0.2 = 12,000,000,000)

9. Make sure you have all the programs listed above in the projects\Setiathome.berkeley.edu folder. If not copy them there.

10. Save your app_info in the projects\Setiathome.berkeley.edu folder.

11. Start up BOINC. Check the messages tab to see if it lists any [file error] messages. If there are shut BOINC, check you have the correct program names referenced. Go back to step 9.

12. If okay then enable new work for the Seti@home project.

13. Enable network communications again.

14. It should now download work of all types. If not check your Seti@home preferences on the Seti web site, that Astropulse, Astropulse_v5 and "Use GPU if available" are all ticked. If you have a slower computer you may not get astropulse work units anyway.


<app_info>
<app>
<name>astropulse</name>
</app>
<file_info>
<name>ap_5.00r103_SSE3.exe</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse</app_name>
<version_num>500</version_num>
<flops>5306156897</flops>
<file_ref>
<file_name>ap_5.00r103_SSE3.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app>
<name>astropulse_v5</name>
</app>
<file_info>
<name>ap_5.03r112_SSE3.exe</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse_v5</app_name>
<version_num>503</version_num>
<flops>6131559081</flops>
<file_ref>
<file_name>ap_5.03r112_SSE3.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app>
<name>setiathome_enhanced</name>
</app>
<file_info>
<name>AK_v8_win_SSSE3x.exe</name>
<executable/>
</file_info>
<file_info>
<name>setiathome_6.08_windows_intelx86__cuda.exe</name>
<executable/>
</file_info>
<file_info>
<name>cudart.dll</name>
<executable/>
</file_info>
<file_info>
<name>cufft.dll</name>
<executable/>
</file_info>
<file_info>
<name>libfftw3f-3-1-1a_upx.dll</name>
<executable/>
</file_info>
<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>603</version_num>
<platform>windows_intelx86</platform>
<flops>4127010920</flops>
<file_ref>
<file_name>AK_v8_win_SSSE3x.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>608</version_num>
<platform>windows_intelx86</platform>
<avg_ncpus>0.127970</avg_ncpus>
<max_ncpus>0.127970</max_ncpus>
<flops>12000000000</flops>
<plan_class>cuda</plan_class>
<file_ref>
<file_name>setiathome_6.08_windows_intelx86__cuda.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>cudart.dll</file_name>
</file_ref>
<file_ref>
<file_name>cufft.dll</file_name>
</file_ref>
<file_ref>
<file_name>libfftw3f-3-1-1a_upx.dll</file_name>
</file_ref>
<coproc>
<type>CUDA</type>
<count>1</count>
</coproc>
</app_version>
</app_info>

____________
BOINC blog

Profile ccappel
Avatar
Send message
Joined: 27 Jan 00
Posts: 362
Credit: 1,516,412
RAC: 0
United States
Message 876541 - Posted: 17 Mar 2009, 15:25:45 UTC

This is great info. I nominate this thread to be made sticky.

Profile perryjay
Volunteer tester
Avatar
Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 14,913,520
RAC: 12,237
United States
Message 876553 - Posted: 17 Mar 2009, 15:55:39 UTC - in response to Message 876541.

This is great info. I nominate this thread to be made sticky.





I'll second that motion.
____________


PROUD MEMBER OF Team Starfire World BOINC

Profile ignorance is no excuse
Avatar
Send message
Joined: 4 Oct 00
Posts: 9529
Credit: 44,433,274
RAC: 0
Korea, North
Message 876568 - Posted: 17 Mar 2009, 16:22:53 UTC
Last modified: 17 Mar 2009, 16:27:57 UTC

I'm not so sure that running the development app is such a good idea for the general public. I'd say yes to sticky if the 6.6.15 were the standard. Currently 6.4.7 is. Mention of which OS this has worked on would be great as well...
____________
In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope

End terrorism by building a school

Profile ccappel
Avatar
Send message
Joined: 27 Jan 00
Posts: 362
Credit: 1,516,412
RAC: 0
United States
Message 876578 - Posted: 17 Mar 2009, 20:34:39 UTC - in response to Message 876568.

Ok, postpone my nomination to when 6.6.15 (or whichever becomes the stable version) is available for general release. :)

Profile MarkJ
Volunteer tester
Avatar
Send message
Joined: 17 Feb 08
Posts: 936
Credit: 19,596,974
RAC: 17,729
Australia
Message 876598 - Posted: 17 Mar 2009, 21:04:45 UTC - in response to Message 876568.

I'm not so sure that running the development app is such a good idea for the general public. I'd say yes to sticky if the 6.6.15 were the standard. Currently 6.4.7 is. Mention of which OS this has worked on would be great as well...


I made it very clear that it relates to a Development version in the disclaimer. While it would be nice if it worked under earlier versions of BOINC, unfortunately it doesn't.

Re: The OS, It says at the top. Windows 32 bit. To be more precise XP Pro with SP3.
____________
BOINC blog

Profile Blurf
Volunteer tester
Send message
Joined: 2 Sep 06
Posts: 7412
Credit: 6,485,676
RAC: 4,380
United States
Message 876699 - Posted: 17 Mar 2009, 23:13:58 UTC - in response to Message 876553.

This is great info. I nominate this thread to be made sticky.



I'll second that motion.


Sorry guys-moderator have been repeatedly asked by users to keep the stickies to a minimum wherever possible.
____________


chuck495
Volunteer tester
Send message
Joined: 11 Aug 00
Posts: 48
Credit: 7,280,608
RAC: 29,598
United States
Message 876720 - Posted: 17 Mar 2009, 23:41:01 UTC

<p_fpops>?

Richard Haselgrove
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8375
Credit: 46,704,413
RAC: 20,102
United Kingdom
Message 876724 - Posted: 17 Mar 2009, 23:48:31 UTC - in response to Message 876720.

<p_fpops>?

"Floating Point Operations per Second" - a measure of speed (CPU in this case).

You can copy the value from the line with this tag in the first block of data in client_state.xml: alternatively, the same figure is shown on the 'host details' page on this website, in the line "Measured floating point speed 1830.43 million ops/sec".

Notice that the speed shown on the website is in millions of <p_fpops>: you need to multiply those millions back in before using the number in Mark's formula.

Profile MarkJ
Volunteer tester
Avatar
Send message
Joined: 17 Feb 08
Posts: 936
Credit: 19,596,974
RAC: 17,729
Australia
Message 876832 - Posted: 18 Mar 2009, 8:51:01 UTC - in response to Message 876724.

<p_fpops>?

"Floating Point Operations per Second" - a measure of speed (CPU in this case).

You can copy the value from the line with this tag in the first block of data in client_state.xml: alternatively, the same figure is shown on the 'host details' page on this website, in the line "Measured floating point speed 1830.43 million ops/sec".

Notice that the speed shown on the website is in millions of <p_fpops>: you need to multiply those millions back in before using the number in Mark's formula.


I actually cut it from client_state and pasted it into calculator under windows and then did the multiplication. Less chance of errors that way :)

You'll note that I dropped off the decimals in the app_info provided. I figure a half a fpop isn't going to make any difference.
____________
BOINC blog

john deneer
Volunteer tester
Avatar
Send message
Joined: 16 Nov 06
Posts: 331
Credit: 20,996,195
RAC: 0
Netherlands
Message 876893 - Posted: 18 Mar 2009, 16:36:09 UTC - in response to Message 876502.

Taken from the app_info.xml posted:

<platform>windows_intelx86</platform>

Does this actually do anything? And should it be changed when running on the 64-bit version of Windows, or does it only have to be changed when running on Linux or OS X?

I'm asking because I feel that after building up my courage for a week I am now ready to adopt 6.6.14 (or .15) on my rig :-) But it runs under x64, not x86.

Another question: if I don't want to run multibeam on the cpu, do I just remove the part describing the 6.03 application (ak v8, that is) or are more drastic measures necessary?

Regards,
John.

Fred W
Volunteer tester
Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 876939 - Posted: 18 Mar 2009, 19:11:01 UTC - in response to Message 876893.

Taken from the app_info.xml posted:

<platform>windows_intelx86</platform>

Does this actually do anything? And should it be changed when running on the 64-bit version of Windows, or does it only have to be changed when running on Linux or OS X?

I'm asking because I feel that after building up my courage for a week I am now ready to adopt 6.6.14 (or .15) on my rig :-) But it runs under x64, not x86.

Another question: if I don't want to run multibeam on the cpu, do I just remove the part describing the 6.03 application (ak v8, that is) or are more drastic measures necessary?

Regards,
John.

The equivalent line from my installation:

<platform>windows_x86_64</platform>

Please do change it - I trashed a lot of WU's because I forgot about this wrinkle :-(

With 6.6.14 and .15 you won't get any 6.03's anyway even if you leave the AK_v8 line in there. It is all branded as MultiBeam so the server uses the highest version number (i.e. the 6.08) and you only get CUDA branded MB.

F.
____________

Richard Haselgrove
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8375
Credit: 46,704,413
RAC: 20,102
United Kingdom
Message 876943 - Posted: 18 Mar 2009, 19:32:49 UTC - in response to Message 876939.

With 6.6.14 and .15 you won't get any 6.03's anyway even if you leave the AK_v8 line in there. It is all branded as MultiBeam so the server uses the highest version number (i.e. the 6.08) and you only get CUDA branded MB.

F.

Not true. With 6.6.14 and above, you can receive (I have received) 603 work for AK_v8 as part of a standard work fetch from the project, as well as 608 CUDA work. You can also (and I have done this as a test, but I don't recommend it) ask for and receive AK_v8 work branded as 608, and CUDA work branded as 608, all in the same configuration.

The point is, the <plan_class>cuda</plan_class> line (in the 608 section in MarkJ's example) now serves to differentiate that section from the 603 section: MultiBeam and MultiBeam CUDA, although being identical tasks and datafiles, can now be tracked separately through the system and requested individually.

That's why v6.6.14 and above is such a big upgrade, and has to be handled with such care.

(And Fred - it's also why I'm looking forward with such anticipation to the 608 --> 603 rebranding script for VLAR. One tester, at your service whenever called upon!)

Fred W
Volunteer tester
Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 876953 - Posted: 18 Mar 2009, 19:53:23 UTC - in response to Message 876943.

With 6.6.14 and .15 you won't get any 6.03's anyway even if you leave the AK_v8 line in there. It is all branded as MultiBeam so the server uses the highest version number (i.e. the 6.08) and you only get CUDA branded MB.

F.

Not true. With 6.6.14 and above, you can receive (I have received) 603 work for AK_v8 as part of a standard work fetch from the project, as well as 608 CUDA work. You can also (and I have done this as a test, but I don't recommend it) ask for and receive AK_v8 work branded as 608, and CUDA work branded as 608, all in the same configuration.

The point is, the <plan_class>cuda</plan_class> line (in the 608 section in MarkJ's example) now serves to differentiate that section from the 603 section: MultiBeam and MultiBeam CUDA, although being identical tasks and datafiles, can now be tracked separately through the system and requested individually.

That's why v6.6.14 and above is such a big upgrade, and has to be handled with such care.

(And Fred - it's also why I'm looking forward with such anticipation to the 608 --> 603 rebranding script for VLAR. One tester, at your service whenever called upon!)

My apologies, then, for providing duff info. I was going by my own experience which is that I have never received a 603 WU since I fired up my graphics card. I bow to Richard's differing experience and assume I was just "unlucky" - but then I prefer to leave as much CPU time for AP as possible anyway (apart from the darned VLARs); seems much cleaner that way.

(And Richard - almost there. Just checking the automation on Vista and XP. Think on this, though. With (6.6.x) Boinc's propensity to process CUDA WU's in deadline order, a run of the script to re-brand the VLARs in the cache followed by a work fetch often brings down more VLAR's that have earlier deadlines than the existing non-VLARs so whatever the size of cache some get crunched on the GPU anyway. Another effect of the same "bug" that I noticed immediately following Tuesday's outage was that CUDAs were downloaded with progressively closer deadlines so that I eventually had 8 partially crunched "waiting to run" which had been usurped be those that came after.)

F.
____________

Richard Haselgrove
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8375
Credit: 46,704,413
RAC: 20,102
United Kingdom
Message 877047 - Posted: 19 Mar 2009, 0:15:22 UTC - in response to Message 876953.

... I have never received a 603 WU since I fired up my graphics card ...

(And Richard - almost there. Just checking the automation on Vista and XP. Think on this, though. With (6.6.x) Boinc's propensity to process CUDA WU's in deadline order, a run of the script to re-brand the VLARs in the cache followed by a work fetch often brings down more VLAR's that have earlier deadlines than the existing non-VLARs so whatever the size of cache some get crunched on the GPU anyway. Another effect of the same "bug" that I noticed immediately following Tuesday's outage was that CUDAs were downloaded with progressively closer deadlines so that I eventually had 8 partially crunched "waiting to run" which had been usurped be those that came after.)

F.

I only got the 603s with deliberate testing (by disabling AP allocation). But I got 'em.

re VLARs - what threshhold are you using for defining VLAR? I think you can cap it at 0.05, which is the definition of a 'true' VLAR. I believe Raistmer took the dividing line a bit higher for safety - something like 0.14

If you take the 0.05 line, then all VLARs will have exactly the same deadline interval after issue by the scheduler, which means that a 'new' VLAR will never queue-jump another VLAR: but you're right, a new VLAR could easily jump the next available non-VLAR.

john deneer
Volunteer tester
Avatar
Send message
Joined: 16 Nov 06
Posts: 331
Credit: 20,996,195
RAC: 0
Netherlands
Message 877176 - Posted: 19 Mar 2009, 12:32:34 UTC - in response to Message 876939.

[quoteThe equivalent line from my installation:

<platform>windows_x86_64</platform>

Please do change it - I trashed a lot of WU's because I forgot about this wrinkle :-(
[/quote]

Thanks Fred, I will use that one as well, then.

The funny thing is: I'm at this moment running 6.6.15 with my 'old' app_info.xml that I had for 6.4.7. It has optimized versions of old and new astropulse and the stock cuda application in it. None of the fancy stuff that the proposed app_info has.

I upgraded the old boinc by installing 6.6.15 right over it, changing nothing else (well, I removed my cc_config because it had the n+1 cpus in it). When I started the new boincmanager it picked up the old astropulse work flawlessly and started asking for (and got, hooray) new cuda work. It worked right out of the box, somewhat to my surprise.

I'm bound to start tinkering with it, though. I'll probably end up using a copy of the app_info provided by MarkJ. Major reason to do so is that I also feel the need to do something about those annoying vlar cuda's, so I'll have a description of akv8 ready to get those things offloaded to the cpu when your fix is ready :-)

Regards,
John.

john deneer
Volunteer tester
Avatar
Send message
Joined: 16 Nov 06
Posts: 331
Credit: 20,996,195
RAC: 0
Netherlands
Message 877410 - Posted: 20 Mar 2009, 9:33:40 UTC - in response to Message 876939.

The equivalent line from my installation:

<platform>windows_x86_64</platform>

Please do change it - I trashed a lot of WU's because I forgot about this wrinkle :-(

I had the _x86_64 at both occurrences of <platform> but it trashed all my cuda units immediately on startup. I got an error message indicating that there was no version info for the 64bit cuda executable (?)

20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda
20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda

etc. It seems to think that I want my 608 to be handled by a 64-bit application, at least that's what I make of this message ....

I then changed the <platform> in the cuda section back to windows_x86. Seems to work, at least I get no error messages and it is asking for work (but not getting any, according to the message on the home page the science database crashed, no work available).

Does this make sense? My ak_v8 is a 64-bit application and uses the x86_64 platform description, my cuda executable is a 32-bit application (I'm not aware of the existence of a 64-bit cuda executable?) and uses the x86 description. Is this as it is supposed to be or am I just screwing things up, as usual :-)

Regards,
John.

Fred W
Volunteer tester
Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 877423 - Posted: 20 Mar 2009, 10:36:18 UTC - in response to Message 877410.

The equivalent line from my installation:

<platform>windows_x86_64</platform>

Please do change it - I trashed a lot of WU's because I forgot about this wrinkle :-(

I had the _x86_64 at both occurrences of <platform> but it trashed all my cuda units immediately on startup. I got an error message indicating that there was no version info for the 64bit cuda executable (?)

20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda
20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda

etc. It seems to think that I want my 608 to be handled by a 64-bit application, at least that's what I make of this message ....

I then changed the <platform> in the cuda section back to windows_x86. Seems to work, at least I get no error messages and it is asking for work (but not getting any, according to the message on the home page the science database crashed, no work available).

Does this make sense? My ak_v8 is a 64-bit application and uses the x86_64 platform description, my cuda executable is a 32-bit application (I'm not aware of the existence of a 64-bit cuda executable?) and uses the x86 description. Is this as it is supposed to be or am I just screwing things up, as usual :-)

Regards,
John.

After I had trashed my cache, I decided that the harm had already been done and left the platform description at x86_64 throughout the app_info. That's how it has been running quite happily for over a week now.

F.
____________

Claggy
Volunteer tester
Send message
Joined: 5 Jul 99
Posts: 4048
Credit: 32,693,129
RAC: 604
United Kingdom
Message 877425 - Posted: 20 Mar 2009, 10:48:19 UTC - in response to Message 877423.

The equivalent line from my installation:

<platform>windows_x86_64</platform>

Please do change it - I trashed a lot of WU's because I forgot about this wrinkle :-(

I had the _x86_64 at both occurrences of <platform> but it trashed all my cuda units immediately on startup. I got an error message indicating that there was no version info for the 64bit cuda executable (?)

20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda
20-Mar-2009 10:10:22 [SETI@home] [error] No app version for result: windows_x86_64 608 cuda

etc. It seems to think that I want my 608 to be handled by a 64-bit application, at least that's what I make of this message ....

I then changed the <platform> in the cuda section back to windows_x86. Seems to work, at least I get no error messages and it is asking for work (but not getting any, according to the message on the home page the science database crashed, no work available).

Does this make sense? My ak_v8 is a 64-bit application and uses the x86_64 platform description, my cuda executable is a 32-bit application (I'm not aware of the existence of a 64-bit cuda executable?) and uses the x86 description. Is this as it is supposed to be or am I just screwing things up, as usual :-)

Regards,
John.

After I had trashed my cache, I decided that the harm had already been done and left the platform description at x86_64 throughout the app_info. That's how it has been running quite happily for over a week now.

F.

I've left mine as windows_x86, lost several caches worth before the new scheduler was installed on Seti Main, while i tried to get GPU requests to work,
and haven't run the cache down yet to try again.

Claggy

dyeman
Volunteer tester
Send message
Joined: 3 Apr 99
Posts: 23
Credit: 4,022,422
RAC: 0
Australia
Message 877804 - Posted: 21 Mar 2009, 8:53:05 UTC - in response to Message 877425.

Well this week's outage opened a good opportunity to empty some queues and install 6.6.17. I still had some AP V5s (and WCG work which hopefully won't care). After the install the APs are estimated to finish in about 3 hours (I wish!) - I guess the estimates will equalise over time (or I've cocked up the flop values in the APP_INFO). Without MB work available I downloaded some GPUGRID work and all seems to re running happily (AP WCG and GPUGRID).
____________

1 · 2 · 3 · 4 . . . 12 · Next

Message boards : Number crunching : app_info for AP500, AP503, MB603 and MB608

Copyright © 2014 University of California