Posts by ivan


1) Message boards : Number crunching : Panic Mode On (84) Server Problems? (Message 1381824)
Posted 3 days ago by ivan
Grant. I think you're going to worry yourself into an early grave.

Not worry, Panic.

Relax. Go do some rock-sitting. Have a Darwin Stubby or three.
2) Message boards : Number crunching : Intel Xeon Phi (Message 1381626)
Posted 3 days ago by ivan
And keep in mind this is the first generation of Xeon Phi, GPGPU computing has been around for over 5 generations.
The Xeon Phi has a huge potential in it's field.

I'd agree with most of that (apart from the greengrocer's apostrophe...) Let's say I've heard "rumours" about future Xeon Phis, such that it's possible the product line may expand...
3) Message boards : Number crunching : Intel Xeon Phi (Message 1381511)
Posted 4 days ago by ivan
A quick glance at my Linux boxes shows Astropulse runs about 50 MB and Seti7 runs 40 MB so there's no way to get all 240 threads running a WU each. Intel reckons that two threads/core is the sweet spot for big calculations, though.

Can it execute OpenCL on CPUs?
If yes, then OpenCL MB and AP can be configured to run a few instances each, with every instance using a few dozen CPU cores, for example. This would reduce the memory footprint considerably while using almost all CPU cores (to the degree that almost all GPU "cores" are used now).

Yes, that's one of the programming models, though we didn't explore it in the workshop I went to at CERN. It's not (yet) as efficient as other models, but it does exist. If things come to fruition I might ask to try your OpenCL code.
4) Message boards : Number crunching : Intel Xeon Phi (Message 1381453)
Posted 4 days ago by ivan
Are there any plans for SETI@HOME to support the Intel Xeon Phi coprocessors?

For something that specialised it'll be up to someone to volunteer their services to develop the support for the hardware.

We've decided to order one. Unfortunately the model on offer is the 60-core passively-cooled one so I had to convince the boss that we needed to spend as much again on a chassis with forced-air cooling. I have a box it would have dropped into if it was actively cooled; the two-GPU Linux cruncher would have taken it at the expense of one of the GPUs but that's heavily-used for data analysis so not the machine for Phi development.

For sure I'll get BOINC running, if only for the impressive start-up message. S@H is a bit more problematic; the Phi only has 8 GB of memory on-board, and that includes the file-system (with no swap). A quick glance at my Linux boxes shows Astropulse runs about 50 MB and Seti7 runs 40 MB so there's no way to get all 240 threads running a WU each. Intel reckons that two threads/core is the sweet spot for big calculations, though.
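The memory argument above can be checked with a little arithmetic. The sketch below is only a rough illustration: it takes the 8 GB card as 8192 MB, uses the 50 MB and 40 MB per-task figures quoted from the Linux boxes (the footprint on the Phi itself is not known), and ignores the share of memory taken by the on-card file system. The function name `total_mb` is made up for this example.

```shell
#!/bin/sh
# total_mb THREADS MB_PER_TASK: memory needed if every thread runs its own task.
total_mb() { echo $(( $1 * $2 )); }

# Assumption: 8 GB on the card taken as 8192 MB, before the file system's share.
card_mb=8192

# 240 threads = all four hardware threads on each of 60 cores;
# 120 threads = the two-threads-per-core "sweet spot" mentioned above.
for threads in 240 120; do
    for per_task in 50 40; do
        need=$(total_mb "$threads" "$per_task")
        if [ "$need" -gt "$card_mb" ]; then
            verdict="does not fit"
        else
            verdict="fits"
        fi
        echo "$threads tasks x $per_task MB = $need MB ($verdict in $card_mb MB)"
    done
done
```

With these figures, 240 independent tasks need more than the card has, while 120 tasks would fit on paper, before allowing for the file system.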

There's a chance for parallelism, though, depending on where the bottlenecks are -- several threads each executing one pass through a loop structure (OpenMP).
Also their vector units are much wider (2x?) than the CPUs so the compiler should be able to use that to speed up operations.
5) Message boards : News : SETI@home Version 7 has been released (Message 1381365)
Posted 4 days ago by ivan
And an extra-Z80 processor so I can run CP/M apps.
I'm still working on porting SETI@home to it.

I wonder if there's an FFT that can run in 1KB (ZX80) or 48KB (C64). :-)

I used to do 256-channel FFTs in CBASIC on a (Z80) Exidy Sorcerer -- cutting-edge stuff in Antarctica in 1980. It was faster to precompute the arrays of sins and coses needed, and then read them in from floppy disk, than to calculate them every run.

Hey - another Sorcerer user - I've still got mine in the cellar. But I did boring things like word processing and mailing lists on mine.

I've still got mine too, in storage in Byron Bay. Last time I checked, maybe 20 years ago, there was a bus problem between the computer and the S100 expansion unit. I'm hoping next time I'm home to at least retrieve the Micropolis floppy disks and see if I can find someone who can read them -- some results I got in Mawson which were pooh-poohed as not fitting the orthodox view of upper-atmospheric science, and so never published, are now apparently mainstream; it'd be nice to claim priority!
6) Message boards : News : SETI@home Version 7 has been released (Message 1380754)
Posted 5 days ago by ivan
And an extra-Z80 processor so I can run CP/M apps.
I'm still working on porting SETI@home to it.

I wonder if there's an FFT that can run in 1KB (ZX80) or 48KB (C64). :-)

I used to do 256-channel FFTs in CBASIC on a (Z80) Exidy Sorcerer -- cutting-edge stuff in Antarctica in 1980. It was faster to precompute the arrays of sins and coses needed, and then read them in from floppy disk, than to calculate them every run.
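The precompute-and-reload trick can be sketched in a few lines. This is not the original CBASIC; it is a small shell/awk illustration, and the function names `make_twiddles` and `load_twiddles` and the file layout (one "cos sin" pair per line) are assumptions made for the example.

```shell
#!/bin/sh
# make_twiddles N: print N lines of "cos sin" for the angles 2*pi*k/N, k = 0..N-1.
make_twiddles() {
    awk -v n="$1" 'BEGIN {
        pi = atan2(0, -1)
        for (k = 0; k < n; k++)
            printf "%.6f %.6f\n", cos(2 * pi * k / n), sin(2 * pi * k / n)
    }'
}

# load_twiddles N FILE: compute the table only if FILE is missing or empty,
# then read it back; later runs skip the trigonometry entirely.
load_twiddles() {
    [ -s "$2" ] || make_twiddles "$1" > "$2"
    cat "$2"
}

# Example use (the path is arbitrary):
#   load_twiddles 256 /tmp/twiddles_256.txt
```

On modern hardware the saving is negligible; the point is only to show the shape of the technique.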
7) Message boards : Number crunching : Linux: v7 CUDA? (Message 1380068)
Posted 7 days ago by ivan

Discussed in 'Porting s@h V7 to Linux':
http://setiathome.berkeley.edu/forum_thread.php?id=71818


And unfortunately as far as I'm concerned, real work is intruding on my efforts. My direct port works well as a standalone, and produces results as close to the CPU executable as you could expect. But it produces access violations when run from BOINC. JG has had more success with his port but last I heard wasn't fully satisfied with it. We'll get there someday -- I keep threatening to retire to get more time for this sort of thing but there are small problems like the lack of enough pension fund...
8) Message boards : Number crunching : Lunatics Windows Installer v0.41 Release Notes (Message 1378643)
Posted 10 days ago by ivan
All 3 rigs now updated to 314.22 drivers.

We'll give 'er a meow and see how it flies.

I'm running 314.07 here where I had the problem.
<Eccles> Thinks! I should check my other two Win+Nvidia machines <Spanish Inquisition> three Win+Nvidia machines </SI>... <checks> No, no computation errors there, but they are all Win7 while my home machine is XP </checks>. </Eccles>
9) Message boards : Number crunching : Astropulse Nvidia errors (Message 1378640)
Posted 10 days ago by ivan
The ones I checked happened after completion of the work. The thread "OpenCL AstroPulse crash after processing completion - write here." discusses that. The cause is not fully understood, so the problem is not fully fixed.
Joe

Thanks Joe. I saw that thread at the time, but it wasn't relevant to me back then. I'd forgotten about it by now, of course. I'll hold off on GPU AP work until a solution is found -- I'm losing enough RAC at the moment without doing "work for no pay" too. ;-)
10) Message boards : Number crunching : Astropulse Nvidia errors (Message 1378599)
Posted 10 days ago by ivan
Just noticed that my home PC is having a large number of computational errors in its Astropulse tasks on my GTX560 GPU. Is anyone else seeing that with their rigs?

[Edit] In the meantime I re-installed the Lunatics package w/o CPU AP, which led to time-out errors... [/Edit]
11) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1377694)
Posted 12 days ago by ivan
if only I could come up with a method to find that pesky segfault! (Using the unstripped version didn't result in a traceback or any further information in the stderr file.)

Add a pause at the beginning and, once BOINC has started it, attach gdb to the running process?

[And hope the pause doesn't make the error disappear :)

edit: Come to think of it, you could configure your system to dump core and give that to gdb.

Thanks, I'll give that some consideration, probably at the weekend. I need to concentrate on work work tomorrow; I have to come up with a working system for something I've been delegated lately -- almost there!
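The two suggestions quoted above (attach gdb to the running process, or let the system dump core and hand the core file to gdb) might look roughly like this. It is a sketch under assumptions: the helper names and the `DRY_RUN` switch are invented for illustration, and where the core file ends up depends on the system's configuration.

```shell
#!/bin/sh
# run CMD...: with DRY_RUN set, print the command instead of executing it.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Option 1: attach to the already-running science app by process ID.
attach_to_pid() { run gdb -p "$1"; }

# Option 2: print a backtrace from a core file, non-interactively.
# An unstripped executable gives far more useful symbol names.
backtrace_from_core() { run gdb --batch -ex bt "$1" "$2"; }

# Core dumps are often disabled by default. The limit is inherited by child
# processes, so it has to be raised in the shell that starts the BOINC client:
#   ulimit -c unlimited
```

For example, `DRY_RUN=1 backtrace_from_core ./app core` only prints the gdb command line, which is a cheap way to check the arguments before running it for real.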
12) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1377692)
Posted 12 days ago by ivan
Here's a partial listing of the benchmark results. Unfortunately we had a network glitch this afternoon in my building, which led to my desktop getting a new address from DHCP and "orphaning" my ssh session to the server. The results look close enough; if only I could come up with a method to find that pesky segfault! (Using the unstripped version didn't result in a traceback or any further information in the stderr file.) Jason, which version of BOINC are you using on your Frankenputer?


Looks like 6.10.58.


Yep, prepackaged 6.10.58 Berkeley one installed in my home folder.... as opposed to the locally tailored 6.10.58 on my Windows machines.


Did you actually build that yourself? Otherwise what are you linking the CUDA executables against?

I'm not very familiar with git -- as far as I understand the git clone command downloaded all commits to the repository, so do I just do a checkout command for the 6.10.58 version (presumably after a make clean...) and then do the ./_autosetup and "./configure --disable-server --enable-manager" commands (plus make) to rebuild it?
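The sequence described in the question can be sketched as follows. Two things are assumptions here: the helper names (`run`, `rebuild_boinc_at`) are invented, and the tag name for 6.10.58 is not known from this thread, so it is left as a parameter. `git tag -l '*6.10.58*'` lists any matching tags in the clone. The `_autosetup` and `configure` lines are the ones quoted in the post.

```shell
#!/bin/sh
# run CMD...: with DRY_RUN set, print the command instead of executing it.
run() {
    if [ -n "${DRY_RUN:-}" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# rebuild_boinc_at TAG: switch the working tree to TAG and rebuild the client.
# Run from the top of the cloned repository.
rebuild_boinc_at() {
    run make clean || true        # harmless if nothing has been built yet
    run git checkout "$1" &&
    run ./_autosetup &&
    run ./configure --disable-server --enable-manager &&
    run make
}

# Example (the tag name is a placeholder, not a real BOINC tag):
#   DRY_RUN=1 rebuild_boinc_at some_tag_for_6.10.58
```

Checking out a tag leaves the repository in a "detached HEAD" state, which is fine for building. `git checkout` followed by the original branch name returns to the latest code.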
13) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1377660)
Posted 12 days ago by ivan
Here's a partial listing of the benchmark results. Unfortunately we had a network glitch this afternoon in my building, which led to my desktop getting a new address from DHCP and "orphaning" my ssh session to the server. The results look close enough; if only I could come up with a method to find that pesky segfault! (Using the unstripped version didn't result in a traceback or any further information in the stderr file.) Jason, which version of BOINC are you using on your Frankenputer?

----------------------------------------------------------------
Current WU: FG00091_V7.wu

----------------------------------------------------------------
Running default app with command :... setiathome_7.01_i686-pc-linux-gnu
./setiathome_7.01_i686-pc-linux-gnu 9825.01 sec 9755.18 sec 67.30 sec
Elapsed Time: ....................... 9825 seconds

----------------------------------------------------------------
Running app with command : .......... setiathome_x41_x86_64-pc-linux-gnu_cuda41
./setiathome_x41_x86_64-pc-linux-gnu_cuda41 7138.65 sec 106.92 sec 23.79 sec
Elapsed Time : ...................... 7139 seconds
Speed compared to default : ......... 137 %
-----------------
Comparing results
Result : Strongly similar, Q= 99.94%

----------------------------------------------------------------
Done with FG00091_V7.wu

====================================================================
Current WU: FG00134_V7.wu

----------------------------------------------------------------
Running default app with command :... setiathome_7.01_i686-pc-linux-gnu
./setiathome_7.01_i686-pc-linux-gnu 9680.99 sec 9603.07 sec 75.33 sec
Elapsed Time: ....................... 9681 seconds

----------------------------------------------------------------
Running app with command : .......... setiathome_x41_x86_64-pc-linux-gnu_cuda41
./setiathome_x41_x86_64-pc-linux-gnu_cuda41 6808.87 sec 97.33 sec 20.85 sec
Elapsed Time : ...................... 6809 seconds
Speed compared to default : ......... 142 %
-----------------
Comparing results
Result : Strongly similar, Q= 99.97%

----------------------------------------------------------------
Done with FG00134_V7.wu

====================================================================
Current WU: FG01307_V7.wu

----------------------------------------------------------------
Running default app with command :... setiathome_7.01_i686-pc-linux-gnu
./setiathome_7.01_i686-pc-linux-gnu 9441.82 sec 9363.48 sec 74.12 sec
Elapsed Time: ....................... 9442 seconds

----------------------------------------------------------------
Running app with command : .......... setiathome_x41_x86_64-pc-linux-gnu_cuda41
./setiathome_x41_x86_64-pc-linux-gnu_cuda41 2161.47 sec 88.19 sec 17.85 sec
Elapsed Time : ...................... 2161 seconds
Speed compared to default : ......... 436 %
-----------------
Comparing results
Result : Strongly similar, Q= 99.97%

----------------------------------------------------------------
Done with FG01307_V7.wu

====================================================================
Current WU: FG02968_V7.wu

----------------------------------------------------------------
Running default app with command :... setiathome_7.01_i686-pc-linux-gnu
./setiathome_7.01_i686-pc-linux-gnu 14072.77 sec 14000.73 sec 69.50 sec
Elapsed Time: ....................... 14073 seconds

----------------------------------------------------------------
Running app with command : .......... setiathome_x41_x86_64-pc-linux-gnu_cuda41
./setiathome_x41_x86_64-pc-linux-gnu_cuda41 1502.04 sec 122.73 sec 16.70 sec
Elapsed Time : ...................... 1502 seconds
Speed compared to default : ......... 936 %
-----------------
Comparing results
Result : Strongly similar, Q= 99.89%

----------------------------------------------------------------
Done with FG02968_V7.wu

====================================================================
Current WU: FG03853_V7.wu

----------------------------------------------------------------
Running default app with command :... setiathome_7.01_i686-pc-linux-gnu
./setiathome_7.01_i686-pc-linux-gnu 13757.72 sec 13642.14 sec 112.06 sec
Elapsed Time: ....................... 13758 seconds

----------------------------------------------------------------
Running app with command : .......... setiathome_x41_x86_64-pc-linux-gnu_cuda41
./setiathome_x41_x86_64-pc-linux-gnu_cuda41 1250.23 sec 105.88 sec 23.00 sec
Elapsed Time : ...................... 1250 seconds
Speed compared to default : ......... 1100 %
-----------------
Comparing results
Result : Strongly similar, Q= 99.91%

----------------------------------------------------------------
Done with FG03853_V7.wu
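The "Speed compared to default" figures in the listing above are consistent with 100 times the default app's elapsed time divided by the CUDA app's elapsed time, rounded down. The benchmark script itself is not shown, so that formula is an inference from the numbers; the sketch below simply recomputes it from the elapsed times printed in the log (the function name `speed_percent` is made up).

```shell
#!/bin/sh
# speed_percent DEFAULT_SECONDS APP_SECONDS: relative speed as a whole percentage.
# Shell arithmetic is integer-only, so the result is truncated, not rounded.
speed_percent() { echo $(( 100 * $1 / $2 )); }

# Elapsed times (seconds) copied from the benchmark listing above.
while read -r wu default_s app_s; do
    echo "$wu: $(speed_percent "$default_s" "$app_s") %"
done <<EOF
FG00091_V7 9825 7139
FG00134_V7 9681 6809
FG01307_V7 9442 2161
FG02968_V7 14073 1502
FG03853_V7 13758 1250
EOF
```

The spread from 137 % to 1100 % comes almost entirely from the CUDA run time, which varies far more between work units than the CPU run time does.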
14) Message boards : Number crunching : What has your RAC done......since..... (Message 1377529)
Posted 12 days ago by ivan
Well even though the credits have plummeted my 2 recently upgraded rigs are still climbing the RAC ladder with my Q6600 now up to 72nd and my 2500K up to 94th (78th & 104th prior to change) so it isn't all that bad from my POV.

Cheers.

I hope you waved as you went by -- without GPUs my big Linux rig is plummeting down past 200.
15) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1377138)
Posted 13 days ago by ivan
Good move! Just one point (that might be annoying for some of the Linux users) : Users with older (incompatible) glibc will not be able to use your app. (just a hint, no bad intentions)


What version of glibc do you call older? I upgraded my Linux PCs to glibc 2.14.1 to be able to use the x41g application.

This could be a good opportunity for a major upgrade of my Linux PCs, as any BOINC version beyond 7.0.25 complains about missing or out-of-date libraries.

Some might not have the opportunity to update their glibc's or BOINC as quick as you. (el5/el6 support by default glibc 2.11.x) Knowing that gave some base premise for building new opt. apps.

That's the problem I have (forced into SLC5) and why I've been building my own BOINCs for a while (plus they're needed to build s@h).

I've realised that there's an unstripped version of the executable in the build tree. I'm hoping that would give me a traceback on the exception (but does the stack dump go into stdout or stderr?), but I won't try for a while as I've got the benchmark script running right now. Minor differences in the results for the first WU (FG00091_V7.wu).
16) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1377002)
Posted 14 days ago by ivan
Should we invite Ivan over to CA?


yep, good Idea. Might make things a lot easier.


Will see if I can get my Linux beast alive sometime after work, then figure out what's needed for the full set of Cuda apps in parallel with Ivan. If it builds *something* pretty much off the bat, that's a good start. Figuring out if it's Xbranch, boincapi, boinc or other library/driver breaking might be the adventure, we'll see.

A couple of days of mainly doing other things...

It still segfaults when I try to run it under BOINC. I ran the Lunatics installer on my Windows machines last night and compared my app_config.xml file to the one the installer builds -- astropulse worked but the CUDA MB crashed. I tried moving my boinc apps back to the version I built the seti task with, and also added as many log flags as seemed useful to the cc_config.xml but didn't find any smoking guns in the BOINC log.
17) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1375523)
Posted 16 days ago by ivan
Right, this probably makes more sense:

[eesridr:seti] > for i in V7_WU_FGset/* ; do n=`basename $i .wu`; echo $i, $n; \
cp $i work_unit.sah; time ./setiathome_x41_x86_64-pc-linux-gnu_cuda41; mkdir $n; mv result.sah $n;
\mv stderr.txt $n; mv state.sah $n; done
V7_WU_FGset/FG00091_V7.wu, FG00091_V7

real 118m42.337s
user 1m34.298s
sys 0m20.664s
V7_WU_FGset/FG00134_V7.wu, FG00134_V7

real 112m57.705s
user 1m41.795s
sys 0m22.125s
V7_WU_FGset/FG01307_V7.wu, FG01307_V7

real 36m10.537s
user 1m28.819s
sys 0m20.910s
V7_WU_FGset/FG02968_V7.wu, FG02968_V7

real 24m35.531s
user 1m43.075s
sys 0m18.435s
V7_WU_FGset/FG03853_V7.wu, FG03853_V7

real 20m56.194s
user 1m49.579s
sys 0m25.081s
V7_WU_FGset/FG04160_V7.wu, FG04160_V7

real 19m21.320s
user 1m47.907s
sys 0m18.671s
V7_WU_FGset/FG04221_V7.wu, FG04221_V7

real 19m7.739s
user 1m45.008s
sys 0m21.347s
V7_WU_FGset/FG04317_V7.wu, FG04317_V7

real 18m43.080s
user 1m40.340s
sys 0m17.893s
V7_WU_FGset/FG04465_V7.wu, FG04465_V7

real 18m23.984s
user 1m40.191s
sys 0m16.684s
V7_WU_FGset/FG09362_V7.wu, FG09362_V7

real 13m56.876s
user 1m15.963s
sys 0m11.871s
V7_WU_FGset/FG11753_V7.wu, FG11753_V7

real 11m1.364s
user 1m11.966s
sys 0m11.003s
V7_WU_FGset/FG13462_V7.wu, FG13462_V7

real 10m57.744s
user 1m16.691s
sys 0m12.424s
V7_WU_FGset/FG24857_V7.wu, FG24857_V7

real 10m41.414s
user 1m11.185s
sys 0m10.843s
V7_WU_FGset/FG53024_V7.wu, FG53024_V7

real 10m39.754s
user 1m14.204s
sys 0m11.442s
V7_WU_FGset/FG76516_V7.wu, FG76516_V7

real 10m41.165s
user 1m10.789s
sys 0m13.692s

Results should be in the same web location -- http://www.brunel.ac.uk/~eesridr/FG.tgz. I'd run the comparison suite but I'm in the middle of watching today's motorcycle races from Mugello.
18) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1375279)
Posted 17 days ago by ivan
I would suggest providing an init_data.xml, either from a BOINC install or the test pack. It probably won't change much, but it would help show whether there is a failure further on, such as when checking for BOINC's shared mem (which won't be there standalone, or in a test bench).

Copying across the init_data.xml from one of my working slots seemed to cause problems, so I deleted it.

Couple of things:
- That explains the 20 seconds. There must have been a completed state file present (left uncleared from the first test task? Indeed, the first one looks like proper timing for a VLAR)
- On Windows, Cuda 4.1 release had a rather serious flaw involving the CUFFT & mixed generation GPUs in the same host. That could conceivably be a problem on Linux as well I guess, and may well explain your Sig 11's (just one possibility). Cuda 3.2, 4.2, 5, and probably the 5.5rc shouldn't have that particular issue.

OK; I was a bit confused about state.sah, it didn't always appear as far as I recall. I've added it to the list of files moved to the subdirectory and am re-running the test. Let's see what's happened after I've done the shopping...
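The fix described above (moving state.sah out of the way along with the other output files) can be sketched as a small function. This is an illustration rather than the exact loop used: the function name `run_wu_set` is invented, and it also clears any leftovers before each run, which the original commands do not do.

```shell
#!/bin/sh
# run_wu_set APP WU_DIR: run APP once per work unit in WU_DIR and archive the
# outputs in a subdirectory named after the work unit.
run_wu_set() {
    app=$1
    wu_dir=$2
    for wu in "$wu_dir"/*.wu; do
        [ -e "$wu" ] || continue                 # directory had no .wu files
        name=$(basename "$wu" .wu)
        # Leftovers from a previous task must go: a completed state.sah makes
        # the app think the new work unit is already finished.
        rm -f state.sah result.sah stderr.txt
        cp "$wu" work_unit.sah
        "$app" || echo "$name: app exited with status $?" >&2
        mkdir -p "$name"
        for f in result.sah stderr.txt state.sah; do
            if [ -e "$f" ]; then
                mv "$f" "$name"/
            fi
        done
    done
}

# Example, run from the directory holding the executable:
#   run_wu_set ./setiathome_x41_x86_64-pc-linux-gnu_cuda41 V7_WU_FGset
```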
19) Message boards : News : SETI@home Version 7 has been released (Message 1375276)
Posted 17 days ago by ivan
How many creds do you need to replace a 1990 Oldsmobile Ciera anyway?

Just replace it; whatever you replace it with (outside an Alfa Romeo) will magically multiply your street cred!
20) Message boards : Number crunching : Porting s@h V7 to Linux (Message 1375057)
Posted 17 days ago by ivan
Meanwhile the FGset have finished so I'll process them anyway.

[eesridr:seti] > for i in V7_WU_FGset/* ; do n=`basename $i .wu`; echo $i, $n; \
cp $i work_unit.sah; time ./setiathome_x41_x86_64-pc-linux-gnu_cuda41; mkdir $n; \
mv result.sah $n; mv stderr.txt $n; done
V7_WU_FGset/FG00091_V7.wu, FG00091_V7

real 116m40.409s
user 1m57.721s
sys 0m27.602s
V7_WU_FGset/FG00134_V7.wu, FG00134_V7

real 5m12.050s
user 0m17.461s
sys 0m0.603s
V7_WU_FGset/FG01307_V7.wu, FG01307_V7

real 0m19.874s
user 0m16.071s
sys 0m0.199s
V7_WU_FGset/FG02968_V7.wu, FG02968_V7

real 9m10.382s
user 0m49.887s
sys 0m8.376s
V7_WU_FGset/FG03853_V7.wu, FG03853_V7

real 1m9.393s
user 0m20.379s
sys 0m1.131s
V7_WU_FGset/FG04160_V7.wu, FG04160_V7

real 0m28.049s
user 0m17.603s
sys 0m0.320s
V7_WU_FGset/FG04221_V7.wu, FG04221_V7

real 0m20.991s
user 0m16.145s
sys 0m0.210s
V7_WU_FGset/FG04317_V7.wu, FG04317_V7

real 0m18.224s
user 0m16.030s
sys 0m0.192s
V7_WU_FGset/FG04465_V7.wu, FG04465_V7

real 0m18.177s
user 0m16.001s
sys 0m0.173s
V7_WU_FGset/FG09362_V7.wu, FG09362_V7

real 0m19.182s
user 0m16.988s
sys 0m0.189s
V7_WU_FGset/FG11753_V7.wu, FG11753_V7

real 0m18.198s
user 0m16.019s
sys 0m0.174s
V7_WU_FGset/FG13462_V7.wu, FG13462_V7

real 0m18.445s
user 0m16.256s
sys 0m0.185s
V7_WU_FGset/FG24857_V7.wu, FG24857_V7

real 0m18.274s
user 0m16.083s
sys 0m0.183s
V7_WU_FGset/FG53024_V7.wu, FG53024_V7

real 0m19.254s
user 0m17.066s
sys 0m0.180s
V7_WU_FGset/FG76516_V7.wu, FG76516_V7

real 0m19.371s
user 0m17.180s
sys 0m0.188s

Results available in http://www.brunel.ac.uk/~eesridr/FG.tgz



Copyright © 2013 University of California