Astropulse rip off

Message boards : Number crunching : Astropulse rip off
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Tuvok

Send message
Joined: 14 Apr 03
Posts: 16
Credit: 13,753,353
RAC: 0
Canada
Message 800831 - Posted: 22 Aug 2008, 15:38:55 UTC

Just noticed that a unit one of my computers resulted in an unfair credit result.

I claimed 714. My co-pilot claimed 67....

guess what, I got 67 for 387,742.90 seconds of CPU time.
ID: 800831 · Report as offensive
Profile Tuvok

Send message
Joined: 14 Apr 03
Posts: 16
Credit: 13,753,353
RAC: 0
Canada
Message 800833 - Posted: 22 Aug 2008, 15:39:26 UTC - in response to Message 800831.  

Just noticed that a unit one of my computers resulted in an unfair credit result.

I claimed 714. My co-pilot claimed 67....

guess what, I got 67 for 387,742.90 seconds of CPU time.



http://setiathome.berkeley.edu/workunit.php?wuid=312349656
ID: 800833 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 800838 - Posted: 22 Aug 2008, 15:57:20 UTC

Ooooh, nasty.

Would a Linux specialist like to have a look at your wingman, host 4515096?

Running BOINC v5.10.45, which should be OK.

SETI MB 5.28 was running fine.

SETI MB 6.03 has been crashing consistently with Signal 11

Astropulse seemed to run to a finish, then start all over again: it was only a short way through the second run when it reported complete, hence the low credit claim. Yet it seems to have uploaded a complete result file, and it validated (the few restarts I've seen in the past, for SETI tasks, have uploaded an incomplete/partially overwritten result file, and hence been invalid).

@ Tuvok,

Extremely bad luck. You seem to have been paired with a fellow-cruncher with a very sick computer. Not your fault, not anything you would expect to happen (and certainly not anything designed into the Astropulse search). I hope it doesn't put you off crunching: the chance of this happening again is very, very low.
ID: 800838 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 800839 - Posted: 22 Aug 2008, 15:58:56 UTC

Ripoff, yes. Unfair, maybe not.

The simple truth is that we're all likely to run up against some machine that will radically underclaim from time to time, and when we do, we get ripped off.

If there was some mechanism that paired all of those low-claiming machines with you exclusively, that would be unfair.

It's a ripoff -- but it's an equal-opportunity ripoff.
ID: 800839 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 692
Credit: 135,197,781
RAC: 211
Germany
Message 801096 - Posted: 23 Aug 2008, 2:22:11 UTC

I just started testing linux x86_64 v6.03 at beta and do get the same errors like host 4515096.
_\|/_
U r s
ID: 801096 · Report as offensive
Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 801100 - Posted: 23 Aug 2008, 2:52:53 UTC - in response to Message 801096.  

That's weird. It's getting a signal 11 after calling boinc_finish(). Hmmmmm.... time to delve into what boinc_finish() actually does and where it could segfault.

I just started testing linux x86_64 v6.03 at beta and do get the same errors like host 4515096.


Eric
@SETIEric@qoto.org (Mastodon)

ID: 801100 · Report as offensive
Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 801102 - Posted: 23 Aug 2008, 2:54:26 UTC - in response to Message 801096.  

BTW, I manually granted credit to the users and hosts involved in this one.

Eric
@SETIEric@qoto.org (Mastodon)

ID: 801102 · Report as offensive
Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 801104 - Posted: 23 Aug 2008, 3:00:39 UTC - in response to Message 801102.  

I'm going to deprecate 6.03 for x86_64 until I know what's causing this.

Eric
@SETIEric@qoto.org (Mastodon)

ID: 801104 · Report as offensive
Profile Tuvok

Send message
Joined: 14 Apr 03
Posts: 16
Credit: 13,753,353
RAC: 0
Canada
Message 801116 - Posted: 23 Aug 2008, 3:46:23 UTC - in response to Message 801102.  

BTW, I manually granted credit to the users and hosts involved in this one.

Eric

wow...thanks.

Its nice to know the big guys are watching the forums.
ID: 801116 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 801143 - Posted: 23 Aug 2008, 4:38:31 UTC - in response to Message 801100.  
Last modified: 23 Aug 2008, 4:39:57 UTC

That's weird. It's getting a signal 11 after calling boinc_finish(). Hmmmmm.... time to delve into what boinc_finish() actually does and where it could segfault.

I just started testing linux x86_64 v6.03 at beta and do get the same errors like host 4515096.


Eric


*Could* possibly be remotely related to some wrestling with boinc_finish() I had when profiling windows AK_v8 builds (not Linux, perhaps there's a similar issue there though) . The instrumented builds would dump out before writing their information. Looking deeper I found the exit strategy involved (after calling boinc_finish()) was using Windows' TerminateProcess() instead of ExitProcess() which, in an explanation from Rom Walton, was to counter a tendency in the graphics builds to not close threads fast enough. For me that was OK as I wasn't making graphics builds at the time, so I switched it to ExitProcess() with a preprocessor wrapped directive set for the non-graphics build.
My understanding is that TerminateProcess() will terminate all threads immediately, which is not advised by MS when using DLL's (and I guess other global data) as one or the other thread global data will be in an unknown state for a short time. So my guess is one thread is still reading/writing when shared memory is released, but of course may not apply to the Linux segfault situation at all, as I guess it'd be using some different mechanism.

Jason
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 801143 · Report as offensive
Profile TerryG
Avatar

Send message
Joined: 11 Mar 01
Posts: 16
Credit: 15,351,703
RAC: 37
United Kingdom
Message 801797 - Posted: 24 Aug 2008, 23:00:16 UTC

I've completed 3 or 4 AP WUs now, all around the 250000 - 300000 second mark and the highest I've been granted was about 760 - half of what I claimed. Here's an example that hasn't been granted as of writing (the others have dropped off the end of the list):

http://setiathome.berkeley.edu/workunit.php?wuid=313876961

It does make me want to abort any further AP WUs until there's a fix or possibly at least a good explanation!
ID: 801797 · Report as offensive
Profile xilef

Send message
Joined: 18 Jul 01
Posts: 43
Credit: 17,485,094
RAC: 0
United States
Message 805349 - Posted: 5 Sep 2008, 22:42:36 UTC - in response to Message 801797.  
Last modified: 5 Sep 2008, 22:43:06 UTC

I've completed 3 or 4 AP WUs now, all around the 250000 - 300000 second mark and the highest I've been granted was about 760 - half of what I claimed. Here's an example that hasn't been granted as of writing (the others have dropped off the end of the list):

http://setiathome.berkeley.edu/workunit.php?wuid=313876961

It does make me want to abort any further AP WUs until there's a fix or possibly at least a good explanation!


I worked one that long and got half that much credit.---20 times longer to process than regular WU's only 5 times the credit.

No more AP units for me. Found the switch in preferences.
ID: 805349 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 805367 - Posted: 6 Sep 2008, 0:02:29 UTC - in response to Message 805349.  

For me I've found that SETI Astropulse yields somewhere on the order of 7 to 8 credits per hour (one core of a dual core AMD 5200 or similar). Other projects vary but none of the others are that low -- typically they run from 13 to 20 credits per hour.

But there is something of a problem with the 'regular' SETI work units and multicore AMD processors where they go into 'black hole' mode of no progress with dozens or hours of processing time. So I'm in something of a bind. For now, I'll let the existing AP work units complete, but will wait until some correction is made.

I'll not engage in the 'our credits are right, and everyone else is wrong' discussion that seems to be running loose on the board here, I see that as something of a religious/political argument where parties on either side end up generating more heat than light -- I figure that would give ET a bad impression of the degree to which we are enlightened. Rather, for now, I'll simply shift over to other projects which also have social redeeming goals.


I've completed 3 or 4 AP WUs now, all around the 250000 - 300000 second mark and the highest I've been granted was about 760 - half of what I claimed. Here's an example that hasn't been granted as of writing (the others have dropped off the end of the list):

http://setiathome.berkeley.edu/workunit.php?wuid=313876961

It does make me want to abort any further AP WUs until there's a fix or possibly at least a good explanation!


I worked one that long and got half that much credit.---20 times longer to process than regular WU's only 5 times the credit.

No more AP units for me. Found the switch in preferences.


ID: 805367 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19546
Credit: 40,757,560
RAC: 67
United Kingdom
Message 805396 - Posted: 6 Sep 2008, 1:12:50 UTC - in response to Message 805367.  
Last modified: 6 Sep 2008, 1:13:58 UTC

It has been known for some time, before AP was introduced here, that AMD cpu's do not do AP units well.
In a comparison on the Beta site two computers with similar benchmark numbers completed AP units in ~36 and ~86 hrs. The cpu's were Intel E6600 and AMD X2 6000+.
And before anyone jumps in and says in must be that bad AMD string that degrades performance, Berkeley does not use the Intel compiler, they cannot afford it.

For me I've found that SETI Astropulse yields somewhere on the order of 7 to 8 credits per hour (one core of a dual core AMD 5200 or similar). Other projects vary but none of the others are that low -- typically they run from 13 to 20 credits per hour.

But there is something of a problem with the 'regular' SETI work units and multicore AMD processors where they go into 'black hole' mode of no progress with dozens or hours of processing time. So I'm in something of a bind. For now, I'll let the existing AP work units complete, but will wait until some correction is made.

I'll not engage in the 'our credits are right, and everyone else is wrong' discussion that seems to be running loose on the board here, I see that as something of a religious/political argument where parties on either side end up generating more heat than light -- I figure that would give ET a bad impression of the degree to which we are enlightened. Rather, for now, I'll simply shift over to other projects which also have social redeeming goals.


I've completed 3 or 4 AP WUs now, all around the 250000 - 300000 second mark and the highest I've been granted was about 760 - half of what I claimed. Here's an example that hasn't been granted as of writing (the others have dropped off the end of the list):

http://setiathome.berkeley.edu/workunit.php?wuid=313876961

How did you claim 2 * granted, before introduction AP moved to flop counting for claimed credits, and when AP was introduced here claimed less than 700cr. Since than, due to Eric's credit adjustment the claims/granted have risen steadily to 775+.

It does make me want to abort any further AP WUs until there's a fix or possibly at least a good explanation!


I worked one that long and got half that much credit.---20 times longer to process than regular WU's only 5 times the credit.

No more AP units for me. Found the switch in preferences.

ID: 805396 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 805408 - Posted: 6 Sep 2008, 1:54:08 UTC

My AMD is working just fine for AP and MB. Since I upgraded, I only had one error in 8 mos.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 805408 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 805459 - Posted: 6 Sep 2008, 7:38:16 UTC - in response to Message 805408.  

It isn't that the AMD processors don't work for AstroPulse, but rather that the AstroPulse application apparently runs inefficiently on AMD processors. Thus, instead of taking 40 to 50 hours to complete an AstroPulse work unit (which apparently is the case on an Intel system of similar speed), the AMD based system will take 75 to 100 hours.

It is as if an Intel system runs a 'regular' application while the AMD runs a 'disoptimized' application.

No big deal for me, for now, I'm shifting elsewhere. I've configured my workstations for no new work. At some point, when the AP work units have flushed thru the farm, I'll reconfigure -- that is, I allow only the non AP work units for those AMD processors (via group definition), and run the optimized SETI application -- which apparently will resolve the problem I've encountered all too often with the 'regular' SETI work units and AMD multicores (of running hours with no progress). At that point, I'll be back in a 'happy SETI' place.

During the interim, other projects are picking up the slack for me.



My AMD is working just fine for AP and MB. Since I upgraded, I only had one error in 8 mos.


ID: 805459 · Report as offensive
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24927
Credit: 3,081,182
RAC: 7
Ireland
Message 805497 - Posted: 6 Sep 2008, 11:57:57 UTC

All the AP wu's I've crunched have only been on the AMD rigs on my farm.

The last AP wu crunched 4402202 totalled 177.89 hours with a granted credit of 673.70. This gives a c/ph of 3.79, low but it all helps.
ID: 805497 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 805561 - Posted: 6 Sep 2008, 17:47:22 UTC - in response to Message 805497.  

OK -- sort of confirms the discussion here -- sounds like you have an older AMD for that last work unit (perhaps something like an XP1600).

The Astropulse work units yield about 1/2 the credit/hour on AMD's compared to the regular SETI work units. That is a fairly steep (and unintentional) tax on AMD processing power.

What I expect to do when I get the time, is get the optimized application in place and configure for the regular work units. My understanding is that combination will help me avoid the 'black hole' work units that the AMD multi-core processors running Win2K and WinXP seem to encounter with the current batch of regular work units (this used to happen rarely -- say two months ago, but I was running into it a LOT this past month).



All the AP wu's I've crunched have only been on the AMD rigs on my farm.

The last AP wu crunched 4402202 totalled 177.89 hours with a granted credit of 673.70. This gives a c/ph of 3.79, low but it all helps.


ID: 805561 · Report as offensive
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24927
Credit: 3,081,182
RAC: 7
Ireland
Message 805595 - Posted: 6 Sep 2008, 20:33:56 UTC - in response to Message 805561.  

OK -- sort of confirms the discussion here -- sounds like you have an older AMD for that last work unit (perhaps something like an XP1600).

The Astropulse work units yield about 1/2 the credit/hour on AMD's compared to the regular SETI work units. That is a fairly steep (and unintentional) tax on AMD processing power.

What I expect to do when I get the time, is get the optimized application in place and configure for the regular work units. My understanding is that combination will help me avoid the 'black hole' work units that the AMD multi-core processors running Win2K and WinXP seem to encounter with the current batch of regular work units (this used to happen rarely -- say two months ago, but I was running into it a LOT this past month).



All the AP wu's I've crunched have only been on the AMD rigs on my farm.

The last AP wu crunched 4402202 totalled 177.89 hours with a granted credit of 673.70. This gives a c/ph of 3.79, low but it all helps.



Athlon 64 X2 4200

ID: 805595 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 805609 - Posted: 6 Sep 2008, 21:01:50 UTC - in response to Message 805595.  
Last modified: 6 Sep 2008, 21:18:54 UTC

OK -- sort of confirms the discussion here -- sounds like you have an older AMD for that last work unit (perhaps something like an XP1600).

The Astropulse work units yield about 1/2 the credit/hour on AMD's compared to the regular SETI work units. That is a fairly steep (and unintentional) tax on AMD processing power.

What I expect to do when I get the time, is get the optimized application in place and configure for the regular work units. My understanding is that combination will help me avoid the 'black hole' work units that the AMD multi-core processors running Win2K and WinXP seem to encounter with the current batch of regular work units (this used to happen rarely -- say two months ago, but I was running into it a LOT this past month).



All the AP wu's I've crunched have only been on the AMD rigs on my farm.

The last AP wu crunched 4402202 totalled 177.89 hours with a granted credit of 673.70. This gives a c/ph of 3.79, low but it all helps.



Athlon 64 X2 4200


Hi, there appears to be a huge difference between the used CPU and probably the whole system compaired to crunching times, I use a QX9650 @ 3434MHz, X38 chipset, a AP WU takes about 24 hours, that gives about 30cr/hour. Credit claim was 750, got 720.
I wander, what causes this huge difference in crunching time, it's NOT FLOPs only or RAM, looks like these WU's are less memory dependent then MB Units, but I also doubt that?
Must be the code used?
Or the L2 cache (12MByte)?
Maybe my CPU has twice the FLOPs then the Athlon 64 X2 4200, per core. Or a bit more. ;) Beats me, so far.
Oh, I don't mind, crunching them, someone(s) got to do them, anyway ;)
ID: 805609 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : Astropulse rip off


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.