AstroPulse errors - Reporting

Message boards : Number crunching : AstroPulse errors - Reporting
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · Next

AuthorMessage
Profile dnolan
Avatar

Send message
Joined: 30 Aug 01
Posts: 1228
Credit: 47,779,411
RAC: 32
United States
Message 820578 - Posted: 19 Oct 2008, 17:53:59 UTC
Last modified: 19 Oct 2008, 17:54:59 UTC

I just noticed that an AP I'm paired up with, one other cruncher errored out early on the WU (mine is still pending) and got partial credit granted:
Over Client error Done 122,606.70 290.27 290.27

I wasn't aware that error results were getting credit granted, is this a change in policy?

-Dave

[edit] WU is Here
ID: 820578 · Report as offensive
Kurt Schmucker

Send message
Joined: 11 Jan 00
Posts: 72
Credit: 130,823,400
RAC: 207
United States
Message 822341 - Posted: 23 Oct 2008, 20:33:21 UTC

I have noticed that none of my Mac clients are getting AP WUs, and these are by far the fastest clients I have.

Is this an AP WU deployment error?

My MUCH slower Windows-based clients are gettting AP WUs, and sometimes these will crunch for more than 100 hours on a single WU. Is this expected behavior? I thought that slower clients weren't supposed to get AP WUs.
ID: 822341 · Report as offensive
Profile Leaps-from-Shadows
Volunteer tester
Avatar

Send message
Joined: 11 Aug 08
Posts: 323
Credit: 259,220
RAC: 0
United States
Message 822379 - Posted: 23 Oct 2008, 22:21:31 UTC

Macs don't get Astropulse automatically. You have to manually install the application and modify the app_info.xml file. See this thread.
Cruiser
Gateway GT5692 L-f-S Edition
-Phenom X4 9650 CPU
-4GB 667MHz DDR2 RAM
-500GB SATA HD
-Vista x64 SP1
-BOINC 6.2.19 32-bit client
-SSE3 optimized 32-bit apps
ID: 822379 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 822395 - Posted: 23 Oct 2008, 22:53:25 UTC - in response to Message 822379.  

Macs don't get Astropulse automatically. You have to manually install the application and modify the app_info.xml file. See this thread.


I thought that was only true if you are running an optimized application, regardless of platform. Macs (nor PCs) should have an app_info.xml file by default.
ID: 822395 · Report as offensive
Profile Mumps [MM]
Volunteer tester
Avatar

Send message
Joined: 11 Feb 08
Posts: 4454
Credit: 100,893,853
RAC: 30
United States
Message 822405 - Posted: 23 Oct 2008, 23:18:49 UTC - in response to Message 822395.  

Macs don't get Astropulse automatically. You have to manually install the application and modify the app_info.xml file. See this thread.


I thought that was only true if you are running an optimized application, regardless of platform. Macs (nor PCs) should have an app_info.xml file by default.

IIRC, there is no official MAC O/S release of AstroPulse... So to run AP, you would have to use a custom build such as Dotsch's...

The only binaries that seem to live on "fanout" are for Win 32 bit and Linux 32/64 bit.
ID: 822405 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 822422 - Posted: 24 Oct 2008, 0:05:03 UTC - in response to Message 822405.  

Macs don't get Astropulse automatically. You have to manually install the application and modify the app_info.xml file. See this thread.


I thought that was only true if you are running an optimized application, regardless of platform. Macs (nor PCs) should have an app_info.xml file by default.

IIRC, there is no official MAC O/S release of AstroPulse... So to run AP, you would have to use a custom build such as Dotsch's...

The only binaries that seem to live on "fanout" are for Win 32 bit and Linux 32/64 bit.


Ah. OK. It was the word 'modify' that made it sound like Macs would have one by default, as opposed to 'create'.
ID: 822422 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 822452 - Posted: 24 Oct 2008, 1:09:00 UTC - in response to Message 822341.  

I have noticed that none of my Mac clients are getting AP WUs, and these are by far the fastest clients I have.

Is this an AP WU deployment error?

My MUCH slower Windows-based clients are gettting AP WUs, and sometimes these will crunch for more than 100 hours on a single WU. Is this expected behavior? I thought that slower clients weren't supposed to get AP WUs.


Crunch3r has just released an optimized SSE3 Mac AP app.

http://calbe.dw70.de/astrop/astropulse-4.28.i686-apple-darwin_sse3.zip

I have it installed, but it will be a couple of days before the 2 units I downloaded make it to the top of the list.

ID: 822452 · Report as offensive
HFB1217
Avatar

Send message
Joined: 25 Dec 05
Posts: 102
Credit: 9,424,572
RAC: 0
United States
Message 824658 - Posted: 29 Oct 2008, 18:20:42 UTC
Last modified: 29 Oct 2008, 19:04:11 UTC

I run performance monitor on my Quads It lets me check the CPU usage in real time. When observing AP work units a few times I will observe an erratic CPU usage showing wild swings in CPU loads and speeds. ESIT is disabled so the speeds should be constant. Normally the speeds and loads are constant at 100% and 3.86 gig.


Shut down and restating Boinc does not correct the problem and I have to abort the AP unit. In checking the wing-man's results I see that they will have errored out after completing the AP work unit. I have lucked out and observed the problem early in the crunching process. After aborting the erratic WU a new one starts without any problems and completes to success.

But what gives with the work units? Why are they failing?
Any ideas what performance monitor is indicating??

Here is a link to a set with myself and the wing man.
http://setiathome.berkeley.edu/workunit.php?wuid=349461703

This is my work unit in the grouping http://setiathome.berkeley.edu/result.php?resultid=1027661304
Come and Visit Us at
BBR TeamStarFire


****My 9th year of Seti****A Founding Member of the Original Seti Team Starfire at Broadband Reports.com ****
ID: 824658 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 824747 - Posted: 30 Oct 2008, 0:22:07 UTC - in response to Message 824658.  
Last modified: 30 Oct 2008, 0:26:58 UTC

I run performance monitor on my Quads It lets me check the CPU usage in real time. When observing AP work units a few times I will observe an erratic CPU usage showing wild swings in CPU loads and speeds. ESIT is disabled so the speeds should be constant. Normally the speeds and loads are constant at 100% and 3.86 gig.


Shut down and restating Boinc does not correct the problem and I have to abort the AP unit. In checking the wing-man's results I see that they will have errored out after completing the AP work unit. I have lucked out and observed the problem early in the crunching process. After aborting the erratic WU a new one starts without any problems and completes to success.

But what gives with the work units? Why are they failing?
Any ideas what performance monitor is indicating??

Here is a link to a set with myself and the wing man.
http://setiathome.berkeley.edu/workunit.php?wuid=349461703

This is my work unit in the grouping http://setiathome.berkeley.edu/result.php?resultid=1027661304


Now that "0'd statefile" text in the stderr is a clue, and is part of stock code designed to try and track down an issue that seems to occur rarely (with both stock and optimised 4.35). The 'erratic CPU jumping' is the app code trying to wait for the statefile to recover, so is a stock induced symptom of the problem not a cause.

We have seen that occur only once during testing, and have been trying a fix in the optimised code. If yourself (Or anyone else reading this) comes across a WU that does that, please try and preserve the workunit by making a copy of it. (note that we'll need maybe a few of these WUs to completely verify if our fix attempts are working, but not heaps of them)
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 824747 · Report as offensive
HFB1217
Avatar

Send message
Joined: 25 Dec 05
Posts: 102
Credit: 9,424,572
RAC: 0
United States
Message 824802 - Posted: 30 Oct 2008, 3:32:38 UTC - in response to Message 824747.  
Last modified: 30 Oct 2008, 3:37:28 UTC

Jason_gee thanks for the answer I have had EIGHT of them in a few days days All were from a contiguous download.

If this occurs again I will not abort it only suspend it and post it to the forum.

Thanks again.

Hank
Come and Visit Us at
BBR TeamStarFire


****My 9th year of Seti****A Founding Member of the Original Seti Team Starfire at Broadband Reports.com ****
ID: 824802 · Report as offensive
Profile SATAN
Avatar

Send message
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 826316 - Posted: 2 Nov 2008, 23:06:21 UTC

<core_client_version>6.2.18</core_client_version>
<![CDATA[
<stderr_txt>
In ap_gfx_main.cpp: in ap_graphics_init(): Starting client.
AstroPulse v. 4.35
FFTW USE_CONVERSION_OPT SPLIT_COMPLEX USE_SSE3
OSX 32bit FFTW/SSE3
In ap_gfx_main.cpp: in ap_graphics_init(): Starting client.
###Restarted at _some_ percent.
called boinc_finish

</stderr_txt>
]]>
Has ended with a client state of VALID, so, SATAN is not a happy cruncher at the minute.
ID: 826316 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 826323 - Posted: 2 Nov 2008, 23:35:37 UTC - in response to Message 826316.  

Eric will probably fix that one as I saw no problems whatsoever, of course there is still a third computer in the mix as well.

ID: 826323 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 826429 - Posted: 3 Nov 2008, 3:41:03 UTC - in response to Message 826323.  

Eric will not need to do anything. No canonical result has been chosen, so it was sent to a third host and when that result is received all three will be checked.
                                                                  Joe
ID: 826429 · Report as offensive
Profile SATAN
Avatar

Send message
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 826495 - Posted: 3 Nov 2008, 11:12:21 UTC - in response to Message 826429.  

Cheers Josef.
ID: 826495 · Report as offensive
Profile burnz
Avatar

Send message
Joined: 14 Apr 04
Posts: 26
Credit: 178,564
RAC: 0
Australia
Message 826498 - Posted: 3 Nov 2008, 11:20:13 UTC

what is an astroplause WU, is this new type of data from aracebo??
i have an ap wu worth 144 hrs http://setiathome.berkeley.edu/workunit.php?wuid=358469828
ID: 826498 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 826499 - Posted: 3 Nov 2008, 11:23:21 UTC - in response to Message 826498.  

what is an astroplause WU, is this new type of data from aracebo??
i have an ap wu worth 144 hrs http://setiathome.berkeley.edu/workunit.php?wuid=358469828

Astropulse FAQ
ID: 826499 · Report as offensive
Profile burnz
Avatar

Send message
Joined: 14 Apr 04
Posts: 26
Credit: 178,564
RAC: 0
Australia
Message 826502 - Posted: 3 Nov 2008, 11:37:00 UTC - in response to Message 826499.  

thank you
ID: 826502 · Report as offensive
K5GP - Dr Gene Preston

Send message
Joined: 29 May 99
Posts: 3
Credit: 1,531,179
RAC: 1
United States
Message 826994 - Posted: 4 Nov 2008, 14:17:16 UTC - in response to Message 818723.  

Further testing shows that my screen blanking not going blank was caused by an MRU (spyware) being posted every time I used Bill Gates Internet Explorer web browser. I can locate and remove this spyware using the AdAware free program. When I get one of these MRUs the screen turn off feature stops working. There is nothing wrong with the SETI software. Thanks.. Dr Gene Preston
Gene ... g.preston@ieee.org
ID: 826994 · Report as offensive
Profile tullio
Volunteer tester

Send message
Joined: 9 Apr 04
Posts: 8797
Credit: 2,930,782
RAC: 1
Italy
Message 830144 - Posted: 14 Nov 2008, 6:38:57 UTC

I am running the SSE3 optimized astropulse app on a Linux box. When it terminates successfully, no credit is given, even if a wingman has terminated successfully. The same day, the WU is sent to a third wingman and only if he is successful too the credit is given. Does this means the the astropulse quorum is three, or that the results of an optimized app on Linux and a standard app on Windows are somehow different? I see my wingman is using standard app from his stderr.txt file which is different from mine and more verbose,
Tullio
ID: 830144 · Report as offensive
Profile Leaps-from-Shadows
Volunteer tester
Avatar

Send message
Joined: 11 Aug 08
Posts: 323
Credit: 259,220
RAC: 0
United States
Message 830174 - Posted: 14 Nov 2008, 7:28:33 UTC

If that happens, it means that the results from the first two machines weren't similar enough to truly validate. Once the third machine returns a result, a canonical result will be chosen and credit will be given.

Happens all the time, and as long as you (eventually) receive credit for it, I wouldn't worry about it.
Cruiser
Gateway GT5692 L-f-S Edition
-Phenom X4 9650 CPU
-4GB 667MHz DDR2 RAM
-500GB SATA HD
-Vista x64 SP1
-BOINC 6.2.19 32-bit client
-SSE3 optimized 32-bit apps
ID: 830174 · Report as offensive
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · Next

Message boards : Number crunching : AstroPulse errors - Reporting


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.