Computation Error - Bad Workunit Header

Message boards : Number crunching : Computation Error - Bad Workunit Header
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 8 · Next

AuthorMessage
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 723748 - Posted: 9 Mar 2008, 15:45:49 UTC - in response to Message 723731.  


[quote] <snip> (Or maybe Matt hasn't got all the status page pointers pointing right.)


eh all - Eric told me yesterday that Matt's NOT in for a bit and shall be returning soon - whenever that'll be . . . ;)


BOINC Wiki . . .

Science Status Page . . .
ID: 723748 · Report as offensive
MGCJerry
Avatar

Send message
Joined: 8 Dec 02
Posts: 37
Credit: 3,174,560
RAC: 0
United States
Message 723749 - Posted: 9 Mar 2008, 15:47:22 UTC

I've had quite a number of these bad work units over the past couple weeks. However, I've only had but one so far in this batch.

Unrecoverable error for result 13fe08ac.24787.11115.4.7.231_0 ( - exit code -1073741819 (0xc0000005))

ID: 723749 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 723750 - Posted: 9 Mar 2008, 15:48:24 UTC - in response to Message 723736.  

Are you running the 5.10.13 client on that machine?

Yes - why do you ask?

It seemed a relatively old version of the client.

As Andy says - if it ain't broke, why fix it?

I tend to be more interested in testing new science applications - that host spent some time doing a 100% timing run over at Einstein, to test a new app. I don't think it's a good idea to test two changes at once, so I tend to find a BOINC I like and stick with it until I see a stable upgrade with new features I'm interested in.

In this case, I liked the v5.10.xx range because of the separation between Connect Interval and Additional Cache: and I think that v5.10.13 was about the first of the range to be generally regarded as stable. So I upgraded from v5.8.16, and left it at that.
ID: 723750 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19013
Credit: 40,757,560
RAC: 67
United Kingdom
Message 723751 - Posted: 9 Mar 2008, 15:48:59 UTC - in response to Message 723747.  

Now here's a funny thing. I picked up a task from WU 234686530 overnight. It's a 13fe08ac.6032 datafile, and it has the same empty splitter_cfg block that I reported on yesterday: needless to say, it crashed out instantly with a 'Bad workunit header'.

Yet the WU was created at 8 Mar 2008 7:45:22 UTC, which is after this thread was started and we'd already started commenting on problems with 13fe08ac: and I'm quite certain there hasn't been a 13fe08ac tape in progress on the Server Status page since then.

So we have a supernumerary splitter, quite separate from the ten listed on Lando/Bambi, which is continuing to feed dodgy data into the SETI database.

I thought we had a beta project for things like that?


Are you running the 5.10.13 client on that machine?

Yes - why do you ask?


It seemed a relatively old version of the client.

The question is, do you belong to the school newer is better, Vista v XP or even 2000 or if it ain't broke why fix it? Especially if it allows you more control.


I was running 2000 up to about a month and a half ago. Upgraded to XP then.

With Microsoft anything less than about 3 years old is no better than a beta version. Except IE which is pre-Alpha without regard to the version.

Where, oh, where has OS/2 gone?

So like us you hang on to the old working, controllable version. Still running 2000 on Pent M here.

One answer, most of it was stolen and modified to become new technology.
ID: 723751 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 723752 - Posted: 9 Mar 2008, 15:49:24 UTC - in response to Message 723748.  


[quote] <snip> (Or maybe Matt hasn't got all the status page pointers pointing right.)


eh all - Eric told me yesterday that Matt's NOT in for a bit and shall be returning soon - whenever that'll be . . . ;)


Did Eric know that he's got a splitter running wild? Or did you tell him?
ID: 723752 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 723753 - Posted: 9 Mar 2008, 15:54:06 UTC - in response to Message 723752.  
Last modified: 9 Mar 2008, 15:56:52 UTC


[quote] <snip> (Or maybe Matt hasn't got all the status page pointers pointing right.)


eh all - Eric told me yesterday that Matt's NOT in for a bit and shall be returning soon - whenever that'll be . . . ;)


Did Eric know that he's got a splitter running wild? Or did you tell him?


< actually Richard, as You may of been aware of yesterday - Eric was 'a whole lot-a-busy' with some other Issues on the Boards that he had to deal with . . .

here's what i was talkin' to Eric re:


Hi Richard,

Matt's out sick and the donation processing script is offline with problems until he gets back. Sorry, and thanks for the donation.

Eric


so, IF i get the opportunity (with some luck) i'll mention your point today . . .

Respectfully,

richard
[edit] just mailed Eric note, Richard
BOINC Wiki . . .

Science Status Page . . .
ID: 723753 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 723754 - Posted: 9 Mar 2008, 16:00:18 UTC - in response to Message 723753.  

so, IF i get the opportunity (with some luck) i'll mention your point today . . .

Respectfully,

richard

Thanks for that. If Matt's off sick, Eric will need all the help he can get - and if the Moderators are being part of the problem instead of part of the solution, we'll have to find alternative ways.

Tell Eric he may find messages 723496 and 723462 have done some of his work for him.
ID: 723754 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 723760 - Posted: 9 Mar 2008, 16:14:19 UTC - in response to Message 723754.  
Last modified: 9 Mar 2008, 16:14:54 UTC

so, IF i get the opportunity (with some luck) i'll mention your point today . . .

Respectfully,

richard

Thanks for that. If Matt's off sick, Eric will need all the help he can get - and if the Moderators are being part of the problem instead of part of the solution, we'll have to find alternative ways.

Tell Eric he may find messages 723496 and 723462 have done some of his work for him.


> Richard - the Link that i sent him refers to this thread - so he'll see your other links too

as for help - there must be @ least ONE person - that could volunteer some help - as per Eric's request quite a number of times - where he requested assistance with many an Issue and as far as i have seen - nobody's jump in and offered . . . i think that those requests of Eric's get LOST in all that goes on here on the Boards . . . i wish i knew Code etc - i'd of been right there with the help needed - so, somehow we have to find a way to get the word out there to ALL those that are quite capable of assisting - but they're just NOT aware of the problems @ hand . . .

[edit] sp
BOINC Wiki . . .

Science Status Page . . .
ID: 723760 · Report as offensive
Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 723775 - Posted: 9 Mar 2008, 16:44:45 UTC - in response to Message 723760.  

Thanks. I think I have identified the problem and am working to solve it.

Eric
@SETIEric@qoto.org (Mastodon)

ID: 723775 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 723791 - Posted: 9 Mar 2008, 17:08:32 UTC - in response to Message 723775.  


Thanks. I think I have identified the problem and am working to solve it.

Eric


Thank You Sir! . . .


BOINC Wiki . . .

Science Status Page . . .
ID: 723791 · Report as offensive
Profile SATAN
Avatar

Send message
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 723794 - Posted: 9 Mar 2008, 17:10:51 UTC - in response to Message 723775.  

Thanks. I think I have identified the problem and am working to solve it.

Eric


Cheers boss!

ID: 723794 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 723799 - Posted: 9 Mar 2008, 17:13:58 UTC - in response to Message 723775.  

Thanks. I think I have identified the problem and am working to solve it.

Eric

Thanks, Eric.

Sorry we messed up your Sunday (again). Let us know if there's anything we can do to help.
ID: 723799 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 723828 - Posted: 9 Mar 2008, 18:21:32 UTC - in response to Message 723775.  

Thanks. I think I have identified the problem and am working to solve it.

Eric

Thamk you for doing this, just found out I have got two more so that is four, which for me would be a days work.
ID: 723828 · Report as offensive
Matthias Lehmkuhl Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 5 Oct 99
Posts: 28
Credit: 10,832,348
RAC: 53
Germany
Message 723836 - Posted: 9 Mar 2008, 18:36:37 UTC - in response to Message 723440.  

i got one WU with error - exit code -1073741819 (0xc0000005)
13fe08ac.23325
its not the same error, but also from 13fe08ac

one result is with setiathome_5.27_windows_intel boinc 5.8.16 XP SP2
my is with KWSN_2.4V_SSSE3_MB boinc 5.10.30 vista

both crunched results got the same error immediately after the start

wuid=234464976


next result finished with the same error
setiathome_5.27_windows_intel boinc 5.8.16 XP SP2
Matthias

ID: 723836 · Report as offensive
MadCat
Avatar

Send message
Joined: 10 Nov 07
Posts: 10
Credit: 1,548,024
RAC: 0
United States
Message 724101 - Posted: 10 Mar 2008, 9:37:44 UTC

Looks like I got one too. WU 778268241


ID: 724101 · Report as offensive
Odysseus
Volunteer tester
Avatar

Send message
Joined: 26 Jul 99
Posts: 1808
Credit: 6,701,347
RAC: 6
Canada
Message 724302 - Posted: 10 Mar 2008, 21:17:02 UTC

My G5 Mac has gotten a few errors from the 13fe08ac batch, but the problem is a little worse than bad headers: the tasks run for fifteen to twenty seconds before crashing.

<message>
process exited with code 131 (0x83)
</message>
<stderr_txt>
SIGBUS: bus error

Crashed executable name: seti_enhanced-ppc-v8-g5-nographics
built using BOINC library version 5.5.0
Machine type PowerPC 970
System version: Macintosh OS 10.4.11 build 8S165
Sun Mar  9 05:13:53 2008
Stack frame backtrace:
 #  Flags Frame Addr  Caller PC   Return Address Symbol
===  ===  ==========  ==========  =====================
  1  FP-  0x00000000  0x00000000  

Thread number 1: Stack frame backtrace:
 #  Flags Frame Addr  Caller PC   Return Address Symbol
===  ===  ==========  ==========  =====================
  1  F--  0x00000000  0x900411f8  mach_wait_until + 0x8
  2  ---  0xf0080c70  0x90040fc4  nanosleep + 0x184
  3  ---  0xf0080d10  0x90040df0  sleep + 0x90
  4  ---  0xf0080d80  0x000aaac8  _Z11boinc_sleepd + 0x30
  5  ---  0xf0080de0  0x0008f9f8  _Z12timer_threadPv + 0x28
  6  ---  0xf0080e30  0x9002bd08  _pthread_body + 0x60
  7  -P-  0xf0080f00  0x00000000  
  8  FP-  0x00000000  0xffffffffffffffff  

Exiting...
</stderr_txt>

ID: 724302 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 724314 - Posted: 10 Mar 2008, 22:18:00 UTC
Last modified: 10 Mar 2008, 22:42:50 UTC

I got today some more:

<message>
 - exit code -6 (0xfffffffa)
</message>
<stderr_txt>
SETI@home error -6 Bad workunit header
!swi.data_type || !found || !swi.nsamples
File: ..\\seti_header.cpp
Line: 235


</stderr_txt>



13fe08ac.6464. x 2
13fe08ac.6032. x 3 [EDIT: one more now -> x 4]
13fe08ac.8515.
13fe08ac.24787. x 2
13fe08ac.31738.
13fe08ac.23325.
13fe08ac.14310.
ID: 724314 · Report as offensive
Profile SATAN
Avatar

Send message
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 724317 - Posted: 10 Mar 2008, 22:20:36 UTC

I've given up counting. Must be about 100 of them over the past few days. Hopefully the cache I have is enough for them to have passed through the system before I get any more work.
ID: 724317 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 724320 - Posted: 10 Mar 2008, 22:26:35 UTC - in response to Message 724317.  

I've given up counting. Must be about 100 of them over the past few days. Hopefully the cache I have is enough for them to have passed through the system before I get any more work.

Just got home from work a bit ago, and started to check on the crunchers' activities for the day........
Went into a bit of a panic when I saw the phased quaddy with a bevy of compute errors showing........thought something had seriously gone away.....
Then upon further review, I realized it was a passel of these bad header WUs, and all the wingmen reporting them had errors as well....whew...
This kitties feel much better now...LOL.

"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 724320 · Report as offensive
Gwendolyn

Send message
Joined: 11 Oct 99
Posts: 1
Credit: 31,221
RAC: 0
United States
Message 724332 - Posted: 10 Mar 2008, 22:41:22 UTC

My workunits errors since the 8th are:

13fe08ac.24787.890.4.7.12_0
13fe08ac.6464.4162.5.7.125_1
13fe08ac.31738.12342.8.7.20_1
ID: 724332 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 8 · Next

Message boards : Number crunching : Computation Error - Bad Workunit Header


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.