Fiber channel woes, Chicken App, etc. (May 21 2007)

Message boards : Technical News : Fiber channel woes, Chicken App, etc. (May 21 2007)

arkayn
Volunteer tester
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 573763 - Posted: 22 May 2007, 14:04:03 UTC

I got a weird message last night while I was sleeping.

I had my iMac working without the app_info.xml with the optimized client. It was working fine, and then all of a sudden this message popped up and every unit that was waiting to run errored out.

2007-05-22 01:36:41 [SETI@home] [error] Application file seti_enhanced-i386-v7.2-core2-nographics missing signature
2007-05-22 01:36:41 [SETI@home] [error] BOINC cannot accept this file
2007-05-22 01:36:41 [SETI@home] Deferring communication for 1 min 0 sec
2007-05-22 01:36:41 [SETI@home] Reason: Unrecoverable error for result 16ja05ab.25838.14656.322166.3.126_0 (Can't create shared memory: system shmat)
2007-05-22 01:36:42 [SETI@home] Deferring communication for 1 min 0 sec
2007-05-22 01:36:42 [SETI@home] Reason: Unrecoverable error for result 11fe05aa.10139.22336.390890.3.204_0 (app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>seti_enhanced-i386-v7.2-core2-nographics</file_name>
<error_code>-123</error_code>
<error_message>missing signature</error_message>
</file_xfer_error>
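
For anyone who wants to sift a long message log for lines like the ones above, here is a minimal Python sketch. It assumes the `YYYY-MM-DD HH:MM:SS [project] [level] text` layout shown in this post only; the pipe-separated layout that other posters paste in this thread is not handled. The names `parse_line` and `error_lines` are made up for illustration and are not part of BOINC.

```python
import re
from typing import NamedTuple, Optional

# Matches lines such as:
#   2007-05-22 01:36:41 [SETI@home] [error] BOINC cannot accept this file
# The "[error]" tag is optional; plain status lines do not carry one.
LINE_RE = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<project>[^\]]+)\] "
    r"(?:\[(?P<level>[^\]]+)\] )?"
    r"(?P<text>.*)$"
)


class LogLine(NamedTuple):
    timestamp: str
    project: str
    level: Optional[str]  # None when the line has no [level] tag
    text: str


def parse_line(line: str) -> Optional[LogLine]:
    """Return the parsed line, or None if it does not match the layout."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    return LogLine(m["timestamp"], m["project"], m["level"], m["text"])


def error_lines(lines):
    """Keep only the lines tagged [error]."""
    parsed = (parse_line(line) for line in lines)
    return [p for p in parsed if p is not None and p.level == "error"]
```

Continuation lines such as the `<file_xfer_error>` block do not start with a timestamp, so `parse_line` returns `None` for them and they are skipped.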


ID: 573763

J. Ritchie Morrow
Joined: 26 Nov 03
Posts: 30
Credit: 910,752
RAC: 0
United States
Message 573831 - Posted: 22 May 2007, 15:12:54 UTC

I've been trying to follow all the posts and keep up on the 'hiccups' but really am not following (I'm still trying to figure out what this 'Chicken App' is...). So, forgive me if this is rehashing something that has already been discussed and I just missed it...

My SETI results board is showing that I have 19 outstanding WUs that I should be processing. However, when I pull up BOINC there is nothing from SETI. I'm also processing for ROSETTA and have been getting WUs from them just fine (although it seems that I manually have to do an update to get them submitted...). I have tried exiting BOINC and restarting but that didn't work. Any other suggestions/comments/ideas welcome...

Thanks!
ID: 573831

crazyrabbit1
Joined: 17 Sep 06
Posts: 35
Credit: 2,282,319
RAC: 0
Germany
Message 573841 - Posted: 22 May 2007, 15:52:21 UTC - in response to Message 573831.  
Last modified: 22 May 2007, 15:53:42 UTC

I've been trying to follow all the posts and keep up on the 'hiccups' but really am not following (I'm still trying to figure out what this 'Chicken App' is...). So, forgive me if this is rehashing something that has already been discussed and I just missed it...

My SETI results board is showing that I have 19 outstanding WUs that I should be processing. However, when I pull up BOINC there is nothing from SETI. I'm also processing for ROSETTA and have been getting WUs from them just fine (although it seems that I manually have to do an update to get them submitted...). I have tried exiting BOINC and restarting but that didn't work. Any other suggestions/comments/ideas welcome...

Thanks!


Oh, the chicken soup is really great stuff. It is an optimized app for SETI and is a lot faster than the original app, but at the moment there is a server-side problem with non-standard apps. They can cause ghost WUs, meaning the results page says you have WUs but your client never received them. I also got a new ghost today.
I can't say anything about Rosetta; I only run SETI and Einstein.

HTH

ps.
you can download the chicken app at http://lunatics.at/index.php?module=Downloads
ID: 573841

Demiurg
Volunteer tester
Joined: 2 Jul 02
Posts: 883
Credit: 28,286
RAC: 0
Sweden
Message 573845 - Posted: 22 May 2007, 15:54:47 UTC - in response to Message 573841.  

I've been trying to follow all the posts and keep up on the 'hiccups' but really am not following (I'm still trying to figure out what this 'Chicken App' is...). So, forgive me if this is rehashing something that has already been discussed and I just missed it...

My SETI results board is showing that I have 19 outstanding WUs that I should be processing. However, when I pull up BOINC there is nothing from SETI. I'm also processing for ROSETTA and have been getting WUs from them just fine (although it seems that I manually have to do an update to get them submitted...). I have tried exiting BOINC and restarting but that didn't work. Any other suggestions/comments/ideas welcome...

Thanks!


Oh, the chicken soup is really great stuff. It is an optimized app for SETI and is a lot faster than the original app, but at the moment there is a server-side problem with non-standard apps. They can cause ghost WUs, meaning the results page says you have WUs but your client never received them. I also got a new ghost today.
I can't say anything about Rosetta; I only run SETI and Einstein.

HTH

ps.
you can download the chicken app at http://lunatics.at/index.php?module=Downloads


The bug in the Boinc client has been fixed and will probably be updated during the weekly outage.
Carl
It is SEXY to DONATE!
Skype = demiurg2
ID: 573845

Francesco Forti
Joined: 24 May 00
Posts: 334
Credit: 204,421,005
RAC: 15
Switzerland
Message 573848 - Posted: 22 May 2007, 16:00:22 UTC - in response to Message 573180.  


I talked to Blurf this morning and learned that people using Simon's optimized "Chicken App" were having problems connecting with that app, but not with the normal app. The problem seems to have resolved somewhat, since some people using it are getting work now. I don't know what caused it.


Eric, thanks for your info.
Here is some info from me.
In my experience, on the hosts where I have done the app_info.xml
workaround, everything is now working as usual.
But I have some remote hosts on which I haven't done the workaround, and
I can see on their results pages that they are not working. Sometimes they
download some jobs (ghosts?) but they don't send anything back (upload or report).
I have now visited one of them. It had no tasks, even though its results page
was full of Rus sent. I did the app_info.xml workaround
7 hours ago and now that host is working.

So I think that the problem is still open for those who have
an app_info file and are not able (or don't know how) to do the
workaround.

Bye,
Franz
ID: 573848

Josef W. Segur
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 573853 - Posted: 22 May 2007, 16:07:32 UTC - in response to Message 573831.  

I've been trying to follow all the posts and keep up on the 'hiccups' but really am not following (I'm still trying to figure out what this 'Chicken App' is...). So, forgive me if this is rehashing something that has already been discussed and I just missed it...

Most of the discussion has been in the Number Crunching forum, where users can create threads. The 'Chicken app' term refers to optimised versions of setiathome_enhanced made available from a web site set up by Simon Zadra who uses the moniker "KWSN - Chicken of Angnor".

My SETI results board is showing that I have 19 outstanding WUs that I should be processing. However, when I pull up BOINC there is nothing from SETI. I'm also processing for ROSETTA and have been getting WUs from them just fine (although it seems that I manually have to do an update to get them submitted...). I have tried exiting BOINC and restarting but that didn't work. Any other suggestions/comments/ideas welcome...

Thanks!

These are known as "ghost" units, and occur when the Scheduler thinks it told your host to download work but your host didn't get the message. It can happen any time, but is most likely during the storm of activity following an outage. The WUs will time out at deadline and be sent to another host if a canonical result hasn't been found.
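
To make the definition concrete: a ghost is a result the server lists as sent to a host that the host itself does not hold. A rough Python sketch of that comparison follows, assuming you have copied the result names from the results page and from the client's task list into two lists by hand. The function name `find_ghosts` is invented for illustration and is not a BOINC API.

```python
def find_ghosts(server_results, client_tasks):
    """Return result names the server lists for this host that the
    client does not hold, i.e. the likely ghosts.

    Both arguments are plain iterables of result-name strings,
    gathered by hand; nothing here talks to BOINC or the project.
    """
    return sorted(set(server_results) - set(client_tasks))


# Example with result names of the kind that appear in this thread.
on_results_page = [
    "16ja05ab.25838.14656.322166.3.126_0",
    "11fe05aa.10139.22336.390890.3.204_0",
]
in_client = ["16ja05ab.25838.14656.322166.3.126_0"]
ghosts = find_ghosts(on_results_page, in_client)
```

In the example, `ghosts` holds the one name that is on the results page but missing from the client.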

Because you're using the stock setiathome_enhanced you probably won't end up with more ghosts. For those running optimized apps or platforms for which the project doesn't supply stock apps, there's a recent problem with BOINC mishandling the anonymous platform information. That causes a lot of ghosts, and much of the discussion has focused on how to work around it for optimized apps.
Joe
ID: 573853

J. Ritchie Morrow
Joined: 26 Nov 03
Posts: 30
Credit: 910,752
RAC: 0
United States
Message 573859 - Posted: 22 May 2007, 16:15:42 UTC - in response to Message 573853.  

Josef and CrazyRabbit...thanks for the assistance!
ID: 573859

Mike Tilford
Joined: 18 Aug 99
Posts: 1
Credit: 8,915,154
RAC: 9
United States
Message 573872 - Posted: 22 May 2007, 16:44:40 UTC

Sorry for posting this here, but it will not let me start a new thread.

Twice today I have gotten this error:

5/22/2007 11:59:46 AM|SETI@home|[error] Error on file upload: [18mr05aa.13127.29649.928404.3.214_1_0] locked by file_upload_handler PID=1011104

And one work unit stays in the uploading state.

Any ideas? Or is this related to the other problems going on?

Thanks.

Mike.
ID: 573872

Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 573874 - Posted: 22 May 2007, 16:46:40 UTC - in response to Message 573872.  

Sorry for posting this here, but it will not let me start a new thread.

Twice today I have gotten this error:

5/22/2007 11:59:46 AM|SETI@home|[error] Error on file upload: [18mr05aa.13127.29649.928404.3.214_1_0] locked by file_upload_handler PID=1011104

And one work unit stays in the uploading state.

Any ideas? Or is this related to the other problems going on?

Thanks.

Mike.


Hi Mike, that probably means that there is a file upload handler process on this side that still has the file locked from a previous upload attempt. After our outage today the locks should be cleared.

Eric
@SETIEric@qoto.org (Mastodon)

ID: 573874

1mp0£173
Volunteer tester
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 573912 - Posted: 22 May 2007, 21:40:22 UTC - in response to Message 573747.  

Why not re-set the clock in the Win98 machine? Just right click on the time in the taskbar and choose "adjust Time/Date". when that machine needs a re-boot, go into setup and permanently adjust the time. (...or get a utility that will do the adjust for you, like Atomic Clock Sync from worldtimeserver.com) The clocks in most PC's (particularly overclocked ones) aren't particularly accurate.

Don't worry, I do that periodically, but it drifts - it isn't an important machine, I only keep it running because it does a few tasks better than the others, and it crunches, of course. (Hence the 'canary' function).

If I really need to know the time, I look at one of the servers which is configured for SNTP sync from a proper tier-2 public time server (much more reliable than the mickey-mouse M$ ones, which tend to be overloaded). They in turn keep their local domains in line through group policy.

Putting one of the NTP client apps on the machine would be nice. I've used Tardis on my server here, and K9 on most of the workstations. Very low overhead.
ID: 573912

Toweri's SETI Effort
Joined: 21 May 99
Posts: 7
Credit: 1,996,608
RAC: 0
Finland
Message 573936 - Posted: 22 May 2007, 22:05:25 UTC

No work from project - Help!

Having read here about all kinds of problems with optimized clients, and not getting any work for a week or so, I decided to try a clean re-install.

Now all I'm getting are error messages:

23.5.2007 1:01:27|SETI@home|Deferring communication for 11 sec
23.5.2007 1:01:27|SETI@home|Reason: requested by project
23.5.2007 1:01:27|SETI@home|Deferring communication for 8 min 40 sec
23.5.2007 1:01:27|SETI@home|Reason: no work from project
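
If you want to know how long the client is actually backing off, the deferral text can be turned into seconds. This is a small Python sketch that assumes only the two wordings seen in this thread ("N sec" and "N min N sec"); anything else, such as a deferral of hours, returns `None` rather than a guess. `deferral_seconds` is a hypothetical helper name.

```python
import re

# Accepts "... Deferring communication for 11 sec"
# and     "... Deferring communication for 8 min 40 sec".
DEFER_RE = re.compile(
    r"Deferring communication for (?:(\d+) min )?(\d+) sec\s*$"
)


def deferral_seconds(message):
    """Return the deferral length in seconds, or None if the message
    is not a deferral line in one of the two known wordings."""
    m = DEFER_RE.search(message)
    if m is None:
        return None
    minutes = int(m.group(1) or 0)  # the minutes part is optional
    seconds = int(m.group(2))
    return minutes * 60 + seconds
```

For the two deferral lines above this gives 11 and 520 seconds respectively.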

Backlog? Some server down?
Anyone else having this?
--
I have been known to be wrong...
ID: 573936

PhonAcq
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 573941 - Posted: 22 May 2007, 22:06:55 UTC

Yes, and am also getting HTTP errors like before. Nothing has changed it seems.
ID: 573941

[B^S] Spydermb
Volunteer tester
Joined: 16 Jul 99
Posts: 496
Credit: 10,860,148
RAC: 0
United States
Message 573962 - Posted: 22 May 2007, 22:33:32 UTC

I'm also getting the HTTP errors, and yes, I run chicken. Other than going back to stock or renaming the app_info file, is there a solution to the issue? Do we have a possible timetable?
BOINC SYNERGY is an International Team and We Welcome All BOINC Participants!
ID: 573962

Fuzzy Hollynoodles
Volunteer tester
Joined: 3 Apr 99
Posts: 9659
Credit: 251,998
RAC: 0
Message 573993 - Posted: 22 May 2007, 22:58:38 UTC - in response to Message 573574.  


... My alarm clock is set for 5.5 hours from now. When I finish this message, I'm going to bed.

And with Matt gone, SETI's operations staff is essentially me and Jeff. Jeff has a real job, which means he doesn't work 24 hours a day. Lynn would also kill him if he tried. I'm a scientist, so I'm expected to work until I drop. After I drop I work in a reclining position. But I've got a proposal due on campus on Thursday, so I can't spend all my working hours watching the server logs. (I do, and have had two windows open on the feeder logs which I have been glancing at. Right now each system is handling about 10 results a second.)

Regarding censorship here. Please remember that most of the moderators are not university employees and they are human. Complain to the moderators list (setimods at ssl.berkeley.edu) or to me (korpela at ssl.berkeley.edu, warning: very aggressive spam filter) with a link to the posts in question and an explanation of what was meant. Under normal circumstances, moderation decisions can be overturned, or agreement can be reached about permissible language. Often times the problem can be including too much of a post which was deleted for a reason or withdrawn by the original poster with a request that quotes also be deleted.

Good night. 5h15 before the alarm goes off.

--

Eric


Thanks for the update, Eric, and also for your hard work. It's very much appreciated.

Yes, you are supposed to live in the lab, sleeping on an airbed on the floor, and be ready to take action every time the servers grumble, surviving on coffee and what snacks you can get from the vending machine. And that's on top of your classes and your other development work. ;-D

No, I really hope you are over the worst and soon can get some rest.


"I'm trying to maintain a shred of dignity in this world." - Me

ID: 573993

ohiomike
Joined: 14 Mar 04
Posts: 357
Credit: 650,069
RAC: 0
United States
Message 574003 - Posted: 22 May 2007, 23:08:19 UTC
Last modified: 22 May 2007, 23:08:47 UTC

He gets an airbed? Things have improved since I did that in the late 70's.

Boinc Button Abuser In Training >My Shrubbers<
ID: 574003

Steve MacKenzie
Volunteer tester
Joined: 2 Jan 00
Posts: 146
Credit: 6,504,803
RAC: 1
United States
Message 574022 - Posted: 22 May 2007, 23:28:43 UTC


First off. Thanks to the guys for all the very hard work on getting the systems
back up and running normally for the average user.
Sun Micro. This will be remembered and rewarded in a very positive way.
Eric, Matt, Carl and everyone in the LAB, how can we reward you guys?

How about a donation for a weekend in the mountains for everyone? Can we earmark donations for that ??? --- Ohhh oh well.

I for one am Soooooooo glad I never opted for all those overclocking chicken gizmos. I do understand the hobbyist interest in souped up hardware and software though. Me, I'm a plain vanilla guy.

I have just let BOINC run, and last week, when there were no SETI results to process, it got Einstein ones. Now it's back getting SETI units fine.

Just chugging along. Pulling further away from my closest sign up date companion who has been getting goose eggs for days.

To me, "If it ain't broke. Don't fix it" is king.

Steve
ID: 574022

Fuzzy Hollynoodles
Volunteer tester
Joined: 3 Apr 99
Posts: 9659
Credit: 251,998
RAC: 0
Message 574040 - Posted: 22 May 2007, 23:56:04 UTC - in response to Message 574003.  

He gets an airbed? Things have improved since I did that in the late 70's.


No, he hasn't got an airbed! He brought it himself; the old bones creaked too much from sleeping on the floor, even though he doesn't sleep, he's working while lying there. He only gets catnaps from time to time, but what the heck, he's a scientist. The sleep lab on campus is running its experiments on him for its research on sleep deprivation... ;-D



"I'm trying to maintain a shred of dignity in this world." - Me

ID: 574040

Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 574062 - Posted: 23 May 2007, 0:30:53 UTC - in response to Message 573941.  

Yes, and am also getting HTTP errors like before. Nothing has changed it seems.


Jeff is still trying to get the latest server code to work. The CGI still segfaults somewhere in the bowels of the BOINC server code. Once we get it working, we'll install it in beta. I'll keep you informed.

If you are using the Chicken app, your options are to use the "rename app_info.xml" trick in order to get new work, temporarily change to a stock app, or to join a second project.
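
The thread does not spell out the full steps of the rename trick, so the following Python sketch covers only the file rename itself, done reversibly. Everything around it is an assumption: that you stop the BOINC client first, that you know where your SETI@home project directory is, and that parking the file under a `.disabled` suffix is acceptable. The function name and the suffix are made up for illustration.

```python
from pathlib import Path


def set_app_info_enabled(project_dir, enabled, disabled_suffix=".disabled"):
    """Park or restore app_info.xml inside project_dir.

    enabled=False renames app_info.xml to app_info.xml.disabled;
    enabled=True renames it back. Returns the path of the file in
    its new state. The suffix is an arbitrary, hypothetical choice.
    """
    project_dir = Path(project_dir)
    active = project_dir / "app_info.xml"
    parked = project_dir / ("app_info.xml" + disabled_suffix)
    src, dst = (parked, active) if enabled else (active, parked)

    if dst.exists():
        if src.exists():
            # Both copies present: refuse to overwrite either one.
            raise FileExistsError(f"both {src} and {dst} exist")
        return dst  # already in the requested state
    if not src.exists():
        raise FileNotFoundError(f"no app_info.xml found in {project_dir}")
    src.rename(dst)
    return dst
```

Calling it with `enabled=False` before fetching work and `enabled=True` afterwards keeps the original file intact, so the optimized app's configuration is not lost.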

If you are using anonymous platform by necessity, the only option is to join a second project that has an app that runs on your platform.

Sorry. :(

--
@SETIEric@qoto.org (Mastodon)

ID: 574062

Misfit
Volunteer tester
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 574079 - Posted: 23 May 2007, 0:52:07 UTC - in response to Message 574062.  

Jeff is still trying to get the latest server code to work. The CGI still segfaults somewhere in the bowels of the BOINC server code. Once we get it working, we'll install it in beta. I'll keep you informed.

If you are using the Chicken app, your options are to use the "rename app_info.xml" trick in order to get new work, temporarily change to a stock app, or to join a second project.

If you are using anonymous platform by necessity, the only option is to join a second project that has an app that runs on your platform.

Sorry. :(

--

Here I go again... I'm using chicken and everything is fine on my end without any modification on my part. You just have to give it time as you would after every outage. The longer the outage the more time is required.
me@rescam.org
ID: 574079

RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 574167 - Posted: 23 May 2007, 2:34:59 UTC

chicken app still BROKE
ID: 574167


 