Hole in the Rock (Jan 22 2009)


Message boards : Technical News : Hole in the Rock (Jan 22 2009)

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 856590 - Posted: 22 Jan 2009, 23:34:01 UTC

We continue to have problems mounting our raw data drives (which we fill down at Arecibo and drain up here). The symptoms are random, the error messages are random, and where these messages actually appear is random. Jeff and I are pretty much giving up trying to figure it out. We'll most likely remove as many moving parts as possible from the whole system and deal with continuing issues as they arise. Not sure who/what to blame. Linux? SATA? USB? The enclosures? The cables? The drives themselves?

I actually got the software radar blanker working. Whether or not the output it generates is worth anything remains to be seen, but at first glance it looks pretty good. The real proof will come when I run it on a whole file, make some workunits, and then see if these workunits explode.

- Matt

____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 8376
Credit: 46,791,997
RAC: 23,602
United Kingdom
Message 856595 - Posted: 22 Jan 2009, 23:48:24 UTC

Ye gods. Exploding workunits? Does my warranty cover that? <grin>

1mp0£173
Volunteer tester
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 856601 - Posted: 23 Jan 2009, 0:13:23 UTC - in response to Message 856595.
Last modified: 23 Jan 2009, 0:13:40 UTC

> Ye gods. Exploding workunits? Does my warranty cover that? <grin>

Who cares about the warranty, who is going to clean up the mess? :-)

Geek@Play
Volunteer tester
Joined: 31 Jul 01
Posts: 2463
Credit: 85,121,964
RAC: 12,158
United States
Message 856619 - Posted: 23 Jan 2009, 0:44:16 UTC

Matt... you're bringing back some wonderful memories.

Been there, done that.

____________
Boinc....Boinc....Boinc....Boinc....

Dr. C.E.T.I.
Joined: 29 Feb 00
Posts: 15993
Credit: 690,597
RAC: 6
United States
Message 856620 - Posted: 23 Jan 2009, 0:44:26 UTC


. . . Thanks as usual for the updates, Matt.

P.S. You might look into something I sent Eric a short bit ago. It's interesting.

____________
BOINC Wiki . . .

Science Status Page . . .

WinterKnight
Volunteer tester
Joined: 18 May 99
Posts: 8520
Credit: 23,138,564
RAC: 15,929
United Kingdom
Message 856680 - Posted: 23 Jan 2009, 2:52:50 UTC

So now, after the exploding downloads (got *much* more work than asked for), we are going to get exploding tasks.
Time to dig out the protective clothing etc.

KWSN THE Holy Hand Grenade!
Volunteer tester
Joined: 20 Dec 05
Posts: 1902
Credit: 9,223,143
RAC: 13,684
United States
Message 856753 - Posted: 23 Jan 2009, 10:07:12 UTC
Last modified: 23 Jan 2009, 10:16:09 UTC

At about 0200 PST, I started getting:

1/23/2009 1:56:17 AM|SETI@home|Message from server: No work sent
1/23/2009 1:56:17 AM|SETI@home|Message from server: CPU app exists for SETI@home Enhanced but no CPU work requested
1/23/2009 1:56:17 AM|SETI@home|Message from server: (your BOINC client is old - please install current version)

followed by:

1/23/2009 1:58:50 AM|SETI@home|Message from server: No work sent
1/23/2009 1:58:50 AM|SETI@home|Message from server: CPU app exists for SETI@home Enhanced but no CPU work requested
1/23/2009 1:58:50 AM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.
1/23/2009 1:58:50 AM|SETI@home|Message from server: (your BOINC client is old - please install current version)

This came just after successfully getting a block of 20 WUs for the same computer (which is running version 5.10.45...). Did something change on Berkeley's end? 'Cause nothing changed on mine!

[add]Each time I got these messages, I also got "communication deferred" for 24 hrs, BTW.[/add]
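
For anyone comparing scheduler replies like the two batches above, here is a minimal sketch that groups the server messages by request time so the differences between replies stand out. The `timestamp|project|text` layout is only inferred from the pasted lines, not taken from any BOINC specification, and the names `parse_line` and `group_server_messages` are hypothetical helpers made up for this illustration.

```python
from collections import defaultdict
from datetime import datetime

# Sample lines copied from the post above.
LOG = """\
1/23/2009 1:56:17 AM|SETI@home|Message from server: No work sent
1/23/2009 1:56:17 AM|SETI@home|Message from server: CPU app exists for SETI@home Enhanced but no CPU work requested
1/23/2009 1:56:17 AM|SETI@home|Message from server: (your BOINC client is old - please install current version)
1/23/2009 1:58:50 AM|SETI@home|Message from server: No work sent
1/23/2009 1:58:50 AM|SETI@home|Message from server: CPU app exists for SETI@home Enhanced but no CPU work requested
1/23/2009 1:58:50 AM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.
1/23/2009 1:58:50 AM|SETI@home|Message from server: (your BOINC client is old - please install current version)
"""

PREFIX = "Message from server: "


def parse_line(line):
    """Split one 'timestamp|project|text' line (assumed layout) into fields."""
    stamp, project, text = line.split("|", 2)
    when = datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")
    return when, project, text


def group_server_messages(log_text):
    """Map (timestamp, project) to the list of server messages in that reply."""
    grouped = defaultdict(list)
    for line in log_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        when, project, text = parse_line(line)
        if text.startswith(PREFIX):
            grouped[(when, project)].append(text[len(PREFIX):])
    return dict(grouped)


if __name__ == "__main__":
    for (when, project), messages in sorted(group_server_messages(LOG).items()):
        print(when.isoformat(), project)
        for message in messages:
            print("   ", message)
```

Grouped this way, the second reply differs from the first only by the extra "No work available for the applications you have selected" line.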
____________
.

Pooh Bear 27
Volunteer tester
Joined: 14 Jul 03
Posts: 3221
Credit: 2,068,389
RAC: 1,826
United States
Message 856973 - Posted: 24 Jan 2009, 0:20:51 UTC

Matt,

Thanks for all the updates.

What happened to your taking Thursday off, and being in on Friday? That didn't last long.


OzzFan
Volunteer tester
Joined: 9 Apr 02
Posts: 13542
Credit: 29,416,745
RAC: 16,005
United States
Message 857079 - Posted: 24 Jan 2009, 5:55:28 UTC - in response to Message 856590.

> Not sure who/what to blame. Linux? SATA? USB? The enclosures? The cables? The drives themselves?

Either blame Canada or blame Misfit.

Zebra3
Joined: 22 Oct 01
Posts: 186
Credit: 13,657,908
RAC: 0
Canada
Message 857118 - Posted: 24 Jan 2009, 8:52:03 UTC - in response to Message 857079.

>> Not sure who/what to blame. Linux? SATA? USB? The enclosures? The cables? The drives themselves?
>
> Either blame Canada or blame Misfit.

It's always Canada's fault... why can't it be Switzerland's fault this time... lol
____________
http://www.novascotia.com

Greg
Joined: 12 Oct 07
Posts: 6
Credit: 1,031,943
RAC: 0
Australia
Message 857141 - Posted: 24 Jan 2009, 10:44:27 UTC

Well, you know the golden rule of flaky hardware: it's always the last thing you test!

If all else fails, check that the hamster's wheel hasn't shifted off axis.

Lois Petrolito
Joined: 9 Nov 03
Posts: 10
Credit: 68,170
RAC: 0
United States
Message 857185 - Posted: 24 Jan 2009, 15:15:21 UTC

Maybe that's why I haven't gotten any new work in about a week. Or, it could be BOINC itself doing it. I'm on multiple projects, and when I get work units from one, I get nothing from anyone else unless I suspend that particular project.

Fluffy
Volunteer tester
Joined: 30 Nov 00
Posts: 44
Credit: 200,235
RAC: 0
United States
Message 857202 - Posted: 24 Jan 2009, 15:58:39 UTC - in response to Message 857118.

>>> Not sure who/what to blame. Linux? SATA? USB? The enclosures? The cables? The drives themselves?
>>
>> Either blame Canada or blame Misfit.
>
> It's always Canada's fault... why can't it be Switzerland's fault this time... lol

There are no Gremlins in Switzerland. AMC couldn't afford to send any there.
____________

HAY! Watch this!
LLLLEEEEEERRRROOOOOYYYYY!
...yikes...

clive G1FYE
Volunteer moderator
Joined: 4 Nov 04
Posts: 1300
Credit: 23,054,144
RAC: 0
United Kingdom
Message 857265 - Posted: 24 Jan 2009, 18:46:19 UTC

Since about 15:00 UTC Friday, the gigabit network graphs show that the connection is running at maximum rate.
I'm getting connection timeouts.
Everything seems to be working;
it's just got a lot of work to do.

Neil Blaikie
Volunteer tester
Joined: 17 May 99
Posts: 142
Credit: 6,466,200
RAC: 6
Canada
Message 857267 - Posted: 24 Jan 2009, 18:49:46 UTC

Seems like something is going on, as I haven't been able to upload anything since very early this morning.

Kaylie
Joined: 26 Jul 08
Posts: 39
Credit: 332,100
RAC: 0
United States
Message 857304 - Posted: 24 Jan 2009, 20:07:19 UTC - in response to Message 857267.

Is it possible that work is being split into such small parts that there are so many of them they are jamming up transfers? It seems all the MB tasks I have been getting are smaller than usual.

Bendarr
Joined: 27 Dec 08
Posts: 1
Credit: 11,624
RAC: 0
United States
Message 857312 - Posted: 24 Jan 2009, 20:13:06 UTC - in response to Message 857267.
Last modified: 24 Jan 2009, 20:13:46 UTC

I am likewise having upload problems. I had roughly 4 units and managed to upload maybe 2 of them. I've got 2 more, but I keep getting deferred as well.

Adrian Foster
Joined: 26 Jul 06
Posts: 3
Credit: 98,885
RAC: 4
United Kingdom
Message 857319 - Posted: 24 Jan 2009, 20:19:31 UTC

My post may or may not be related to this thread; I don't know. All I know is that for one whole week now my computer has suspended all work on BOINC: all computation and all uploads. I have an Astropulse task ready to upload and for some reason the computer has suspended it. I cannot unsuspend it. Oh, and I promise you, I have done nothing in that week. I've barely used the computer, in fact. Why would it be doing this? I cannot unsuspend it, upload the completed file, or get new ones.

Krisk
Joined: 18 Jul 99
Posts: 9
Credit: 1,845,297
RAC: 0
United States
Message 857333 - Posted: 24 Jan 2009, 20:43:25 UTC

Haven't been able to upload since last night, 1/23/09, 22:30 PST.

Misfit
Volunteer tester
Joined: 21 Jun 01
Posts: 21790
Credit: 2,510,901
RAC: 0
United States
Message 857339 - Posted: 24 Jan 2009, 21:16:54 UTC - in response to Message 857079.

>> Not sure who/what to blame. Linux? SATA? USB? The enclosures? The cables? The drives themselves?
>
> Either blame Canada or blame Misfit.

I was going to quote that line and say how disappointed I was. I am no longer disappointed.

