Panic Mode On (46) Server problems


log in

Advanced search

Message boards : Number crunching : Panic Mode On (46) Server problems

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 12 · Next
Author Message
Sten-Arne
Volunteer tester
Send message
Joined: 1 Nov 08
Posts: 3406
Credit: 20,072,526
RAC: 22,025
Sweden
Message 1097135 - Posted: 14 Apr 2011, 15:32:09 UTC - in response to Message 1097064.
Last modified: 14 Apr 2011, 15:33:01 UTC

Looks like the Cricket Graph has taken a nose dive.

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d

Quoting from Lunatics:

The grapevine says 'Gowron failed a drive and is hung. It will probably be 3-4 days to re-sync the RAID array, so no work until then.'


3-4 days is no problem whatsoever. As long as this board stays up, so I can keep myself amused, reading all posts complaining and whining about what disaster it is, and how incompetent SETI staff is :-)
____________

Sten-Arne
Volunteer tester
Send message
Joined: 1 Nov 08
Posts: 3406
Credit: 20,072,526
RAC: 22,025
Sweden
Message 1097157 - Posted: 14 Apr 2011, 17:51:06 UTC

Obviously not too bad. From S@H home page:

Storage Server Issues
The server that stores our workunits crashed last night, but is recovering now. We hope to be fully back on line in a couple hours. 14 Apr 2011 | 16:51:19 UTC
____________

Profile HAL9000
Volunteer tester
Avatar
Send message
Joined: 11 Sep 99
Posts: 4189
Credit: 114,783,229
RAC: 143,920
United States
Message 1097170 - Posted: 14 Apr 2011, 19:18:06 UTC - in response to Message 1097158.
Last modified: 14 Apr 2011, 19:22:08 UTC

Obviously not too bad. From S@H home page:

Storage Server Issues
The server that stores our workunits crashed last night, but is recovering now. We hope to be fully back on line in a couple hours. 14 Apr 2011 | 16:51:19 UTC


Ooohh...
I like that flavor of grapes much better!

Looks like we are back online. Normal catch-up dropped connections seem to be in play.
Also the BOINC Stats images are fixed.
____________
SETI@home classic workunits: 93,865 CPU time: 863,447 hours

Join the BP6/VP6 User Group today!

Profile MikeProject donor
Volunteer tester
Avatar
Send message
Joined: 17 Feb 01
Posts: 24095
Credit: 33,111,584
RAC: 23,286
Germany
Message 1097418 - Posted: 15 Apr 2011, 9:01:33 UTC


9000 seconds backlog on replica balanced out in 3 hours.
Looks nice.


____________

Profile TOM
Volunteer tester
Avatar
Send message
Joined: 5 Apr 01
Posts: 59
Credit: 54,930,063
RAC: 0
Germany
Message 1097709 - Posted: 16 Apr 2011, 1:46:46 UTC

No uploads, No downloads...

>SETI@home 16.04.2011 03:40:09 Temporarily failed download of 17dc10ac.17841.9883.10.10.70: can't resolve hostname
>SETI@home 16.04.2011 03:40:09 Temporarily failed download of 17dc10ac.17841.9883.10.10.70: can't resolve hostname
>16.04.2011 03:40:12 Internet access OK - project servers may be temporarily down.
>SETI@home 16.04.2011 03:42:37 Temporarily failed upload of 18no10ab.2383.81964.16.10.67_0_0: can't resolve hostname

Server status page looks nice.

____________

Profile TOM
Volunteer tester
Avatar
Send message
Joined: 5 Apr 01
Posts: 59
Credit: 54,930,063
RAC: 0
Germany
Message 1097778 - Posted: 16 Apr 2011, 3:29:16 UTC

The crew is back in town ;-)

____________

Sten-Arne
Volunteer tester
Send message
Joined: 1 Nov 08
Posts: 3406
Credit: 20,072,526
RAC: 22,025
Sweden
Message 1099757 - Posted: 22 Apr 2011, 19:55:34 UTC

Panic is creeping in here. I have a terrible problem finding any problem with the project right now.

I may have to type format c: in a cmd window, to create a problem....

____________

Profile HAL9000
Volunteer tester
Avatar
Send message
Joined: 11 Sep 99
Posts: 4189
Credit: 114,783,229
RAC: 143,920
United States
Message 1099792 - Posted: 22 Apr 2011, 21:14:06 UTC - in response to Message 1099757.

Panic is creeping in here. I have a terrible problem finding any problem with the project right now.

I may have to type format c: in a cmd window, to create a problem....

Hmmm... that would only wipe the one partition.
I like a nice
C:\diskpart
DISKPART> select dive 0
DISKPART> clean

If that is to much you could do a boinccmd --set_network_mode never 5184000.
Sixty days of no network comms would defiantly cause a tiny bit of worry for BOINC.
____________
SETI@home classic workunits: 93,865 CPU time: 863,447 hours

Join the BP6/VP6 User Group today!

Dave
Avatar
Send message
Joined: 29 Mar 02
Posts: 774
Credit: 23,193,139
RAC: 0
United Kingdom
Message 1099802 - Posted: 22 Apr 2011, 21:55:40 UTC - in response to Message 1099792.

If that is to much you could do a boinccmd --set_network_mode never 5184000.
Sixty days of no network comms would defiantly cause a tiny bit of worry for BOINC.


That looks a useful cmd, if you don't want it to comm for a while (not 60 days).
____________

rcthardcore
Send message
Joined: 23 Nov 08
Posts: 19
Credit: 608,816
RAC: 3,172
United States
Message 1099873 - Posted: 23 Apr 2011, 3:33:55 UTC

SETI must be having server problems again. Taking forever just to download 2 workunits.
____________
http://www.boincstats.com/signature/user_2151976.gif

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5823
Credit: 59,052,982
RAC: 47,875
Australia
Message 1099906 - Posted: 23 Apr 2011, 6:55:59 UTC - in response to Message 1099873.


No problems, just a bunch of AP WOrk Units being distributed. An hour or 2 & the network traffic will drop off again & things will speed up.
____________
Grant
Darwin NT.

Sten-Arne
Volunteer tester
Send message
Joined: 1 Nov 08
Posts: 3406
Credit: 20,072,526
RAC: 22,025
Sweden
Message 1100055 - Posted: 23 Apr 2011, 18:42:51 UTC

Oh, the pain the pain....No more AP's to split. How can I survive the pain? I only have 10 days of AP's left on my AP only machine....

LOL
____________

Profile MikeProject donor
Volunteer tester
Avatar
Send message
Joined: 17 Feb 01
Posts: 24095
Credit: 33,111,584
RAC: 23,286
Germany
Message 1100101 - Posted: 23 Apr 2011, 21:10:35 UTC

I feel your pain Sten.

____________

Profile j mercer
Avatar
Send message
Joined: 3 Jun 99
Posts: 1446
Credit: 11,646,280
RAC: 230
United States
Message 1100107 - Posted: 23 Apr 2011, 21:39:26 UTC

owie owie owie . . .
____________
...

Profile Fred J. Verster
Volunteer tester
Avatar
Send message
Joined: 21 Apr 04
Posts: 3242
Credit: 31,786,107
RAC: 4,681
Netherlands
Message 1100115 - Posted: 23 Apr 2011, 22:07:39 UTC - in response to Message 1100101.
Last modified: 23 Apr 2011, 22:50:19 UTC

Those bursts, with maxed out bandwith, indicate, too much host trying to connect, at the same time, I think, in some cases, looking at the download speed,
which is wrong, cause it's UPLoads causing the trouble, although sometime also
DownLoad, but I never experienced this, files come through complete, or not
at all, which is a good thing, NetWork improvement as well hardware (GigaBitLAN) as software, fetching ahead, f.i., works very well.

Lots and lots of work, specially AP and MB and a lot VLARs and now I've been without power 12 hours, now restored,
2 rigs were shutdown, very rare a power outage in the Netherlands.
I'd almost forgotten about them, since the last major power outage in the north,
Groningen, happened about 30 years ago!

But it's also still hot, also very strange, 20C, a.t.m(past midnight). makes you wonder...........

Maybe cause, there is one thing, which I never did change, my cache setting of 4 days and I've never been completely out of SETI work! (Well, maybe
one time.) B.t.w. I'm going way too fast, looking at deadlines of a month...,
but a lot of the 'forum visitors and writers', do have a much higher throughput, also widely known and famous as R(ecent)A(verage)C(redit), since
the LUNATICs, whats else could you expect, made an CPU app., doing almost twice
the stock throughput and last, but not least, CUDA 'fell out of the sky' inmediatly
taken care of, again by the notorious LUNATICs, produced, after weeks of non-stop,
hard work, a beatifull shiny new app. x32F, included in the widely known,
Universal Installer 0.37.................. To be continued soon.......
Sorry for making such a fuzz........

(Maybe start, my 'other two' the crunching rigs, an i7-2600(+2xEAH5870+AP ATIrev516 app) and a Q6600+GTS250(SSSE3 CPU& CUDA x32f), before I go to sleep, hoping it'll cool down a bit during the night.)

Gonna 'give a few cores' to CPDN, since our weather/climate seems to
be more rapidly changing, then accounted for, so to speak.......


But, ehh, right, SERVERS (&) PROBLEMS, well, they aren't wanted, or needed,
or included, nor payed for, but somebody has to check, chase Murphy and stop him!

____________

Cosmic_Ocean
Avatar
Send message
Joined: 23 Dec 00
Posts: 2266
Credit: 8,686,024
RAC: 4,192
United States
Message 1100162 - Posted: 24 Apr 2011, 2:47:47 UTC

Those download bursts are batches of APs being issued by the scheduler. Unlike MB which is available all the time, AP gets issues in batches, apparently. Means once you see one of those surges, that's the time to request work for APs. The quiet period between them, it is fairly unlikely to get any.
____________

Linux laptop uptime: 1484d 22h 42m
Ended due to UPS failure, found 14 hours after the fact

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 12 · Next

Message boards : Number crunching : Panic Mode On (46) Server problems

Copyright © 2014 University of California