Panic Mode On (56) Server problems?

Message boards : Number crunching : Panic Mode On (56) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 12 · Next

AuthorMessage
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22439
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1156812 - Posted: 28 Sep 2011, 13:00:30 UTC

Stuffed to zero in the food bucket...

Well, I've got about 400 in the cache so I'll just sit tight until they run out, then I'll turn something off and same a bit of CO2...
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1156812 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14672
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1156829 - Posted: 28 Sep 2011, 13:58:20 UTC

My GTX 470 (fast, but far from the fastest) has been working through a block of shorties, and is at DCF 0.0540 - that's even after the five-fold raise in the cap last night. I'm beginning to think we may need two more interim steps, not just one.
ID: 1156829 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1156830 - Posted: 28 Sep 2011, 14:03:29 UTC - in response to Message 1156775.  

Quick question just received 30 short WU,s after outage all resend time outs for 27/09/2011

Should I leave these for an hour or two to let the replica data base catch up or will they be okay to crunch. (13,358 seconds)

Deadline for these 11/10/2011

Michael

The replica database is just that, a replica. It's status has no bearing on the activity of the project. It is used to supply data to the Stats sites and the web pages in order to take some load off the main db. So, if you got 'em, crunch 'em.

T.A.
ID: 1156830 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1156841 - Posted: 28 Sep 2011, 14:44:38 UTC
Last modified: 28 Sep 2011, 14:47:05 UTC

Looks like something hit a snag...
Crickets have stopped singing.
The AP splitters appear to have stopped producing, even though they show as running as of the last server status update (4 minutes ago).

The GPUs on my top rig have gone cold.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1156841 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1156856 - Posted: 28 Sep 2011, 16:59:19 UTC
Last modified: 28 Sep 2011, 17:14:58 UTC

Jeff just posted in the News thread...
The workunit storage server crashed, and that's why no work is being generated at the moment.

The kitties will crunch what they have in the kibble bowls for now.

Wish him luck getting it sorted and back online.

Meeeouch.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1156856 · Report as offensive
Profile S@NL Etienne Dokkum
Volunteer tester
Avatar

Send message
Joined: 11 Jun 99
Posts: 212
Credit: 43,822,095
RAC: 0
Netherlands
Message 1156879 - Posted: 28 Sep 2011, 19:56:34 UTC - in response to Message 1156876.  

Extremely well behaved system now. I haven't experienced this smooth behaviour when it comes to uploading, reporting, requesting and getting new WU's, in many many months, if not years.

My computers immediately get what they ask for, and my caches are filled to the top with 10 days worth of work.


sarcasm mode on ? because everybody else gets the "server down for maintenance" response...
ID: 1156879 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1156885 - Posted: 28 Sep 2011, 20:26:27 UTC

I got a bunch of APs this morning before the server crashed. Four didn't make it through the pipe before the server went down though. Going to suspend comms until that problem gets fixed.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1156885 · Report as offensive
Blake Bonkofsky
Volunteer tester
Avatar

Send message
Joined: 29 Dec 99
Posts: 617
Credit: 46,383,149
RAC: 0
United States
Message 1156888 - Posted: 28 Sep 2011, 20:31:54 UTC - in response to Message 1156885.  
Last modified: 28 Sep 2011, 20:32:24 UTC

I got a bunch of APs this morning before the server crashed. Four didn't make it through the pipe before the server went down though. Going to suspend comms until that problem gets fixed.


I have 41 MB's stuck in download as well. Do these downloads eventually just time out, or will they keep trying until either the servers come back up? I'm not going to abort them, as I can't report to the scheduler anyway, I was just wondering.
ID: 1156888 · Report as offensive
Andre Howard
Volunteer tester
Avatar

Send message
Joined: 16 May 99
Posts: 124
Credit: 217,463,217
RAC: 0
United States
Message 1156889 - Posted: 28 Sep 2011, 20:38:06 UTC - in response to Message 1156888.  

They will will download when the site comes back up

ID: 1156889 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1156890 - Posted: 28 Sep 2011, 20:40:29 UTC - in response to Message 1156888.  

I have also still a few backlogged DLs in the transfers overview in my BOINC.
Nothing to worry about.
If the server will be again reachable, they will come down.

;-)


- Best regards! - Sutaru Tsureku, team seti.international founder. - Optimize your PC for higher RAC. - SETI@home needs your help. -
ID: 1156890 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1156891 - Posted: 28 Sep 2011, 20:42:54 UTC - in response to Message 1156885.  

I got a bunch of APs this morning before the server crashed. Four didn't make it through the pipe before the server went down though. Going to suspend comms until that problem gets fixed.

I've also got a few AP stuck in download,

Claggy
ID: 1156891 · Report as offensive
Profile Akio
Avatar

Send message
Joined: 18 May 11
Posts: 375
Credit: 32,129,242
RAC: 0
United States
Message 1156908 - Posted: 28 Sep 2011, 23:04:33 UTC - in response to Message 1156891.  

My luck finally ran out, hehe. A few AP's stuck in download, and no other tasks. ah well, I know they'll be more work soon ;)
ID: 1156908 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6657
Credit: 121,090,076
RAC: 0
United States
Message 1156911 - Posted: 28 Sep 2011, 23:12:31 UTC

I have several AP units and a few others stuck in download. I set for NNT, and will crunch these when they get here, but I unleashed Einstein to build a full cache. The only problem is that with Einstein, hyperthreading makes a big difference, but with SETI, it gives me ragged GPU usage. I don't have than many Seti GPU units in download, so I may just go ahead and make the switch. It may mean backing off the clock speed for a bit, but I intend to do a lot of science until Seti can sustain my rig for more than a few minutes at a time.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1156911 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1156912 - Posted: 28 Sep 2011, 23:15:35 UTC - in response to Message 1156890.  

I have also still a few backlogged DLs in the transfers overview in my BOINC.
Nothing to worry about.
If the server will be again reachable, they will come down.

;-)


O.K. - and now the DL work..

;-)


- Best regards! - Sutaru Tsureku, team seti.international founder. - Optimize your PC for higher RAC. - SETI@home needs your help. -
ID: 1156912 · Report as offensive
Andre Howard
Volunteer tester
Avatar

Send message
Joined: 16 May 99
Posts: 124
Credit: 217,463,217
RAC: 0
United States
Message 1156913 - Posted: 28 Sep 2011, 23:16:14 UTC

Jeff just posted this in the news section..........

The box is back up and the RAID is resync'ing. I have the project up to clear some of the queues but the feeder is idling until the resync is complete.

ID: 1156913 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1156918 - Posted: 28 Sep 2011, 23:37:04 UTC

Cricket is maxed again. I let comms happen again and all four of my stuck APs began instantly.. at BLAZING speed.


Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1156918 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6657
Credit: 121,090,076
RAC: 0
United States
Message 1156919 - Posted: 28 Sep 2011, 23:45:41 UTC
Last modified: 28 Sep 2011, 23:46:19 UTC

Wouldn't you know. Right after I reconfigured for Einstein. I'll still wait a few days and see how it goes. If Seti holds strong, I'll switch back, but it would be helpful if I could build a full cache, which may take a couple of weeks acording to what Richard has indicated. I'm happy either way, as the more science I do, the happier I am!

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1156919 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1156937 - Posted: 29 Sep 2011, 1:55:44 UTC

I just connected to Rosetta so I can keep the CPU warm as I was out of CPU work. The GPU still has a few more units until it goes back to Einstein.

ID: 1156937 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 1156942 - Posted: 29 Sep 2011, 2:57:46 UTC - in response to Message 1156919.  

Periodically I feel compelled to make a suggestion. I have interest in only Einstein and Seti, with Seti as Numero Uno. Someone suggested to me that I set Einstein to have a resource share of 0 and to set seti at 100. Now, if seti is down long enough to purge my cache, then einstein fills the bill. But not until then. I have to be empty for seti before Einstein is asked for work. Seems to work.

Pretty soon, I'll be able to get some more work from einstein, it appears.

(BTW, any resource setting other than zero seems to let boinc work its wicked ways and produces balances that I don't want. The resource share has to be zero to be a true backstop.)
ID: 1156942 · Report as offensive
archae86

Send message
Joined: 31 Aug 99
Posts: 909
Credit: 1,582,816
RAC: 0
United States
Message 1156946 - Posted: 29 Sep 2011, 3:16:21 UTC - in response to Message 1156942.  

The resource share has to be zero to be a true backstop.)
Even then it may do something most folks might not want: load up your cache with so much work that once SETI resumes you'll face the choice between either trashing a large amount of Einstein work or forgoing a good bit of potential SETI output.

So I think folks trying this scheme who have queue lengths longer than a very few hours may be happier if they radically cut their requested queue time when they spot SETI work dropping below their danger level. Einstein has pretty good up time, so settings longer than just a couple of hours are seldom needed just to keep in work.

I agree that project requested balances such as 99 to 1 generate weird behaviors most users would not like, at least on versions I've tried on my hosts.

ID: 1156946 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 12 · Next

Message boards : Number crunching : Panic Mode On (56) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.