Panic Mode On (7) Server Problems! Closed for Renovation

Keith T.
Volunteer tester
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 739784 - Posted: 16 Apr 2008, 11:59:10 UTC

Last year's Thumper outage lasted about 12-14 days, including catch-up time.

It started on 1st May 2007 and ended around 12 May 2007.

That was an exceptional outage. The most recent one was about 42 hours, i.e. less than 2 days.

BOINC is designed to cope with whole-project outages. OK, some of you want to run SETI only. That is your choice. I have SETI as my highest resource share, but I also manage to run 8 other projects (including SETI Beta). It should be 9, but I just joined orbit@home and they don't currently have any WUs.

I run a 2.5 to 3 day cache across all projects, which usually gives me about 2 - 5 SETI WUs in reserve. Recent SETI WU crunch times have varied between 2 and 6.5 hours. It works for me.

If you are running SETI only, I don't see any good reason to go beyond a 5 day cache; 3 days should be enough to cover most normal outages.

Even if you got a whole bunch of -9 overflows, you would be very unlucky to get more than 10 in a row.

I think some people run the big caches because they see that option and just "go for the biggest". If the software allowed it, they would probably cache 21 or 30 days' worth of WUs!

Just my thoughts. BTW, this thread seems to have gone a bit off topic.
Sir Arthur C Clarke 1917-2008
ID: 739784 · Report as offensive
SATAN
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 739791 - Posted: 16 Apr 2008, 12:34:06 UTC

Let's just hope NEZ's timings coincide with a splitter problem. If his machines decide to request 200,000 or so units at once, then we could see a lot of issues. His RAC has tripled since I joined the project, so God knows what new machines he has managed to get working on BOINC. The guys are still having splitter problems now; if this spreads, then we could see another outage.
ID: 739791 · Report as offensive
littlegreenmanfrommars
Volunteer tester
Joined: 28 Jan 06
Posts: 1410
Credit: 934,158
RAC: 0
Australia
Message 739801 - Posted: 16 Apr 2008, 13:13:03 UTC - in response to Message 739759.  

I run a 10 day cache, as I have a pretty fast machine. It hurts no-one, as the results are returned pretty darn fast.



You may think you are processing them quickly, but all you are doing is processing WUs that have been already waiting for 10 days on your machine for their number to come up.

Since I have a pending file nearly three times as much as my machines can process in a day, I'd say I'm getting stuff done faster than the rest of the quorum, in most cases. The 10 day cache isn't holding anyone up.

You still return your WUs 10 days later than you got them, and your machine's speed is totally irrelevant.

See above.

I hate to sound rude, because this isn't meant to be rude.
I think you're wide of the mark.
However, I shall respect your right to have an opinion.

Respectfully,

lgm
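
The disagreement in this exchange comes down to queueing arithmetic. A rough sketch follows, assuming the client works through its cache strictly first-in-first-out, keeps the cache topped up to `cache_days` of work, and that every work unit costs the same. These are hypothetical simplifications, not how BOINC actually schedules work, and `turnaround_days` is an illustrative name only.

```python
def turnaround_days(cache_days: float, wu_hours: float) -> float:
    """Days between downloading a work unit and returning it.

    Assumes a strict first-in-first-out cache that is always kept
    topped up to `cache_days` of work, with identical work units.
    """
    # A freshly downloaded unit waits behind a full cache of older work,
    # then needs its own crunch time.
    return cache_days + wu_hours / 24.0


if __name__ == "__main__":
    for label, wu_hours in (("fast host", 2.0), ("slow host", 6.5)):
        for cache_days in (3, 10):
            days = turnaround_days(cache_days, wu_hours)
            print(f"{label}, {cache_days}-day cache: returned after ~{days:.1f} days")
```

Under those assumptions the host's speed only changes the last few hours, and the cache setting dominates the turnaround. Whether that actually holds anyone up depends on how quickly the other hosts in the quorum report, which is the point lgm is making.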
ID: 739801 · Report as offensive
W-K 666 Project Donor
Volunteer tester
Joined: 18 May 99
Posts: 19547
Credit: 40,757,560
RAC: 67
United Kingdom
Message 739806 - Posted: 16 Apr 2008, 13:31:22 UTC

With seven-day deadlines on VHAR units, can you actually get a 10-day cache? I wouldn't have thought so.

The last time we looked at the spread of units across the AR spectrum, VHARs made up 30% of all units, but only ~10% of crunching time. The percentage of VHARs may have fallen since then, but not by much.
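
As a rough worked example of what those two percentages imply, the sketch below derives the average crunch time of a VHAR relative to every other unit. It uses only the 30% and ~10% figures quoted above; the function name is made up for illustration.

```python
def relative_unit_cost(share_of_units: float, share_of_time: float) -> float:
    """Average crunch time of one class of unit relative to all other units."""
    # Time per unit for this class, in arbitrary units.
    cost_this = share_of_time / share_of_units
    # Time per unit for everything else, in the same arbitrary units.
    cost_rest = (1.0 - share_of_time) / (1.0 - share_of_units)
    return cost_this / cost_rest


if __name__ == "__main__":
    ratio = relative_unit_cost(share_of_units=0.30, share_of_time=0.10)
    print(f"A VHAR takes about {ratio:.2f}x as long as a non-VHAR unit")
```

On those figures a VHAR takes roughly a quarter of the time of a typical other unit, so a cache measured in days holds many more of them.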
ID: 739806 · Report as offensive
AndyW Project Donor
Volunteer tester
Joined: 23 Oct 02
Posts: 5862
Credit: 10,957,677
RAC: 18
United Kingdom
Message 739816 - Posted: 16 Apr 2008, 14:11:57 UTC - in response to Message 739806.  

With seven-day deadlines on VHAR units, can you actually get a 10-day cache? I wouldn't have thought so.




Any WUs with a short deadline run as "High Priority" in BOINC, so in effect they jump the queue. In theory the cache would stay at 10 days, as you are never going to get a run of hundreds of VHAR units.
ID: 739816 · Report as offensive
W-K 666 Project Donor
Volunteer tester
Joined: 18 May 99
Posts: 19547
Credit: 40,757,560
RAC: 67
United Kingdom
Message 739819 - Posted: 16 Apr 2008, 14:15:45 UTC - in response to Message 739816.  

With seven-day deadlines on VHAR units, can you actually get a 10-day cache? I wouldn't have thought so.




Any WUs with a short deadline run as "High Priority" in BOINC, so in effect they jump the queue. In theory the cache would stay at 10 days, as you are never going to get a run of hundreds of VHAR units.

But as soon as you are in EDF, you are inhibited from downloading more units.
And were you not here at the end of last December? Until Matt changed the splitter sequence, we had nothing but VHARs.
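
To make the two points in this exchange concrete, here is a toy model. It is not BOINC's actual scheduler. It assumes a single CPU, a queue of hypothetical `Task` records, and two simplified rules: if running the queue in download order would miss a deadline, the client switches to earliest-deadline-first (EDF), and while it is in that mode it asks for no new work.

```python
from typing import List, NamedTuple, Tuple


class Task(NamedTuple):
    name: str
    hours: float      # remaining crunch time
    deadline: float   # hours from now until the report deadline


def misses_deadline(queue: List[Task]) -> bool:
    """True if running `queue` in the given order would miss any deadline."""
    clock = 0.0
    for task in queue:
        clock += task.hours
        if clock > task.deadline:
            return True
    return False


def plan(queue: List[Task]) -> Tuple[List[Task], bool]:
    """Return (run_order, fetch_more_work) under the toy rules above."""
    if not misses_deadline(queue):
        return list(queue), True            # normal mode: keep download order
    edf = sorted(queue, key=lambda t: t.deadline)
    return edf, False                       # EDF mode: no new downloads


if __name__ == "__main__":
    # Nine days of long units queued ahead of one VHAR with a 7-day deadline.
    queue = [Task(f"long-{i}", 72.0, 30 * 24.0) for i in range(3)]
    queue.append(Task("vhar", 1.5, 7 * 24.0))
    order, fetch = plan(queue)
    print([t.name for t in order], "fetch more work:", fetch)
```

In this model both posters are partly right: the VHAR does jump the queue, but the host stops topping up its cache until the deadline pressure is gone, so a long enough run of short-deadline units would eat into a 10 day cache.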
ID: 739819 · Report as offensive
AndyW Project Donor
Volunteer tester
Joined: 23 Oct 02
Posts: 5862
Credit: 10,957,677
RAC: 18
United Kingdom
Message 739826 - Posted: 16 Apr 2008, 14:37:47 UTC - in response to Message 739819.  



And were you not here at the end of last December? Until Matt changed the splitter sequence, we had nothing but VHARs.



I missed that fun as I had a 2 year break from SETI after moving house and selling all my belongings. Only started crunching again in February this year.

Nothing but VHARs? How on Earth did the servers/network stand up to that hammering... or didn't they?!
ID: 739826 · Report as offensive
W-K 666 Project Donor
Volunteer tester
Joined: 18 May 99
Posts: 19547
Credit: 40,757,560
RAC: 67
United Kingdom
Message 739843 - Posted: 16 Apr 2008, 15:35:53 UTC - in response to Message 739826.  
Last modified: 16 Apr 2008, 15:44:12 UTC



And were you not here at the end of last December? Until Matt changed the splitter sequence, we had nothing but VHARs.



I missed that fun as I had a 2 year break from SETI after moving house and selling all my belongings. Only started crunching again in February this year.

Nothing but VHARs? How on Earth did the servers/network stand up to that hammering... or didn't they?!

The servers did quite well. You can see on the yearly Cricket graphs that the average comms rates were at about their highest between ~20th Dec and ~5th Jan.

Matt's Tech News post where he first notices the high traffic volumes and lots of VHARs is post 696817, Happy 2454466.5! (Jan 02 2008)
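
The number in that post title looks like a Julian Date. A quick check, assuming the standard convention that JD 2451544.5 corresponds to 2000-01-01 00:00 UTC (the helper name is made up for illustration):

```python
from datetime import datetime, timedelta

JD_2000_01_01_MIDNIGHT = 2451544.5  # Julian Date of 2000-01-01 00:00 UTC


def julian_to_datetime(jd: float) -> datetime:
    """Convert a Julian Date to a naive UTC datetime."""
    return datetime(2000, 1, 1) + timedelta(days=jd - JD_2000_01_01_MIDNIGHT)


if __name__ == "__main__":
    print(julian_to_datetime(2454466.5))
```

That works out to midnight UTC on 1 January 2008, so the title reads as a New Year greeting.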
ID: 739843 · Report as offensive
Andy Lee Robinson
Joined: 8 Dec 05
Posts: 630
Credit: 59,973,836
RAC: 0
Hungary
Message 739865 - Posted: 16 Apr 2008, 16:29:40 UTC - in response to Message 739762.  

Now, does anyone see the logic of why *unnecessarily* large caches are not good?


Please forgive me when I am on holiday next week. I intend to disconnect from the Internet but leave the machine slowly crunching away. I shall need about 8 days of cache. If I was away for 2 weeks then I should want to have 14 days' worth. I suspect there may be a fair amount of that during the summer days and not just people trying to avoid S@H downtime.


Which bit of *unnecessarily* didn't you understand? You don't need my forgiveness!
That's a perfectly valid use of the cache.
ID: 739865 · Report as offensive
Andy Lee Robinson
Joined: 8 Dec 05
Posts: 630
Credit: 59,973,836
RAC: 0
Hungary
Message 739874 - Posted: 16 Apr 2008, 16:42:41 UTC - in response to Message 739801.  

I hate to sound rude, because this isn't meant to be rude.
I think you're wide of the mark.


No rudeness inferred! I'd die for your right to say I'm wide of the mark, but I also respectfully don't agree with you.
I am looking at the process from a system and resource allocation perspective, instead of limiting my horizons to the extent of my cache!

If everyone behaved this way at the dinner table the cook would be trying to make room for supplies all the time instead of actually doing the cooking, while those that want to take as many cookies as they can in case they're hungry later cause some to go without! ...or something like that.

Just good manners - take what you need when you need it, and keep enough for downtime, not more unless you only have intermittent net access.

Andy.
ID: 739874 · Report as offensive
Logan
Volunteer tester
Joined: 26 Jan 07
Posts: 743
Credit: 918,353
RAC: 0
Spain
Message 740083 - Posted: 16 Apr 2008, 22:07:19 UTC
Last modified: 16 Apr 2008, 22:15:32 UTC

Upppssss....!

Uploads aren't working. (Downloads are working fine, for the moment...)


Best regards.
Logan.

BOINC FAQ Service (Ahora, también disponible en Español/Now available in Spanish)
ID: 740083 · Report as offensive
SATAN
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 740101 - Posted: 16 Apr 2008, 22:25:35 UTC

You're not alone, Logan.
ID: 740101 · Report as offensive
Logan
Volunteer tester
Joined: 26 Jan 07
Posts: 743
Credit: 918,353
RAC: 0
Spain
Message 740105 - Posted: 16 Apr 2008, 22:29:39 UTC - in response to Message 740101.  
Last modified: 16 Apr 2008, 22:39:39 UTC

You're not alone, Logan.


And the server status page is frozen ...

But the Cricket graphs say all is OK... (for the moment... :))



[edit]Upppssss!!!![/edit]
Logan.

BOINC FAQ Service (Ahora, también disponible en Español/Now available in Spanish)
ID: 740105 · Report as offensive
Logan
Volunteer tester
Joined: 26 Jan 07
Posts: 743
Credit: 918,353
RAC: 0
Spain
Message 740126 - Posted: 16 Apr 2008, 22:55:22 UTC - in response to Message 740105.  

You're not alone, Logan.


And the server status page is frozen ...

But the Cricket graphs say all is OK... (for the moment... :))



[edit]Upppssss!!!![/edit]


Well, it seems that everything is working again, Satan.

Can you upload without problems?


Logan.

BOINC FAQ Service (Ahora, también disponible en Español/Now available in Spanish)
ID: 740126 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 741823 - Posted: 20 Apr 2008, 9:00:12 UTC


On Scarecrow's Graphs the Results waiting for validation appears to have flatlined, everything else appears to be turning over nicely.
Grant
Darwin NT
ID: 741823 · Report as offensive
Scarecrow
Joined: 15 Jul 00
Posts: 4520
Credit: 486,601
RAC: 0
United States
Message 742657 - Posted: 22 Apr 2008, 5:48:05 UTC - in response to Message 741823.  
Last modified: 22 Apr 2008, 5:48:43 UTC


On Scarecrow's Graphs the Results waiting for validation appears to have flatlined, everything else appears to be turning over nicely.

According to the Data Table it hasn't truly flatlined, but it sure isn't moving very much.
ID: 742657 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 742698 - Posted: 22 Apr 2008, 8:51:53 UTC - in response to Message 742657.  


On Scarecrow's Graphs the Results waiting for validation appears to have flatlined, everything else appears to be turning over nicely.

According to the Data Table it hasn't truly flatlined, but it sure isn't moving very much.

Yeah, I was just looking at that one again, as well as the In Progress. I can't remember ever seeing such steady numbers.
Grant
Darwin NT
ID: 742698 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 743053 - Posted: 23 Apr 2008, 7:04:49 UTC


Looks like some of the server stats didn't survive the weekly outage. 14 hours since the last update for many of them.
Grant
Darwin NT
ID: 743053 · Report as offensive
Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 743057 - Posted: 23 Apr 2008, 7:17:06 UTC - in response to Message 743053.  


Looks like some of the server stats didn't survive the weekly outage. 14 hours since the last update for many of them.

Hmmmm! Says 13 hours on my screen??!!

F.
ID: 743057 · Report as offensive
Scarecrow
Joined: 15 Jul 00
Posts: 4520
Credit: 486,601
RAC: 0
United States
Message 743066 - Posted: 23 Apr 2008, 9:04:34 UTC - in response to Message 743053.  
Last modified: 23 Apr 2008, 9:05:01 UTC


Looks like some of the server stats didn't survive the weekly outage. 14 hours since the last update for many of them.

Now it's a flatline.

t-minus 16 hours and.... just sort of laying there limp. :)
ID: 743066 · Report as offensive
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.