Panic Mode On (19) Server problems

Message boards : Number crunching : Panic Mode On (19) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 11 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 914211 - Posted: 5 Jul 2009, 3:20:41 UTC - in response to Message 914189.  


Promising signs.
Download traffic is still very high, but upload traffic is still slowly increasing. Number of results returned per hour continues to climb. Still a few hour till the usual 24:00 traffic surge so hopefully things might be almost normal by then.
Grant
Darwin NT
ID: 914211 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 914222 - Posted: 5 Jul 2009, 3:43:47 UTC - in response to Message 914153.  

Thanks Ned, these days, SETI is somewhere about active project 8 or 9 for me, after what are basically steadier and more research oriented projects (Milkyway, GPUGrid, Spinhenge, POEM, Climate, Einstein and Rosetta) about matched with World Grid. So the seemingly regular issues here, be they upload/download, the Tuesday 4 to 8 hour outage, or any of a list of other issues that periodically haunt SETI don't cause near the level of angst that it would if it were higher up my CPU/GPU cycles food chain.

I crunch SETI@Home, SETI@Home beta, and BOINC alpha.

... and frankly, I don't feel any angst. I generally keep a 4 day cache. I'm limiting BOINC to one hour of network access per day, and I've got plenty to carry me over.

I don't have any machines built just for BOINC, so I don't feel like I've invested a lot of money. I have sent 'em a little bit of cash to help keep the lights on at SETI-central, and that's about it.
ID: 914222 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 914234 - Posted: 5 Jul 2009, 4:32:22 UTC - in response to Message 914222.  


All of my older uploads have finally gone though & new work has downloaded. Of course now work that's completing isn't getting through, but at least the network traffic shows plenty still being downloaded & uploads per hour still climbing.

One odd thing though.
A numbber of results start to upload, but feeze on 16kB uploaded. Then they start from scrath when they next retry.
Grant
Darwin NT
ID: 914234 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 914238 - Posted: 5 Jul 2009, 4:56:43 UTC - in response to Message 914032.  
Last modified: 5 Jul 2009, 4:57:34 UTC

56 more minutes and the first of my farm goes offline permanently - that's 2 WU's a day the supercrunchers won't have to compete for. The next one exits tomorrow morning.
No wingmen will be left stranded!

Hal.........give it a break....
My 'supercrunchers' have a bit more to do than yours........
If you run out of work.........so do I.

Take a break in the outhouse.....LOL.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 914238 · Report as offensive
Profile TCP JESUS
Avatar

Send message
Joined: 19 Jan 03
Posts: 205
Credit: 1,248,845
RAC: 0
Canada
Message 914245 - Posted: 5 Jul 2009, 5:20:06 UTC

Now, the race is on to finish uploading before all the current work is gone.....

My main machine finally ran out of work about 2 hours ago or so......and I can't wait to fill a 10 day cache this time ;)
I am TCP JESUS...The Carpenter Phenom Jesus....and HAMMERING is what I do best!
formerly known as...MC Hammer.
ID: 914245 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 914256 - Posted: 5 Jul 2009, 6:06:44 UTC - in response to Message 914245.  
Last modified: 5 Jul 2009, 6:08:29 UTC

Now, the race is on to finish uploading before all the current work is gone.....

My main machine finally ran out of work about 2 hours ago or so......and I can't wait to fill a 10 day cache this time ;)

Not good news here in kittyland either........

Don't quite understand why the project is asking for new participants when they can't support the ones they've got.

Sorry.......Matt......Eric......can you answer me this one?

The kitties wanna know........
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 914256 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19062
Credit: 40,757,560
RAC: 67
United Kingdom
Message 914257 - Posted: 5 Jul 2009, 6:10:52 UTC

My Q6600 has just got to the 2 * #cpu's point on uploads and therefore downloaded some tasks.
I was down to the last eight tasks, either crunching or waiting to crunch, I now have 193 tasks two of which are AP's. About 99% of the remaining 191 are VHAR quickies.
ID: 914257 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 914258 - Posted: 5 Jul 2009, 6:15:14 UTC - in response to Message 914222.  

OK -- that's one, how many active users are there? <smile>.


I crunch SETI@Home, SETI@Home beta, and BOINC alpha.

... and frankly, I don't feel any angst.


ID: 914258 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 914259 - Posted: 5 Jul 2009, 6:21:17 UTC - in response to Message 914256.  

Right, now if they were asking for folks to participate in one (or more) BOINC projects in general, that would seem to make more sense -- except perhaps in a SETI-centric BOINC context -- though even there, more folks in multi-project mode (including current SETI only folks) would strike me as the way to go. I am not sure what proportion of the 325K plus 'active users' for BOINC projects are multi-project.





Don't quite understand why the project is asking for new participants when they can't support the ones they've got.

Sorry.......Matt......Eric......can you answer me this one?

The kitties wanna know........


ID: 914259 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 914267 - Posted: 5 Jul 2009, 6:57:43 UTC - in response to Message 914258.  

OK -- that's one, how many active users are there? <smile>.

I asked a while back, and was given some statistics on how many SETI@Home participants post to the forums. I've lost the exact numbers, but if memory serves, it's less than 9,000. BOINCstats reports 183,000 active users.

So call that 1 in 20 in round numbers.

I suspect that most of those who don't care enough to post also don't care enough to build specialized crunchers, especially since the active host to active user ratio is about 1.6.

But the simple fact of the matter is, I don't get angry because it won't make a difference. Problems will be resolved at about the same speed either way.

So, why waste the effort?

ID: 914267 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 914269 - Posted: 5 Jul 2009, 7:01:38 UTC - in response to Message 914259.  

Right, now if they were asking for folks to participate in one (or more) BOINC projects in general, that would seem to make more sense -- except perhaps in a SETI-centric BOINC context -- though even there, more folks in multi-project mode (including current SETI only folks) would strike me as the way to go. I am not sure what proportion of the 325K plus 'active users' for BOINC projects are multi-project.

From a BOINC point of view, their users are projects, not individual crunchers.

It's also hard to sell someone like me on protein folding or climate prediction because they just don't stir the imagination.

ID: 914269 · Report as offensive
Profile TCP JESUS
Avatar

Send message
Joined: 19 Jan 03
Posts: 205
Credit: 1,248,845
RAC: 0
Canada
Message 914271 - Posted: 5 Jul 2009, 7:03:47 UTC - in response to Message 914267.  

So, why waste the effort?


But Ned, how about the effort that you waste trying to convince everyone that you are not wasting the effort ?
I am TCP JESUS...The Carpenter Phenom Jesus....and HAMMERING is what I do best!
formerly known as...MC Hammer.
ID: 914271 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 914272 - Posted: 5 Jul 2009, 7:11:23 UTC - in response to Message 914271.  

So, why waste the effort?


But Ned, how about the effort that you waste trying to convince everyone that you are not wasting the effort ?

I don't care if anyone believes that I'm not upset.

What I hope is that others will consider their position and put that effort to something more productive.

Instead of getting mad, I'm trying to figure out a practical way to fix it.

I've got some ideas. I need a development environment: that's next.
ID: 914272 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 914279 - Posted: 5 Jul 2009, 7:52:18 UTC - in response to Message 914272.  

So, why waste the effort?


But Ned, how about the effort that you waste trying to convince everyone that you are not wasting the effort ?

I don't care if anyone believes that I'm not upset.

What I hope is that others will consider their position and put that effort to something more productive.

Instead of getting mad, I'm trying to figure out a practical way to fix it.

I've got some ideas. I need a development environment: that's next.

Warning totally off topic
Is that Molly helping you?

Back on topic
Best of luck on your next project.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 914279 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 914283 - Posted: 5 Jul 2009, 8:07:30 UTC

Next subject..........

New color.......

Next raality.........

WTF........

Where are the Fish?


Really. Where ARE the freakin' fish?

No hope........no numbers........no nothing............

Yes, I will crunch forever........because I must......or choose to


And I hope the rest of you will too............
Hon.........it's a lot.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 914283 · Report as offensive
Ianab
Volunteer tester

Send message
Joined: 11 Jun 08
Posts: 732
Credit: 20,635,586
RAC: 5
New Zealand
Message 914294 - Posted: 5 Jul 2009, 8:57:02 UTC

It would be relatively simple for the Seti staff to keep things running smoothly at their end, simply by throttling the splitters and schedulers to keep the bandwidth and database size down to levels that the system works 99.9%reliably.

BUT, doing that is going to seriously slow down the work rate. Then we are all going to be short of work units, all the time.

I believe the guys have tweaked the system (and continue to) to give the maximum throughput, maybe they are only getting 80-90% of the theoretical maximunm because of various TEMPORARY overloads and crashes. But throttling so the system is 99.9% reliable might take the throughput back to maybe only 50% of the max.

Then watch the wailing and moaning start ;-)

Now for all the grief and problems over the last week, my main cruncher (Q6600)has been down to about 1 day of work (from a 3 day cache). Sure I've have upload re-tries, and "no work available" messages by the dozens. Those are temporary. heck you might even run out of work for a few hours.

It's also about YOUR expectations, if you expect the servers to be 90% reliable, then you should still be quite happy.

Relax and let the system work.

Ian
ID: 914294 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 914299 - Posted: 5 Jul 2009, 9:32:47 UTC - in response to Message 914294.  


...

Relax and let the system work.

Ian

Posters on this topic can be split into 2 categories:

For those who complain about the performance of the system, I would agree with your sentiment. Others have an "engineering" bent to their nature (whether or not it is included in their job title) and they are genetically incapable of relaxing when observing a system that seems to be performing sub-optimally. They have to poke it, tweak it, try to understand how it works and how it can be made to work "better". I know this for a fact...

F.
ID: 914299 · Report as offensive
Ianab
Volunteer tester

Send message
Joined: 11 Jun 08
Posts: 732
Credit: 20,635,586
RAC: 5
New Zealand
Message 914302 - Posted: 5 Jul 2009, 10:22:42 UTC - in response to Message 914299.  


...

Relax and let the system work.

Ian

Posters on this topic can be split into 2 categories:

For those who complain about the performance of the system, I would agree with your sentiment. Others have an "engineering" bent to their nature (whether or not it is included in their job title) and they are genetically incapable of relaxing when observing a system that seems to be performing sub-optimally. They have to poke it, tweak it, try to understand how it works and how it can be made to work "better". I know this for a fact...

F.


Ohh. I have an Engineering bent :-)

I like looking at how the system works, and maybe I would adjust things differently. It might work better, but probably not.

Although I work on smaller systems, my clients have similar issues. Disk space, response speed, network throughput etc. I look at the system and make recomendations as to what to buy to fix the issues. But they need to spend money to improve their systems. When money is limited, you tweak what you have to do the best you can.

The other thing is being old and wise enough to know what you can fix, and do something about it, or just accept what you cant. ;-)

Ian
ID: 914302 · Report as offensive
Profile [AF>france>pas-de-calais]symaski62
Volunteer tester

Send message
Joined: 12 Aug 05
Posts: 258
Credit: 100,548
RAC: 0
France
Message 914313 - Posted: 5 Jul 2009, 12:43:24 UTC

05/07/2009 14:37:00 SETI@home Sending scheduler request: To fetch work.
05/07/2009 14:37:00 SETI@home Requesting new tasks for CPU
05/07/2009 14:37:05 SETI@home Scheduler request completed: got 15 new tasks

^^ boinc 6.6.36 version


SETI@Home Informational message -9 result_overflow
with a general handicap of 80% and it makes much d' efforts for the community and s' expimer, thank you d' to be understanding.
ID: 914313 · Report as offensive
Profile DPRGI - Luivul

Send message
Joined: 24 Jan 03
Posts: 17
Credit: 20,639,801
RAC: 0
Italy
Message 914341 - Posted: 5 Jul 2009, 15:18:43 UTC

I get 200 of very fast (15 min) MB wu this wu have the same size of a normal and was returned fast too increasing the network traffic. why the splitter produce it?
ID: 914341 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 11 · Next

Message boards : Number crunching : Panic Mode On (19) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.