Panic Mode On (38) Server problems

Message boards : Number crunching : Panic Mode On (38) Server problems
Profile soft^spirit
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1036044 - Posted: 25 Sep 2010, 11:52:25 UTC - in response to Message 1036041.  

What happened to S@H being able to spread out storage amongst the clients? To my knowledge, that feature has never been used. There are countless petabytes of storage space available that they could use.


I have asked the same thing and gotten an answer that did make sense: bandwidth issues. For safety it would need to be sent and stored on multiple PCs. I am guessing 2 would not be enough due to the transient nature of many crunchers.
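
A rough back-of-the-envelope sketch of that guess, assuming each host is online independently with the same probability. The 50% figure and the function names are made up for illustration; they are not S@H numbers.

```python
import math


def availability(p_online: float, replicas: int) -> float:
    """Chance that at least one of `replicas` independent hosts is online."""
    return 1.0 - (1.0 - p_online) ** replicas


def replicas_needed(p_online: float, target: float) -> int:
    """Smallest replica count whose availability reaches `target`."""
    if not (0.0 < p_online < 1.0 and 0.0 < target < 1.0):
        raise ValueError("p_online and target must be strictly between 0 and 1")
    return math.ceil(math.log(1.0 - target) / math.log(1.0 - p_online))


if __name__ == "__main__":
    # Hypothetical: hosts online half the time, data reachable 99% of the time.
    print(availability(0.5, 2))         # 0.75, so two copies fall well short
    print(replicas_needed(0.5, 0.99))   # 7
```

Every extra copy is another full transfer out of the lab, which is where the bandwidth issue comes from.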


Janice
ID: 1036044 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1036050 - Posted: 25 Sep 2010, 12:07:15 UTC - in response to Message 1036041.  

What happened to S@H being able to spread out storage amongst the clients? To my knowledge, that feature has never been used. There are countless petabytes of storage space available that they could use.

It can be done only for backup purposes, but AFAIK they already have a good backup storage provider.
For data analysis one needs direct access to the whole data array. Such analysis would be just impossible if the data were distributed over different hosts with less than a gigabit network connection between each other (and even in that case processing speed would drop by a few orders of magnitude due to the absence of memory caching of the data).
ID: 1036050 · Report as offensive
Terror Australis
Volunteer tester

Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1036077 - Posted: 25 Sep 2010, 13:56:41 UTC

With everything offline, I wonder why the bandwidth usage is so high?

The Cricket graphs are still showing 30 Mbps going into Berkeley, which is the normal level for a Saturday. However, as no one can report or upload...

T.A.
ID: 1036077 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1036079 - Posted: 25 Sep 2010, 14:04:46 UTC

Yes, strange.

The servers are down, but the Cricket graph shows traffic.

For the ~6 hours of 'server down' so far: ~5 Mbit/s DL and ~30 Mbit/s UL.

Hmm, maybe something is wrong (because of the changed server) and we are now seeing the 'internal' traffic between some of the servers?

ID: 1036079 · Report as offensive
Profile Geek@Play
Volunteer tester
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1036081 - Posted: 25 Sep 2010, 14:05:38 UTC

See my post here.
Boinc....Boinc....Boinc....Boinc....
ID: 1036081 · Report as offensive
Profile Jim_S
Joined: 23 Feb 00
Posts: 4705
Credit: 64,560,357
RAC: 31
United States
Message 1036084 - Posted: 25 Sep 2010, 14:11:03 UTC
Last modified: 25 Sep 2010, 14:12:03 UTC

Panic Mode On...Hyperventilating...Need paper bag...Need Valium.
OK now, The Staff WILL work things out.

I Desire Peace and Justice, Jim Scott (Mod-Ret.)
ID: 1036084 · Report as offensive
Profile hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1036085 - Posted: 25 Sep 2010, 14:11:25 UTC

All but out of work, are they going to let us get any this week?
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 1036085 · Report as offensive
Profile Geek@Play
Volunteer tester
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1036092 - Posted: 25 Sep 2010, 14:29:33 UTC

I got so many .vlar wu last week that I doubt I would (could) download anything. Only wish to report work now.
Boinc....Boinc....Boinc....Boinc....
ID: 1036092 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1036226 - Posted: 25 Sep 2010, 18:48:56 UTC
Last modified: 25 Sep 2010, 18:51:11 UTC

I still have hundreds of backlogged DLs in BOINC's transfer overview. :-(

In a few hours the GPUs of the GPU machine will go idle.
I currently have ~30 days of rescheduled VLARs on/for the CPU.
ID: 1036226 · Report as offensive
DJStarfox

Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 1036257 - Posted: 25 Sep 2010, 20:26:04 UTC

Ugh, why don't they enable the scheduler?! In order to process all the workunits, my machine needs to report everything that's already uploaded.

This is why they're out of space:
Results returned and awaiting validation 7,315,135

"Master database queries/second" has also been broken since the last outage.
ID: 1036257 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1036262 - Posted: 25 Sep 2010, 20:31:44 UTC

From Jeff in tech news

Reality got a little ahead of us on this one. We were days, a week tops, away from migrating
upload service from bruno to bambi. This will double our upload space and allow us to turn
off bruno.

We will now move this up and make it top priority come Monday. We need to reconfigure
the raid on bambi and then let the raid sync. At that point we can both turn the projects
on and start migrating the results from bruno to bambi. We hope that this will be early in
the week. We'll then leave the projects on through next weekend, ie no normal 3 day outage.

ID: 1036262 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65745
Credit: 55,293,173
RAC: 49
United States
Message 1036281 - Posted: 25 Sep 2010, 20:58:32 UTC - in response to Message 1036262.  

From Jeff in tech news

Reality got a little ahead of us on this one. We were days, a week tops, away from migrating
upload service from bruno to bambi. This will double our upload space and allow us to turn
off bruno.

We will now move this up and make it top priority come Monday. We need to reconfigure
the raid on bambi and then let the raid sync. At that point we can both turn the projects
on and start migrating the results from bruno to bambi. We hope that this will be early in
the week. We'll then leave the projects on through next weekend, ie no normal 3 day outage.


3 Cheers for Jeff and the Staff!

Hip hip hooray! Hip hip hooray! Hip hip hooray!

And I mean that most sincerely too. :D
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1036281 · Report as offensive
Dave

Joined: 29 Mar 02
Posts: 778
Credit: 25,001,396
RAC: 0
United Kingdom
Message 1036303 - Posted: 25 Sep 2010, 21:47:48 UTC

I have the GX60 & Amilo laptop off for now as they're dry, but I'm happy that the project should remain up over the coming week to let things settle down again. The next scheduled outage therefore sounds like it should be Tue 5 Oct.
ID: 1036303 · Report as offensive
Profile hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1036622 - Posted: 28 Sep 2010, 17:49:18 UTC

Well the download server is up but not the upload server. Have tons of uploads so I can download...Oh Well.
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 1036622 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1036624 - Posted: 28 Sep 2010, 17:53:29 UTC - in response to Message 1036622.  

Well the download server is up but not the upload server. Have tons of uploads so I can download...Oh Well.

It may show as being up, but either it's not really online or nobody has anything to download, 'cuz the Cricket graphs have not changed much since it showed as being up hours ago.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1036624 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1036625 - Posted: 28 Sep 2010, 17:55:14 UTC
Last modified: 28 Sep 2010, 18:29:19 UTC

I still have hundreds of backlogged DLs in the transfer overview in BOINC, and nothing is coming down. :-(
ID: 1036625 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1036628 - Posted: 28 Sep 2010, 18:11:27 UTC - in response to Message 1036303.  

I have the GX60 & Amilo laptop off for now as they're dry, but I'm happy that the project should remain up over the coming week to let things settle down again. The next scheduled outage therefore sounds like it should be Tue 5 Oct.


The way things are going, better you should ask "When is the next scheduled UPage?"
ID: 1036628 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1036641 - Posted: 28 Sep 2010, 18:40:55 UTC - in response to Message 1036635.  

I have the GX60 & Amilo laptop off for now as they're dry, but I'm happy that the project should remain up over the coming week to let things settle down again. The next scheduled outage therefore sounds like it should be Tue 5 Oct.


The way things are going, better you should ask "When is the next scheduled UPage?"


Mmmmm, I can smell Uptime coming at any moment now. I feel it in my bad knee also.


I sure hope you are right - I'm out of CUDA WUs on one machine, and all the CPUs left are VLARs...guess I should be happy the 4 GT240s on this baby are getting a nice rest.
ID: 1036641 · Report as offensive
Profile ScarabDrowner
Volunteer tester
Joined: 13 Sep 03
Posts: 90
Credit: 456,378
RAC: 0
United States
Message 1036689 - Posted: 28 Sep 2010, 20:25:18 UTC

Then set your managers to not request new work until the cricket graph shows uploads tapering off
ID: 1036689 · Report as offensive
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1036713 - Posted: 28 Sep 2010, 21:23:37 UTC - in response to Message 1036689.  

Then set your managers to not request new work until the cricket graph shows uploads tapering off

Many if not most of us who read these boards do that already.

Sten-Arne is only pointing out that the vast majority of S@H users don't read these boards. They are "set and forget" users whose BOINC Managers are continuously trying to reach the servers and, even with automatic back-off, will drive bandwidth to the max as soon as ANY of the Upload/Download/Scheduling servers come online. (Yes, I know the Server Status page says Download Server 2 on Vader is running, but the Cricket Graph says it isn't "online".)
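
To put a rough shape on the back-off point, here is a minimal sketch of generic exponential back-off with random jitter. It is not BOINC's actual client code; the base delay, the cap, and both function names are assumptions for illustration only.

```python
import random
from typing import Optional


def backoff_delay(attempt: int, base: float = 60.0, cap: float = 4 * 3600.0,
                  rng: Optional[random.Random] = None) -> float:
    """Seconds to wait before retry number `attempt` (0-based).

    The ceiling doubles per failed attempt up to `cap`; the actual wait is
    drawn uniformly below that ceiling so clients do not retry in lockstep.
    """
    rng = rng if rng is not None else random.Random()
    ceiling = min(cap, base * (2 ** attempt))
    return rng.uniform(0.0, ceiling)


def retries_in_window(n_clients: int, attempt: int, window: float,
                      seed: int = 0) -> int:
    """Count simulated clients whose next retry lands within `window` seconds."""
    rng = random.Random(seed)
    return sum(1 for _ in range(n_clients)
               if backoff_delay(attempt, rng=rng) <= window)


if __name__ == "__main__":
    # Hypothetical population: 100,000 hosts, each already backed off 10 times.
    print(retries_in_window(100_000, attempt=10, window=60.0))
```

Even with the delay capped at hours, a fixed fraction of a very large population retries in any given minute, so the pipe fills the moment a server answers.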


Donald
Infernal Optimist / Submariner, retired
ID: 1036713 · Report as offensive


 