Panic Mode On (96) Server Problems?

Message boards : Number crunching : Panic Mode On (96) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 23 · Next

AuthorMessage
The_bestest

Send message
Joined: 7 Oct 06
Posts: 36
Credit: 82,706,887
RAC: 79
United States
Message 1649364 - Posted: 5 Mar 2015, 0:09:35 UTC - in response to Message 1649357.  

Matt's update was fine, and that's really all we can hope for. The issue is finding the update. It is extremely difficult to find a specific post in a specific thread on a specific subject. If Matt's update were on the home page, then people would learn to go to that specific location for the most current update. The exact same amount of time would bre required, only the location of the update changes

To say that would create more complaints is in my opinion silly. It is the uncertainty that I believe creates more issues. If the updates are to be kept in the discussion forums, then create a category that ONLY the admins can post to. No comments, no nuthin posted there by anyone else

Which is close to what the Technical News forum is designed for, and I for one would welcome Matt's resumption of a narrative there. TN allows a degree of interactivity, which I think is useful - there have been times when the Q & A dialog following an initial 'news' post has been even more informative than the opening item. But I must confess I'm less enthusiastic about the incontinent outpourings of 'thank you' that follow any new thread. And as for posters that hit the top link on any random page for their outpourings - yet again, 'read before you post', guys.


I concede your point on the interactivity part of the threads. I also fully agree with 'read before you post' (wish I could say I've never made that mistake before). Guess all I'm saying is I wish there were some consistency on how this type of information is distributed. To use your example of the Technical News forum, fine...go for it. But then dont post the update next time in Number Crunching, or some other topic. Ok, been a long day and we can beat this subject to death to no resolution . Thanks for spending the time to shoot this around
ID: 1649364 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1649366 - Posted: 5 Mar 2015, 0:15:21 UTC

"Marching on... (March 4, 2015)" (Technical News) - Matt Lebofsky
http://setiathome.berkeley.edu/forum_thread.php?id=76871
 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1649366 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1649387 - Posted: 5 Mar 2015, 1:51:52 UTC
Last modified: 5 Mar 2015, 1:53:31 UTC

I don't know what you people are complaining about. Why around 8:15pm (EST) each of my two machines were blessed by SETI and Her Servers with ONE WU each - that's one each in the last 24++ hours. and one of them finished in the lengthy time of three (3) seconds.

And you think there's no work to be had. For Shame!!!
ID: 1649387 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11362
Credit: 29,581,041
RAC: 66
United States
Message 1649398 - Posted: 5 Mar 2015, 2:49:04 UTC - in response to Message 1649387.  

And you think there's no work to be had. For Shame!!!

My better machine will be doing Einstein if things don't get better soon the other one has veen restocked.
ID: 1649398 · Report as offensive
Profile Fawkesguy
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 108
Credit: 188,578,766
RAC: 0
United States
Message 1649399 - Posted: 5 Mar 2015, 3:05:35 UTC - in response to Message 1649387.  

.....each of my two machines were blessed by SETI and Her Servers with ONE WU each - that's one each in the last 24++ hours....


Be thankful. I have eight machines and not one of them has gotten a single new WU since the Tuesday outage. Not sure what to do besides be patient, I suppose.
ID: 1649399 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1649402 - Posted: 5 Mar 2015, 3:20:20 UTC - in response to Message 1649399.  

.....each of my two machines were blessed by SETI and Her Servers with ONE WU each - that's one each in the last 24++ hours....


Be thankful. I have eight machines and not one of them has gotten a single new WU since the Tuesday outage. Not sure what to do besides be patient, I suppose.

Yeah, patience is the ticket alright. I was dry here all day, but finally did get a dump for all three machines, so the tanks are about 3/4 and all is well for a bit.
ID: 1649402 · Report as offensive
Dena Wiltsie
Volunteer tester

Send message
Joined: 19 Apr 01
Posts: 1628
Credit: 24,230,968
RAC: 26
United States
Message 1649409 - Posted: 5 Mar 2015, 3:51:09 UTC

There is hope. I have a 2 day queue and I had 8 crunching and one waiting when I received 85 work units with one request. 2 days of work would be 100 work units so I am almost full again. I suspect computers requesting work have been greatly reduced so if you haven't received work yet, you should shortly.
ID: 1649409 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1649424 - Posted: 5 Mar 2015, 4:42:15 UTC

I bought a used Core i5 machine from the County Surplus Store over the weekend, took it to my favorite tech for a once-over. Got it back and put it online around noon PST today, and loaded BOINC and S@H. Got nothing until about 20 minutes ago, when the logjam broke and I got all the MB files plus 38 Tasks. Two of my other boxes just got through to the Servers and got new Tasks, too. Making progress, I hope.
Donald
Infernal Optimist / Submariner, retired
ID: 1649424 · Report as offensive
_
Avatar

Send message
Joined: 15 Nov 12
Posts: 299
Credit: 9,037,618
RAC: 0
United States
Message 1649430 - Posted: 5 Mar 2015, 5:03:43 UTC

No worries everyone. During times like these, I switch to Einstein, bury my head in the sand, and before you know it SETI will be back. :)
ID: 1649430 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1649457 - Posted: 5 Mar 2015, 6:39:02 UTC - in response to Message 1649430.  
Last modified: 5 Mar 2015, 7:12:48 UTC

One of my machines picked up GPU work while I was at work, but the other still is to receive any GPU work.
There are still random Scheduler errors occurring, and when you do get a response from the Scheduler the response is usually "Project has no tasks available" (as others have noted).

If the server stats are halfway right thee is plenty of work available, it's just the Scheduler issues that make it almost impossible to get any.


EDIT- finally managed to get some GPU for the second system.
Grant
Darwin NT
ID: 1649457 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1649476 - Posted: 5 Mar 2015, 7:42:41 UTC

Couldn't sleep (it's 2:30am here near Boston) so I tickled both of my machines with an Update, and guess what? Each of them got exactly ONE WU - and a VLAR at that.

Maybe Dr. A., after the triumph that was New Credit, decided on New WU????

This is fricking ridiculous - my machines have gotten NO work (well, 4 WUs total) for at least 36 hours now - I have never seen anything like this (when SETI was up and running - apparently) in the 12+ years I have been with the project.

What is the damn problem???????
ID: 1649476 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1649478 - Posted: 5 Mar 2015, 7:50:57 UTC - in response to Message 1649476.  

Couldn't sleep (it's 2:30am here near Boston) so I tickled both of my machines with an Update, and guess what? Each of them got exactly ONE WU - and a VLAR at that.

Maybe Dr. A., after the triumph that was New Credit, decided on New WU????

This is fricking ridiculous - my machines have gotten NO work (well, 4 WUs total) for at least 36 hours now - I have never seen anything like this (when SETI was up and running - apparently) in the 12+ years I have been with the project.

What is the damn problem???????

Matt has posted an update in tech news

http://setiathome.berkeley.edu/forum_thread.php?id=76871&postid=1649365#1649365
ID: 1649478 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1649489 - Posted: 5 Mar 2015, 8:17:30 UTC - in response to Message 1649478.  

Couldn't sleep (it's 2:30am here near Boston) so I tickled both of my machines with an Update, and guess what? Each of them got exactly ONE WU - and a VLAR at that.

Maybe Dr. A., after the triumph that was New Credit, decided on New WU????

This is fricking ridiculous - my machines have gotten NO work (well, 4 WUs total) for at least 36 hours now - I have never seen anything like this (when SETI was up and running - apparently) in the 12+ years I have been with the project.

What is the damn problem???????

Matt has posted an update in tech news

http://setiathome.berkeley.edu/forum_thread.php?id=76871&postid=1649365#1649365

As well as in this very thread, yet it appears it went unread...
Grant
Darwin NT
ID: 1649489 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1649503 - Posted: 5 Mar 2015, 8:50:57 UTC - in response to Message 1649489.  

As well as in this very thread, yet it appears it went unread...


No, I read it. It really didn't say very much except that MySQL is basically unsuitable for the project as it has grown to become (it's not a commercial, heavy duty db, just an academic toy (IMO)) and they are trying to figure out what to do about it. Or perhaps they just don't have the appropriate level of expertise to get it to work with the volume of info SETI has; I wish I could help there, but (as Sgt. Schultz used to say), "I know nossing".

As in: they seemingly DON'T know what to do about it.
ID: 1649503 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1649505 - Posted: 5 Mar 2015, 8:57:49 UTC - in response to Message 1649503.  

As in: they seemingly DON'T know what to do about it.

Nope, they don't.
Until they can determine what is actually causing the issues, it's pretty much impossible to fix them.
Once they figure out the cause/causes, that will then show them which direction to head in in order to resolve them.
Grant
Darwin NT
ID: 1649505 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1649509 - Posted: 5 Mar 2015, 9:04:59 UTC - in response to Message 1649457.  
Last modified: 5 Mar 2015, 9:08:47 UTC

EDIT- finally managed to get some GPU work for the second system.

Since then the Scheduler errors appear to have cleared up, but have been unable to get any more work. "Project has no tasks available" is now the response to all Scheduler requests.

EDIT- there have been 2 "This computer has reached a limit on tasks in progress" messages, which is the response to GPU work requests when only VLARs are available even though you're a long way short of actually reaching any work limits.
Grant
Darwin NT
ID: 1649509 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1649517 - Posted: 5 Mar 2015, 9:38:41 UTC - in response to Message 1649509.  

EDIT- finally managed to get some GPU work for the second system.

Since then the Scheduler errors appear to have cleared up, but have been unable to get any more work. "Project has no tasks available" is now the response to all Scheduler requests.

EDIT- there have been 2 "This computer has reached a limit on tasks in progress" messages, which is the response to GPU work requests when only VLARs are available even though you're a long way short of actually reaching any work limits.

Almost without fail, the best way to get work when even after several days of trying you've nothing to show for it, is to make a post about the inability to get work.
I just got some more GPU work. I might even be able to make it through the night without running out.
Grant
Darwin NT
ID: 1649517 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1649518 - Posted: 5 Mar 2015, 10:14:43 UTC - in response to Message 1649505.  

As in: they seemingly DON'T know what to do about it.

Nope, they don't.
Until they can determine what is actually causing the issues, it's pretty much impossible to fix them.
Once they figure out the cause/causes, that will then show them which direction to head in in order to resolve them.

It has been mentioned a few times over the years that the project keeps finding new limitations with the various DB softwares/engines they are using, because the size and complexity of the DBs are more than what anyone had available when the software/engine was being designed in the first place.

Long story short: every few years, the project reaches another point where they are quite literally in "uncharted waters" and they have to figure out where the problem is, then what the problem is, and then figure out possible solutions.

Figuring out possible solutions are just educated guesses and trial-and-error for the most part, and there's probably some communication with the developers, which mostly boils down to "I've done all I can think of. Do you have any ideas how/where I should poke this and see what happens?"


Give it time, dude. Seriously. As someone else said earlier in this very thread: "the project never guaranteed it would always have work available." It's even right there on the home page: "Keep your computer busy when SETI@home has no work - participate in other BOINC-based projects."

That being said, I understand not wanting to participate in other projects.. I, myself, don't have any interest in other projects, but you haven't seen me blowing-up in an angry post about how my machine has had an empty cache for 3 days shy of a month at this point. Yes, I did mention that my AP cache finally ran out back on Feb 8, but that's my choice to have an empty cache, and I'm not complaining about it, because.. again: they never guaranteed or promised to always have work.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1649518 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1649546 - Posted: 5 Mar 2015, 13:24:02 UTC

FINALLY - just got some real work - 104 WUs on one machine, 30 or so on the other. Unlike in the past, they were almost all GPU; recently, the servers were filling my CPU cache before the GPUs, but not this time (maybe they want returns fast & furious?). Interestingly also, mostly shorties. And the CPUs are getting vlars only (so far).

Maybe only a one-time occurrence?

But who cares? Back in business at last!
ID: 1649546 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1649550 - Posted: 5 Mar 2015, 14:17:58 UTC

Both Win 7 and Linux partions have filled their caches.


With each crime and every kindness we birth our future.
ID: 1649550 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 23 · Next

Message boards : Number crunching : Panic Mode On (96) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.