Scheduler down??!

Message boards : Number crunching : Scheduler down??!
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Saimek

Send message
Joined: 25 Jan 00
Posts: 121
Credit: 454,423
RAC: 0
Poland
Message 30641 - Posted: 27 Sep 2004, 20:32:59 UTC
Last modified: 27 Sep 2004, 20:48:26 UTC

scheduler kryten Running on the seti status page updated 3 mins ago.. but i cannot connect for ~30mins? any one got the same problem?

ID: 30641 · Report as offensive
Profile Dizzy
Avatar

Send message
Joined: 6 Mar 04
Posts: 6
Credit: 819,531
RAC: 0
Poland
Message 30647 - Posted: 27 Sep 2004, 20:43:14 UTC - in response to Message 30641.  
Last modified: 27 Sep 2004, 20:48:57 UTC

Me too
cannot connect for ~60 mins
ID: 30647 · Report as offensive
David Taylor

Send message
Joined: 17 May 99
Posts: 16
Credit: 199,598
RAC: 0
United Kingdom
Message 30649 - Posted: 27 Sep 2004, 20:50:40 UTC - in response to Message 30641.  
Last modified: 27 Sep 2004, 22:21:59 UTC

> scheduler kryten Running on the seti status page updated 3 mins ago.. but i
> cannot connect for ~30mins? any one got the same problem?
>
>
>
Me three!

90mins +


Edited 22:21 UTC:- But the validators not. I just received over 500 hundred credits that is around a thousand today! Also got plenty to keep me going.
ID: 30649 · Report as offensive
Profile [HWU] GHz & CO. - BOINC.Italy
Volunteer tester
Avatar

Send message
Joined: 1 Jul 02
Posts: 139
Credit: 1,466,611
RAC: 0
Italy
Message 30654 - Posted: 27 Sep 2004, 20:53:32 UTC
Last modified: 27 Sep 2004, 20:55:10 UTC

Here too...I hope that is not a big problem....:(

SETI@home - 2004-09-27 22:44:44 - Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
SETI@home - 2004-09-27 22:44:44 - No schedulers responded

<p>GHz

Hardware Upgrade - Seti@home

ID: 30654 · Report as offensive
Profile Sir Ulli
Volunteer tester
Avatar

Send message
Joined: 21 Oct 99
Posts: 2246
Credit: 6,136,250
RAC: 0
Germany
Message 30655 - Posted: 27 Sep 2004, 20:54:35 UTC

yes schedulers are down, at this time, hope this is only a temporarly prob.

Greetings from Germany NRW
Ulli S@h Berkeley's Staff Friends Club m7 ©
ID: 30655 · Report as offensive
Saimek

Send message
Joined: 25 Jan 00
Posts: 121
Credit: 454,423
RAC: 0
Poland
Message 30672 - Posted: 27 Sep 2004, 22:07:57 UTC - in response to Message 30655.  

> yes schedulers are down, at this time, hope this is only a temporarly prob.

So why it's running on the STATUS page??... it's an imaginary status or what? :D

ID: 30672 · Report as offensive
Profile Sir Ulli
Volunteer tester
Avatar

Send message
Joined: 21 Oct 99
Posts: 2246
Credit: 6,136,250
RAC: 0
Germany
Message 30677 - Posted: 27 Sep 2004, 22:37:09 UTC

i hope this is not a new Prob with the Snap...

Greetings from Germany NRW
Ulli S@h Berkeley's Staff Friends Club m7 ©
ID: 30677 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 30704 - Posted: 28 Sep 2004, 1:34:26 UTC
Last modified: 28 Sep 2004, 1:38:32 UTC

I've been trying for an hour and 15 minutes. Server status page says scheduler is running. Whaas up wit dat?

L8R....

---

Rick A. - BOINCing right along now.... It can only get better! </p>
ID: 30704 · Report as offensive
Profile Bill Price
Avatar

Send message
Joined: 5 Jun 99
Posts: 73
Credit: 2,397,157
RAC: 5
United States
Message 30719 - Posted: 28 Sep 2004, 3:44:17 UTC - in response to Message 30641.  

I got my last new work unit as follows:

SETI@home - 2004-09-27 03:24:51 - Finished download of 06my04aa.15945.21650.711094.93

The scheduler stopped responding as follows:

SETI@home - 2004-09-27 12:35:00 - No schedulers responded

It's been down ever since (all times Pacific Daylight Time).

Bill
ID: 30719 · Report as offensive
EclipseHA

Send message
Joined: 28 Jul 99
Posts: 1018
Credit: 530,719
RAC: 0
United States
Message 30733 - Posted: 28 Sep 2004, 5:12:30 UTC - in response to Message 30715.  


> Me thinks the snap snapped and dats dat!
> Sorry, Couldn't help myself :)

I think the SNAP is gone.. If you notice the "UL/DL moves", the dirs got moved off the SNAP with the last move (they were on SNAP, but just got moved, based on what I see!)

You'll notice that there's been no mention of SNAP in the news, and none of the sponsors are back on the main web page (they went away when SNAP started having major problems....)

LHC is open to 5000, and soon 50,000, yet I've not seen any of the "hickups" that SETI seems to display!

Predictor was good, CP is good, LHC is good, and with Seti, we have the scheduler down for hours, while the status says is still running!
ID: 30733 · Report as offensive
Heaphus
Volunteer tester

Send message
Joined: 1 Apr 03
Posts: 96
Credit: 4,148,549
RAC: 0
United States
Message 30735 - Posted: 28 Sep 2004, 5:16:32 UTC

The amount of users and I/O of Predictor, CP, and LHC isn't a drop in the bucket to what Seti is doing. At least credit is flowing.
ID: 30735 · Report as offensive
Profile mlcudd
Volunteer tester
Avatar

Send message
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 30738 - Posted: 28 Sep 2004, 5:24:01 UTC

>>You'll notice that there's been no mention of SNAP in the news, and none of the sponsors are back on the main web page (they went away when SNAP started having major problems....)

All the Sponsors are still listed on the Seti Home Page, including Snap.
Rom stated before on a different thread, that after they got other problems straight that he was sure Dave would get it back on there. It was lost sometime in one of the failures. If they had aboned ship, don't you think they would have removed themselves from all projects.

regards,,
Rocky

ID: 30738 · Report as offensive
Profile mlcudd
Volunteer tester
Avatar

Send message
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 30739 - Posted: 28 Sep 2004, 5:24:13 UTC
Last modified: 28 Sep 2004, 5:24:49 UTC

ID: 30739 · Report as offensive
EclipseHA

Send message
Joined: 28 Jul 99
Posts: 1018
Credit: 530,719
RAC: 0
United States
Message 30743 - Posted: 28 Sep 2004, 5:27:36 UTC - in response to Message 30735.  

> The amount of users and I/O of Predictor, CP, and LHC isn't a drop in the
> bucket to what Seti is doing. At least credit is flowing.
>
>

LHC wants to go to 50k users this week - it blows SETI away! (seti has only got 20k "active" users at best - not those that have ever returned a result, but those that have done so in the last couple of weeks!)
ID: 30743 · Report as offensive
Profile Rom Walton (BOINC)
Volunteer tester
Avatar

Send message
Joined: 28 Apr 00
Posts: 579
Credit: 130,733
RAC: 0
United States
Message 30745 - Posted: 28 Sep 2004, 5:30:44 UTC - in response to Message 30743.  
Last modified: 28 Sep 2004, 5:31:23 UTC

> LHC wants to go to 50k users this week - it blows SETI away! (seti has only
> got 20k "active" users at best - not those that have ever returned a result,
> but those that have done so in the last couple of weeks!)

According to http://www.setisynergy.com/stats/ SETI has 47k active users and 98k active machines...

----- Rom

BOINC Development Team, U.C. Berkeley

ID: 30745 · Report as offensive
EclipseHA

Send message
Joined: 28 Jul 99
Posts: 1018
Credit: 530,719
RAC: 0
United States
Message 30747 - Posted: 28 Sep 2004, 5:34:26 UTC - in response to Message 30738.  


>
> All the Sponsors are still listed on the Seti Home Page, including Snap.

Check out http://setiweb.ssl.berkeley.edu/ where the sponsors for "seti/Boinc" WERE listed until the SNAP started going toes up!


>Rom stated before on a different thread, that after they got other problems
> straight that he was sure Dave would get it back on there. It was lost
> sometime in one of the failures.

Do you know HTML? This SHOULD be a five minute fix, and hasn't been done in over a month! If I represented a "sponsor", I'd be applying pressure to get it fixed.. That is if I wanted my company associated with this project! :)

> If they had aboned ship, don't you think they
> would have removed themselves from all projects.
>

No, as Classic seems to be in the same state it has been for years.... I can see that they wouldn't want their company associated with this project, after the bad press that SNAP got!

ID: 30747 · Report as offensive
EclipseHA

Send message
Joined: 28 Jul 99
Posts: 1018
Credit: 530,719
RAC: 0
United States
Message 30749 - Posted: 28 Sep 2004, 5:47:50 UTC - in response to Message 30745.  
Last modified: 28 Sep 2004, 5:49:43 UTC

> According to http://www.setisynergy.com/stats/ SETI has 47k active users and
> 98k active machines...


Come on Rom..

(BTW, I like the fact that you don't use Seti/boinc stats for this - you got DB access - but base it on a third party stats site!)

You know as well as I do, that the 47k are users that have one or more credits since 6/22. It doesn't mean that that they've even connected since July!

As far as the 98k machines, how many are "dups" as there was a "detach/attach" for that CPU, and the user didn't "merge"...

That's part of the problem, is that stats like this are "unbounded"

Lets bound it a bit.. How many Seti of the 47k users have gotten a credit in the last 21 days? How many of the 98k mancines have gotten a credit in the same timeframe? (I'll bet there's a boatload of CPU's that never got merged!)

That would be a real measure of "active users and hosts".. Using the count of total DB records is just as much of a guess as I make!

Bound the stats!


ID: 30749 · Report as offensive
Profile Rom Walton (BOINC)
Volunteer tester
Avatar

Send message
Joined: 28 Apr 00
Posts: 579
Credit: 130,733
RAC: 0
United States
Message 30756 - Posted: 28 Sep 2004, 6:23:36 UTC - in response to Message 30749.  

> Come on Rom..
>
> (BTW, I like the fact that you don't use Seti/boinc stats for this - you got
> DB access - but base it on a third party stats site!)

Well pushing a car past the red line is bad for the car... Any query I could run to give me a better idea about what is going on is only going to cause a problem somewhere else in the system.

An example here:
select count(*) from workunit where need_validate=1

Takes roughly 45 mins to execute. Even with indexes and the like. Executing such a query would probably drop quite a few queries during that 45 mins.

More equipment has been ordered, I just don't have an ETA.

> You know as well as I do, that the 47k are users that have one or more credits
> since 6/22. It doesn't mean that that they've even connected since July!
>
> As far as the 98k machines, how many are "dups" as there was a "detach/attach"
> for that CPU, and the user didn't "merge"...
>
> That's part of the problem, is that stats like this are "unbounded"
>
> Lets bound it a bit.. How many Seti of the 47k users have gotten a credit in
> the last 21 days? How many of the 98k mancines have gotten a credit in the
> same timeframe? (I'll bet there's a boatload of CPU's that never got merged!)
>
> That would be a real measure of "active users and hosts".. Using the count of
> total DB records is just as much of a guess as I make!
>
> Bound the stats!

Would be nice, but the only way to get the data at this point comes at a cost that is a little to high right now.

Being a BOINC Dev has its advantages at this point. You can spend as much time as you like arguing which project is better than which, but in the end you are still using BOINC to get the job done.

I and the other development staff will just keep making it better.

----- Rom

BOINC Development Team, U.C. Berkeley

ID: 30756 · Report as offensive
KWSN - Milt Q. Llama III

Send message
Joined: 21 Sep 03
Posts: 2
Credit: 318,561
RAC: 0
Germany
Message 30761 - Posted: 28 Sep 2004, 6:47:43 UTC - in response to Message 30756.  


> An example here:
> select count(*) from workunit where need_validate=1
>

I am curious how long it would take to run the following (given that
you have indexed the column need_validate):

select count(need_validate) from workunit where need_validate=1

This line of code should execute MUCH FASTER.
ID: 30761 · Report as offensive
Profile The worm that turned
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 100
Credit: 4,872,533
RAC: 0
Australia
Message 30770 - Posted: 28 Sep 2004, 7:29:50 UTC
Last modified: 29 Sep 2004, 12:09:07 UTC

At last something to whinge about again
Yipee the schedulers down
I thought I was going to have to change my name to The happy worm
Every cloud has a silver lining

ID: 30770 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : Scheduler down??!


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.