Panic Mode On (13) Server problems

archae86

Joined: 31 Aug 99
Posts: 909
Credit: 1,582,816
RAC: 0
United States
Message 869462 - Posted: 25 Feb 2009, 22:23:20 UTC

After nine hours of almost steady decline from peak on the network graph, it pegged somewhat over an hour ago, and has stayed there.

More unpleasantness?
ID: 869462 · Report as offensive
Dirk Sadowski
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 869478 - Posted: 25 Feb 2009, 23:19:33 UTC


I don't think that everything is working well again..

After every work request I get only 1 new WU..

I hope before the next weekend the WU cache is full again.. ;-)

ID: 869478 · Report as offensive
Bernie Vine
Volunteer moderator
Volunteer tester

Joined: 26 May 99
Posts: 9960
Credit: 103,452,613
RAC: 328
United Kingdom
Message 869513 - Posted: 26 Feb 2009, 0:47:09 UTC - in response to Message 869413.  

Does anyone know why neither AP_validate is running? They both say "Not Running". The explanation is "Program failed or ran out of work". I don't think they ran out of work.

They're both back online now ;-)



Have a look at what Matt says here.

Bernie
ID: 869513 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 14020
Credit: 208,696,464
RAC: 304
Australia
Message 869968 - Posted: 27 Feb 2009, 7:48:26 UTC - in response to Message 869513.  


Calm before the storm?
Work in progress is pretty much back to normal; however, AP turnaround time is still about 50 hours less than it was at the beginning of the month. MB results received in the last hour are down, and the turnaround time is about 20 hours higher than it was at the beginning of the month.
And the network traffic is now down to 40Mb/s, whereas it was generally around 60Mb/s before the Great Storm.

Problems ahead or has the recent spell of longer work settled things down?
Grant
Darwin NT
ID: 869968 · Report as offensive
Hans Kramer
Volunteer tester

Joined: 16 May 99
Posts: 61
Credit: 8,770,184
RAC: 0
Netherlands
Message 869971 - Posted: 27 Feb 2009, 8:00:06 UTC - in response to Message 869968.  

I can only speculate but,

After the storm, people got their queues filled up and the turnaround time started counting then, so on average the first results should be coming back after that time.

People with other projects as fallback got more WUs for them and are crunching those other projects for now, but will get back to S@H when those are done.

My bet is that in a few days' time things will get back to a "normal" level.
ID: 869971 · Report as offensive
cavemanmoron

Joined: 20 Dec 00
Posts: 15
Credit: 90,680
RAC: 0
United States
Message 870203 - Posted: 27 Feb 2009, 23:46:15 UTC - in response to Message 869971.  
Last modified: 27 Feb 2009, 23:46:55 UTC

I can only speculate but,

After the storm, people got their queues filled up and the turnaround time started counting then, so on average the first results should be coming back after that time.

People with other projects as fallback got more WUs for them and are crunching those other projects for now, but will get back to S@H when those are done.

My bet is that in a few days' time things will get back to a "normal" level.


Perhaps the computers that were switched to other stuff,
or turned off, will not be back on full duty crunching for SETI. ;)
ID: 870203 · Report as offensive
Gary Charpentier
Volunteer tester

Joined: 25 Dec 00
Posts: 31649
Credit: 53,134,872
RAC: 32
United States
Message 871431 - Posted: 2 Mar 2009, 20:31:22 UTC

2/28/2009 2:50:36 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 53139 seconds of work, reporting 3 completed tasks
2/28/2009 2:50:41 PM|SETI@home|Scheduler request succeeded: got 1 new tasks
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work can be sent for the applications you have selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for Astropulse

So it sent me a WU and in the same second said it didn't have work? Maybe.
Then it says all in the same second:

2/28/2009 2:50:41 PM|SETI@home|Message from server: You have selected to receive work from other applications if no work is available for the applications you selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: Sending work from other applications

Maybe the messages should be made a bit more human readable.

ID: 871431 · Report as offensive
archae86

Joined: 31 Aug 99
Posts: 909
Credit: 1,582,816
RAC: 0
United States
Message 871437 - Posted: 2 Mar 2009, 20:49:48 UTC - in response to Message 871431.  

2/28/2009 2:50:36 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 53139 seconds of work, reporting 3 completed tasks
2/28/2009 2:50:41 PM|SETI@home|Scheduler request succeeded: got 1 new tasks
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work can be sent for the applications you have selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for Astropulse

So it sent me a WU and in the same second said it didn't have work? Maybe.
Then it says all in the same second:

2/28/2009 2:50:41 PM|SETI@home|Message from server: You have selected to receive work from other applications if no work is available for the applications you selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: Sending work from other applications

Maybe the messages should be made a bit more human readable.

How? It told you that it had no work for the two applications you selected in your preferences, and it told you it had sent new work, which was from a third application.

The point of confusion, I think, is that folks don't recognize that Astropulse and Astropulse v5 are two separate applications. But that is not a question of the human readability of the messages.

ID: 871437 · Report as offensive
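
For anyone who wants to check this sort of log mechanically, here is a minimal sketch in Python. It assumes the `date|project|message` layout seen in the lines quoted above and matches only the message wordings that appear in this thread; the function name `summarize` and the summary keys are made up for illustration and are not part of BOINC.

```python
import re

# Log lines copied from the post above. The "date|project|message" layout is
# inferred from how they appear in this thread, not from any BOINC spec.
LOG = """\
2/28/2009 2:50:36 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 53139 seconds of work, reporting 3 completed tasks
2/28/2009 2:50:41 PM|SETI@home|Scheduler request succeeded: got 1 new tasks
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work can be sent for the applications you have selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for Astropulse
2/28/2009 2:50:41 PM|SETI@home|Message from server: You have selected to receive work from other applications if no work is available for the applications you selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: Sending work from other applications
"""


def summarize(log_text):
    """Collect what was requested, what arrived, and which apps had no work."""
    summary = {
        "requested_seconds": None,
        "tasks_received": None,
        "no_work_for": [],       # applications the server said it had no work for
        "fallback_used": False,  # True if work came from "other applications"
    }
    for line in log_text.splitlines():
        if not line.strip():
            continue
        # Split on the first two pipes only, in case a message contains one.
        _stamp, _project, message = line.split("|", 2)

        m = re.search(r"Requesting (\d+) seconds of work", message)
        if m:
            summary["requested_seconds"] = int(m.group(1))

        m = re.search(r"got (\d+) new tasks", message)
        if m:
            summary["tasks_received"] = int(m.group(1))

        m = re.match(r"Message from server: No work is available for (.+)", message)
        if m:
            summary["no_work_for"].append(m.group(1))

        if "Sending work from other applications" in message:
            summary["fallback_used"] = True
    return summary


if __name__ == "__main__":
    print(summarize(LOG))
```

Run on the seven lines above, the summary shows one task received, no work for the two named applications, and the fallback flag set, which is the reading archae86 gives: the single task came from an application other than the two that were reported empty.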
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 871461 - Posted: 2 Mar 2009, 22:04:13 UTC - in response to Message 871437.  

2/28/2009 2:50:36 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 53139 seconds of work, reporting 3 completed tasks
2/28/2009 2:50:41 PM|SETI@home|Scheduler request succeeded: got 1 new tasks
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work can be sent for the applications you have selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
2/28/2009 2:50:41 PM|SETI@home|Message from server: No work is available for Astropulse

So it sent me a WU and in the same second said it didn't have work? Maybe.
Then it says all in the same second:

2/28/2009 2:50:41 PM|SETI@home|Message from server: You have selected to receive work from other applications if no work is available for the applications you selected
2/28/2009 2:50:41 PM|SETI@home|Message from server: Sending work from other applications

Maybe the messages should be made a bit more human readable.

How? It told you that it had no work for the two applications you selected in your preferences, and it told you it had sent new work, which was from a third application.

The point of confusion, I think, is that folks don't recognize that Astropulse and Astropulse v5 are two separate applications. But that is not a question of the human readability of the messages.

Well, if folks read it and folks don't understand it, that meets my definition of human readability (failed). The onus is on the communicator to transfer the information to the communicatee.

@ Gary,

Revisit your SETI@home preferences page. You may not be aware that there's a third 'opt-in' option for the new "Astropulse v5" application that archae86 mentions, and it may not be set the way you want.
ID: 871461 · Report as offensive
HarryM
Volunteer tester

Joined: 24 Jul 08
Posts: 68
Credit: 3,812,695
RAC: 0
United States
Message 872606 - Posted: 5 Mar 2009, 20:01:09 UTC

The "ap_validate2" indicates "Not Running". AP "waiting for validation" is growing.
Seems to be cropping up occasionally lately. Has been this way for a while.
ID: 872606 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 872615 - Posted: 5 Mar 2009, 20:31:05 UTC - in response to Message 872606.  

The "ap_validate2" indicates "Not Running". AP "waiting for validation" is growing.
Seems to be cropping up occasionally lately. Has been this way for a while.

Yes - Matt wrote about it in Mono (Feb 25 2009).
ID: 872615 · Report as offensive
HarryM
Volunteer tester

Joined: 24 Jul 08
Posts: 68
Credit: 3,812,695
RAC: 0
United States
Message 872620 - Posted: 5 Mar 2009, 21:03:21 UTC - in response to Message 872615.  

Yes, he wrote about it, but if it were working correctly the "waiting for validation" count shouldn't be increasing.
ID: 872620 · Report as offensive
arkayn
Volunteer tester

Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 872703 - Posted: 6 Mar 2009, 1:29:01 UTC - in response to Message 872620.  

Matt wrote about it again today in the technical forum:

Once again not much hardware/server stuff to report. I guess the ap_validator "2" is failing due to seg faults. A fact that is obscured on the server status page (due to automatic parsing of configuration files) is that the ap_validator "2" does strictly astropulse_v5 workunits, while ap_validator "1" validates older astropulse workunits. In any case, I warned Josh, he's looking into it, etc. Probably a broken result file/database entry is causing it to seg fault and quit before doing very much.


ID: 872703 · Report as offensive
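
As a rough illustration of why the "waiting for validation" count can keep growing in that situation, here is a toy model in Python. It is not BOINC or validator code: the queue, the `validate` stand-in, and the `skip_broken` switch are all hypothetical, and the raised exception only plays the role of the seg fault Matt suspects.

```python
def validate(result):
    """Stand-in for real validation; a malformed result raises an error,
    which plays the role of the suspected seg fault in this toy model."""
    if result.get("payload") is None:
        raise ValueError(f"broken result {result['id']}")
    return True


def run_validator(queue, skip_broken):
    """Process results in order.

    Returns (number validated, results still waiting).
    With skip_broken=False the run stops at the first broken result,
    so everything queued behind it stays unvalidated.
    """
    validated = 0
    waiting = list(queue)  # work on a copy so the caller's queue is untouched
    while waiting:
        result = waiting[0]
        try:
            validate(result)
        except ValueError:
            if not skip_broken:
                break          # the process "dies" here; the bad result is still first in line
            waiting.pop(0)     # set the bad result aside and carry on
            continue
        waiting.pop(0)
        validated += 1
    return validated, waiting


# Hypothetical queue: the second result is the broken one.
queue = [
    {"id": 1, "payload": "ok"},
    {"id": 2, "payload": None},
    {"id": 3, "payload": "ok"},
    {"id": 4, "payload": "ok"},
]

if __name__ == "__main__":
    print(run_validator(queue, skip_broken=False))
    print(run_validator(queue, skip_broken=True))
```

In the first run the validator handles one result and then stops with the broken one still at the head of the queue, so each restart would hit it again while new results pile up behind it. Whether the real validator behaves this way is exactly what Matt says is still being looked into.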
Misfit
Volunteer tester

Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 873072 - Posted: 6 Mar 2009, 22:41:47 UTC

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.
me@rescam.org
ID: 873072 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 873075 - Posted: 6 Mar 2009, 22:49:43 UTC - in response to Message 873072.  

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.

Nah, that's normal. There'll be more work along in a second.

But "Current result creation rate NULL/sec NULL/sec"? THAT I can panic about.
ID: 873075 · Report as offensive
Misfit
Volunteer tester

Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 873092 - Posted: 6 Mar 2009, 23:16:48 UTC - in response to Message 873075.  

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.

Nah, that's normal. There'll be more work along in a second.

But "Current result creation rate NULL/sec NULL/sec"? THAT I can panic about.

There was plenty of work to be found, but it was all at BETA.
me@rescam.org
ID: 873092 · Report as offensive
[KWSN]John Galt 007
Volunteer tester

Joined: 9 Nov 99
Posts: 2444
Credit: 25,086,197
RAC: 0
United States
Message 873097 - Posted: 6 Mar 2009, 23:40:16 UTC

No DB dump today??
Clk2HlpSetiCty:::PayIt4ward

ID: 873097 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 873312 - Posted: 7 Mar 2009, 10:20:39 UTC - in response to Message 873097.  

No DB dump today??

Quick action by the Berkeley Boyz! There was a dump about 25 minutes after your post.
ID: 873312 · Report as offensive
Dirk Sadowski
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 873315 - Posted: 7 Mar 2009, 10:56:30 UTC




James.. same procedure as every year.. *

Oops.. no.. this was another story.. ;-D



[* Dinner for One [German] / Dinner for One [US/UK]]**

[** For the young among us.. ;-D]

ID: 873315 · Report as offensive
James Sotherden

Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 873543 - Posted: 7 Mar 2009, 22:49:56 UTC

I see the AP2 validator is up and running. I hope it stays running for another 57 hours so I can send in my AP 5.3. Or I might panic then.

Old James
ID: 873543 · Report as offensive


 
©2026 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.