Panic Mode On (13) Server problems

Message boards : Number crunching : Panic Mode On (13) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · Next

AuthorMessage
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 872703 - Posted: 6 Mar 2009, 1:29:01 UTC - in response to Message 872620.  

Matt wrote about it again today in the technical forum

Once again not much hardware/server stuff to report. I guess the ap_validator "2" is failing due to seg faults. A fact that is obscured on the server status page (due to automatic parsing of configuration files) is that the ap_validator "2" does strictly astropulse_v5 workunits, while ap_validator "1" validates older astropulse workunits. In any case, I warned Josh, he's looking into it, etc. Probably a broken result file/database entry is causing it to seg fault and quit before doing very much.


ID: 872703 · Report as offensive
Profile Misfit
Volunteer tester
Avatar

Send message
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 873072 - Posted: 6 Mar 2009, 22:41:47 UTC

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.
me@rescam.org
ID: 873072 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 873075 - Posted: 6 Mar 2009, 22:49:43 UTC - in response to Message 873072.  

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.

Nah, that's normal. There'll be more work along in a second.

But "Current result creation rate NULL/sec NULL/sec"? THAT I can panic about.
ID: 873075 · Report as offensive
Profile Misfit
Volunteer tester
Avatar

Send message
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 873092 - Posted: 6 Mar 2009, 23:16:48 UTC - in response to Message 873075.  

3/6/2009 2:39:59 PM|SETI@home|Sending scheduler request: To fetch work. Requesting 106296 seconds of work, reporting 0 completed tasks
3/6/2009 2:40:04 PM|SETI@home|Scheduler request completed: got 0 new tasks
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work sent
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work is available for SETI@home Enhanced
3/6/2009 2:40:04 PM|SETI@home|Message from server: No work available for the applications you have selected. Please check your settings on the web site.

Now I can panic.

Nah, that's normal. There'll be more work along in a second.

But "Current result creation rate NULL/sec NULL/sec"? THAT I can panic about.

There was plenty of work to be found, but it was all at BETA.
me@rescam.org
ID: 873092 · Report as offensive
Profile [KWSN]John Galt 007
Volunteer tester
Avatar

Send message
Joined: 9 Nov 99
Posts: 2444
Credit: 25,086,197
RAC: 0
United States
Message 873097 - Posted: 6 Mar 2009, 23:40:16 UTC

No DB dump today??
Clk2HlpSetiCty:::PayIt4ward

ID: 873097 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 873312 - Posted: 7 Mar 2009, 10:20:39 UTC - in response to Message 873097.  

No DB dump today??

Quick action by the Berkeley Boyz! There was a dump about 25 minutes after your post.
ID: 873312 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 873315 - Posted: 7 Mar 2009, 10:56:30 UTC




James.. same procedure as every year.. *

Ops.. no.. this was an other story.. ;-D



[* Dinner for One [german] / Dinner for One [US/UK]]**

[** For the young under us.. ;-D]

ID: 873315 · Report as offensive
Profile James Sotherden
Avatar

Send message
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 873543 - Posted: 7 Mar 2009, 22:49:56 UTC

I see the AP2 validator is up and running. I hope it stay's running for another 57 hours so i can send in my AP 5.3. Or I might panic then.
[/quote]

Old James
ID: 873543 · Report as offensive
geoff

Send message
Joined: 25 Apr 00
Posts: 123
Credit: 34,100,351
RAC: 18
United Kingdom
Message 873550 - Posted: 7 Mar 2009, 23:15:54 UTC

I have just received credit for 12 pending v5 AP wus that's about 14,000 credit but waiting on wingmen for another 40+ v5 AP, I wonder what credit Mark has got?
ID: 873550 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 873561 - Posted: 7 Mar 2009, 23:49:53 UTC - in response to Message 873550.  

I've only got three in pending and none of them validated but at least one of them got looked at so they could send it out again. My wingman screwed it up and I've been waiting 3 or 4 days for it to get reissued.


PROUD MEMBER OF Team Starfire World BOINC
ID: 873561 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 873562 - Posted: 7 Mar 2009, 23:53:42 UTC

I've got:

10 pending
3 granted
5 cleared and gone
19 in-progress/queued

That's my ap_v5 experience so far.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 873562 · Report as offensive
HarryM
Volunteer tester

Send message
Joined: 24 Jul 08
Posts: 68
Credit: 3,812,695
RAC: 0
United States
Message 874335 - Posted: 10 Mar 2009, 15:27:45 UTC

I see "ap_validate2" is not running again and "waiting for validation" is increasing. Hopefully will be looked at on today's downtime. Had been working for a while recently.
ID: 874335 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 874398 - Posted: 10 Mar 2009, 20:40:04 UTC - in response to Message 874335.  

I see "ap_validate2" is not running again and "waiting for validation" is increasing. Hopefully will be looked at on today's downtime. Had been working for a while recently.

Then it appears the AP team still hasn't found/fixed the problem. Matt mentioned last week that the script that builds the server status page goes through every 10 minutes and restarts all the processes that have issues. ap_validate2 is specifically for ap_v5, and it gets a few jobs done, and then segfaults, and doesn't get "kicked" until the script runs through again. It is doing something, just not as much as it could be doing.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 874398 · Report as offensive
Profile ccappel
Avatar

Send message
Joined: 27 Jan 00
Posts: 362
Credit: 1,516,412
RAC: 0
United States
Message 874399 - Posted: 10 Mar 2009, 20:49:58 UTC - in response to Message 873562.  

I've got:

10 pending
3 granted
5 cleared and gone
19 in-progress/queued

That's my ap_v5 experience so far.

Mine is:

12 pending
6 cleared and gone
4 in progress/cached
ID: 874399 · Report as offensive
Profile Virtual Boss*
Volunteer tester
Avatar

Send message
Joined: 4 May 08
Posts: 417
Credit: 6,440,287
RAC: 0
Australia
Message 874569 - Posted: 11 Mar 2009, 9:16:16 UTC - in response to Message 874399.  

Since 28 Feb

2 cleared and gone
7 pending
1 crunching
2 waiting
ID: 874569 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 874570 - Posted: 11 Mar 2009, 9:32:32 UTC

Look how many recordings they've loaded on the splitters (Server status page)!

Are the crew all planning to go on vacation together, or something?
ID: 874570 · Report as offensive
Profile Virtual Boss*
Volunteer tester
Avatar

Send message
Joined: 4 May 08
Posts: 417
Credit: 6,440,287
RAC: 0
Australia
Message 874573 - Posted: 11 Mar 2009, 10:30:06 UTC - in response to Message 874570.  
Last modified: 11 Mar 2009, 10:30:33 UTC

Are the crew all planning to go on vacation together, or something?


I suspect they just want more uninterrupted time to try to eliminate server issues (or eliminate those ants ... LOL).
ID: 874573 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 874601 - Posted: 11 Mar 2009, 13:23:33 UTC - in response to Message 874573.  

Are the crew all planning to go on vacation together, or something?


I suspect they just want more uninterrupted time to try to eliminate server issues (or eliminate those ants ... LOL).


I think they're trying to refocus all efforts into getting the Nitpicker running in time for their 10th anniversary. Matt has been hinting at this a couple of times in his Tech News already. Next month is the big deadline, and I don't know how much work they have ahead of them.
ID: 874601 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 874623 - Posted: 11 Mar 2009, 14:23:52 UTC

I was going to make some comment last week about there being 50+ "tapes" online. Must have gotten all that new storage working properly. I still think we need more MB splitters though. When I change my venue to non-ap_v5, I keep getting no work available or just one MB at a time. I'm pretty sure I'm not the only one.

Still wondering why all I get is ap_v5 even though all are selected. Seems like the scheduler doesn't want me to have anything else..
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 874623 · Report as offensive
Rudy
Volunteer tester

Send message
Joined: 23 Jun 99
Posts: 189
Credit: 794,998
RAC: 0
Canada
Message 874629 - Posted: 11 Mar 2009, 14:35:34 UTC - in response to Message 874623.  
Last modified: 11 Mar 2009, 14:38:30 UTC

Still wondering why all I get is ap_v5 even though all are selected. Seems like the scheduler doesn't want me to have anything else..


Do re-issued tasks get priority when work is being sent out?

It could be that the thousands of APv5's getting quota trashed every night are clogging the pipes.

/edit, Not sure more splitters are needed since the ready to send is usually full. Perhaps more feeder cache space.
ID: 874629 · Report as offensive
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · Next

Message boards : Number crunching : Panic Mode On (13) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.