Panic Mode On (37) Server problems

Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1029970 - Posted: 31 Aug 2010, 14:34:46 UTC - in response to Message 1029967.  

Just noticed I got two Validate errors that tried to validate at the time the system was down. Hope the boys can do something about those too when they come in.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1029970 · Report as offensive
ded1o1

Joined: 29 Sep 07
Posts: 68
Credit: 10,834,919
RAC: 0
Australia
Message 1029971 - Posted: 31 Aug 2010, 14:36:10 UTC - in response to Message 1029950.  
Last modified: 31 Aug 2010, 14:46:08 UTC

Something wrong with the upload server? Why is the file system "Read-only"? What happened there?


Looks like we are having our regular pre-3-day-outage upload outage :p

I'd guess that the file system has filled up again.

PS. Probably a good time to select NAS (Network Activity Suspended) until after the outage.
ID: 1029971 · Report as offensive
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1029973 - Posted: 31 Aug 2010, 14:48:01 UTC - in response to Message 1029970.  

Just noticed I got two Validate errors that tried to validate at the time the system was down. Hope the boys can do something about those too when they come in.

It happened to me, too. On this WU. That's what sent me to the Cricket graphs and the Server Status page.
Donald
Infernal Optimist / Submariner, retired
ID: 1029973 · Report as offensive
Urs Echternacht
Volunteer tester
Joined: 15 May 99
Posts: 692
Credit: 135,197,781
RAC: 211
Germany
Message 1029979 - Posted: 31 Aug 2010, 15:36:19 UTC

Thanks for explaining what happened. It brings (my) panic mode back down to normal operations, awaiting this week's outage.
_\|/_
U r s
ID: 1029979 · Report as offensive
Dave

Joined: 29 Mar 02
Posts: 778
Credit: 25,001,396
RAC: 0
United Kingdom
Message 1029995 - Posted: 31 Aug 2010, 16:41:05 UTC

I am NAS + NNT on all machines. Sounds like a good plan to me. Something tells me I will actually get more done now that the client isn't pointlessly wasting time on all those retries.
ID: 1029995 · Report as offensive
Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1029997 - Posted: 1 Sep 2010, 17:14:31 UTC

YAY!!! We're back, we're back!!!


PROUD MEMBER OF Team Starfire World BOINC
ID: 1029997 · Report as offensive
Profile James Sotherden
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1030204 - Posted: 2 Sep 2010, 16:35:39 UTC

Well, all in all it doesn't look like too bad of an outage. Just checked all my machines, and two will run out some time tomorrow morning.

I only set the i7 to NNT, as that's the only machine that gets ghosts (so far).

I have a question about Astropulse: does completing the 10 work units for a realistic DCF apply to one machine or to all your hosts?

Old James
ID: 1030204 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1030240 - Posted: 2 Sep 2010, 18:49:25 UTC - in response to Message 1030204.  

...
I have a question about Astropulse: does completing the 10 work units for a realistic DCF apply to one machine or to all your hosts?

Just one. The whole point of that per-application server-side scaling is to be more detailed than the host's DCF.
                                                                  Joe
ID: 1030240 · Report as offensive
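
To illustrate the per-host answer above, here is a minimal sketch. It assumes the server keeps a separate runtime scaling factor for each (host, application) pair and only trusts it once 10 tasks have completed on that host, as the question implies. The class and method names are hypothetical, and this is not BOINC's actual scheduler code.

```python
from collections import defaultdict

# Threshold taken from the question in the thread; the real server rule may differ.
MIN_COMPLETED = 10


class EstimateScaler:
    """Hypothetical per-(host, app) runtime scaling, for illustration only."""

    def __init__(self):
        # (host, app) -> list of actual/estimated runtime ratios
        self._ratios = defaultdict(list)

    def record(self, host, app, estimated_s, actual_s):
        """Store the ratio of actual to estimated runtime for one finished task."""
        self._ratios[(host, app)].append(actual_s / estimated_s)

    def scale(self, host, app):
        """Return the mean ratio, or None until this host has enough tasks for this app."""
        ratios = self._ratios.get((host, app), [])
        if len(ratios) < MIN_COMPLETED:
            return None
        return sum(ratios) / len(ratios)
```

In this sketch, ten Astropulse tasks finished on one machine do nothing for any other machine: each host (and each application on it) has to reach the threshold on its own.
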
Profile Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1030246 - Posted: 2 Sep 2010, 19:23:09 UTC - in response to Message 1030240.  

Most people run multiple projects on multiple hosts, so it has to be different, IMO.
I have 3 quads and a laptop (C2D), all running SETI, SETI Beta and several other projects as backup. Two are doing CUDA and CUDA-FERMI, and one has 2 ATI GPUs.

ID: 1030246 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 1030419 - Posted: 3 Sep 2010, 12:13:36 UTC

Is the scheduler up and running? I keep getting a "request failed" message, yet the status page says it is. Can anyone help?
ID: 1030419 · Report as offensive
Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1030420 - Posted: 3 Sep 2010, 12:49:19 UTC - in response to Message 1030419.  

Is the scheduler up and running? I keep getting a "request failed" message, yet the status page says it is. Can anyone help?


Even though the scheduler may be up, the upload and download servers are not. Hence, the request for work failed because the server isn't getting your request.



PROUD MEMBER OF Team Starfire World BOINC
ID: 1030420 · Report as offensive
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1030430 - Posted: 3 Sep 2010, 14:42:40 UTC - in response to Message 1030420.  
Last modified: 3 Sep 2010, 14:46:01 UTC

Is the scheduler up and running? I keep getting a "request failed" message, yet the status page says it is. Can anyone help?


Even though the scheduler may be up, the upload and download servers are not. Hence, the request for work failed because the server isn't getting your request.

Looks like the Upload server and MB verifiers/assimilators/etc. just came online (0730 PDT/1430 UTC). ALL the Astropulse functions still show Disabled.

(0744 PDT) Oops - Spoke too soon. Upload server Disabled again. Must need a second cup of coffee. (8{)
Donald
Infernal Optimist / Submariner, retired
ID: 1030430 · Report as offensive
Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1030432 - Posted: 3 Sep 2010, 14:47:23 UTC - in response to Message 1030430.  
Last modified: 3 Sep 2010, 14:48:17 UTC

Yes the upload server is up but it's going to be a fight to get to it. It's being hit hard right now according to the cricket graph.

You're right, we must have broke it. :0


PROUD MEMBER OF Team Starfire World BOINC
ID: 1030432 · Report as offensive
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1030449 - Posted: 3 Sep 2010, 15:39:39 UTC - in response to Message 1030430.  

Is the scheduler up and running? I keep getting a "request failed" message, yet the status page says it is. Can anyone help?


Even though the scheduler may be up, the upload and download servers are not. Hence, the request for work failed because the server isn't getting your request.

Looks like the Upload server and MB verifiers/assimilators/etc. just came online (0730 PDT/1430 UTC). ALL the Astropulse functions still show Disabled.

(0744 PDT) Oops - Spoke too soon. Upload server Disabled again. Must need a second cup of coffee. (8{)

Having now had MY second cup of coffee, and a bit of breakfast, I wonder if the Upload server was taken back offline because the Astropulse functions are still disabled? Can Astropulse results be reported, validated, etc. if the Astropulse Science database is down?

Donald
Infernal Optimist / Submariner, retired
ID: 1030449 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1030457 - Posted: 3 Sep 2010, 16:06:09 UTC

For the last hour or two, the number of validate errors in my hosts overview has been increasing.

ID: 1030457 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1030468 - Posted: 3 Sep 2010, 16:24:40 UTC - in response to Message 1030449.  

...
Can Astropulse results be reported, validated, etc if the Astropulse Science database is down?

Yes, they can be reported and validated, but not assimilated (hence the files cannot be deleted).
                                                                  Joe
ID: 1030468 · Report as offensive
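
The dependency described above can be sketched as follows. This is an illustrative model only: the stage names and the `stages_available` function are hypothetical. The one assumption taken from the thread is that assimilation (and the file deletion that follows it) needs the science database, while reporting and validation do not.

```python
def stages_available(science_db_up):
    """Return the result-handling stages that can run, in order.

    Hypothetical model of the answer in the thread, not BOINC server code.
    """
    # Reporting and validation do not touch the science database.
    stages = ["report", "validate"]
    if science_db_up:
        # Assimilation writes into the science database; result files
        # can only be deleted once assimilation has happened.
        stages += ["assimilate", "delete_files"]
    return stages


print(stages_available(False))
print(stages_available(True))
```

With the science database down, results would pile up after validation, which matches the remark that the files cannot be deleted.
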
Bernd Noessler

Joined: 15 Nov 09
Posts: 99
Credit: 52,635,434
RAC: 0
Germany
Message 1030470 - Posted: 3 Sep 2010, 16:26:56 UTC - in response to Message 1030457.  

@Sutaru
Same here.
And the pendings are going down. The tasks were reported in August/July
and are now being marked invalid.
ID: 1030470 · Report as offensive
Profile hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1030473 - Posted: 3 Sep 2010, 16:35:53 UTC

Glad I kept my cache higher than 3 days...
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 1030473 · Report as offensive
Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1030475 - Posted: 3 Sep 2010, 16:46:01 UTC - in response to Message 1030457.  

For the last hour or two, the number of validate errors in my hosts overview has been increasing.


Looks like I only picked up one more Validate error. I had two from the 31st that I picked up as they were trying to get things shut down. Guess they weren't ready when the switch got flipped this morning and a few got through before they could get it shut down again. Hopefully they will be able to correct them when they get going again.



PROUD MEMBER OF Team Starfire World BOINC
ID: 1030475 · Report as offensive
Profile hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1030477 - Posted: 3 Sep 2010, 16:49:24 UTC

I hope Friday morning problems aren't going to be the norm... Seems last week or the week before they had problems restarting...
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 1030477 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.