Panic Mode On (70) Server problems?

Message boards : Number crunching : Panic Mode On (70) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 9 · Next

AuthorMessage
Highlander
Avatar

Send message
Joined: 5 Oct 99
Posts: 167
Credit: 37,987,668
RAC: 16
Germany
Message 1202897 - Posted: 6 Mar 2012, 8:15:47 UTC

now after midnight berkeley time, i get also "internal server error" for the first time, looks like some automatic scripts/processes had been re/started.
- Performance is not a simple linear function of the number of CPUs you throw at the problem. -
ID: 1202897 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1202898 - Posted: 6 Mar 2012, 8:18:07 UTC - in response to Message 1202895.  

Nothing much anyone can do with the problem except wait until tomorrow's outage by which time the guy's in the lab should have it sorted out. ;)

Cheers.
ID: 1202898 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1202899 - Posted: 6 Mar 2012, 8:19:23 UTC
Last modified: 6 Mar 2012, 8:20:34 UTC

Well, I just hope the limited caches on my best rigs will keep them going until after tomorrow's outage, when hopefully they shall fix whatever malaise has overtaken the servers.
Top rig has some 700 completed WUs that the kitties would like to phone home.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1202899 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1202900 - Posted: 6 Mar 2012, 8:26:34 UTC - in response to Message 1202899.  

At least with my little 9800GT's/GTX+ I'm good for a another couple of days.

Cheers.
ID: 1202900 · Report as offensive
musicplayer

Send message
Joined: 17 May 10
Posts: 2430
Credit: 926,046
RAC: 0
Message 1202903 - Posted: 6 Mar 2012, 9:04:13 UTC

I am wondering what it is I am trying to report all the time.
ID: 1202903 · Report as offensive
Profile cliff
Avatar

Send message
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1202904 - Posted: 6 Mar 2012, 9:11:25 UTC - in response to Message 1202688.  

Hi Richard,
I have that problem as well. It follows a 36hr+ service outage in my area:-(
I've just managed to get back online, and s@h returns that whenever it tries to poll the server.
But I did'nt know if this was related to the outage at my end or something at seti's end..
Nice to know it isnt at my end.

Still I cant report tasks completed or get new ones:-/
Any idea whats up and when it might get sorted?

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1202904 · Report as offensive
Profile cliff
Avatar

Send message
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1202905 - Posted: 6 Mar 2012, 9:13:15 UTC - in response to Message 1202903.  


Hi,
Boinc is 'probably' trying to report how many tasks have been completed and ask for new ones..

At least on my end thats what its trying to report.. completed tasks.

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1202905 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1202906 - Posted: 6 Mar 2012, 9:26:05 UTC

cc_config.xml file with <max_tasks_reported> made the trick. Sweet spot is 28 on my machine.


- Best regards! - Sutaru Tsureku, team seti.international founder. - Optimize your PC for higher RAC. - SETI@home needs your help. -
ID: 1202906 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1202909 - Posted: 6 Mar 2012, 9:36:12 UTC

OK, to sum up my experience so far.

I had a number of tasks to report (~30 per machine) when we came back from the electrical testing outage, around 20:30 UTC yesterday. I could upload them OK, but got 'internal server error' when I tried to report them.

Slower machines, with fewer tasks to report, seemed OK. After some experimentation, it seems that setting a maximum number of tasks to report at a time - I chose 10 - overcame the server error. You can do that with BOINC v6.12 and above, but not with v6.10 or below.

Once I got all the tasks to report, I was able to get new work as well - but I'm only asking for MB CUDA at the moment, can't speak for any other work.

But the last time I was allocated any work was 01:16 UTC this morning. Every request since then has just got a simple 'project has no tasks available' - which doesn't match the Server Status Page.

And there the story rests. No doubt we'll work out more of the picture as the day wears on, so we can present the staff with a full diagnosis when they report for maintenance duties this afternoon ;-)
ID: 1202909 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1202912 - Posted: 6 Mar 2012, 9:54:51 UTC

May I point out that the 284k ready to send value is 17h old?
Result creation rate (being updated) is virtually zero.

Servers etc.may be showing green but something is clearly not working.

I doubt anything will move before maintenance.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1202912 · Report as offensive
Profile shizaru
Volunteer tester
Avatar

Send message
Joined: 14 Jun 04
Posts: 1130
Credit: 1,967,904
RAC: 0
Greece
Message 1202914 - Posted: 6 Mar 2012, 10:02:40 UTC - in response to Message 1202909.  
Last modified: 6 Mar 2012, 10:05:00 UTC


But the last time I was allocated any work was 01:16 UTC this morning. Every request since then has just got a simple 'project has no tasks available'

- which doesn't match the Server Status Page.


Same here. Last download @01:09 UTC. Only diff is I've been asking for CPU work too (and not getting any)... Are the stats on the Server Status Page 17hrs old? If so, maybe the servers have run dry and we just can't see it.

Edit: Beaten by Lady Luck
ID: 1202914 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1202916 - Posted: 6 Mar 2012, 10:05:20 UTC - in response to Message 1202914.  


But the last time I was allocated any work was 01:16 UTC this morning. Every request since then has just got a simple 'project has no tasks available'

- which doesn't match the Server Status Page.


Same here. Last download @01:09 UTC. Only diff is I've been asking for CPU work too (and not getting any)... Are the stats on the Server Status Page 17hrs old? If so, maybe the servers have run dry and we just can't see it.


Most likely - resends are being issued so that bit (feeder/scheduler) isn't dead.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1202916 · Report as offensive
AndrewM
Volunteer tester

Send message
Joined: 5 Jan 08
Posts: 369
Credit: 34,275,196
RAC: 0
Australia
Message 1202929 - Posted: 6 Mar 2012, 10:41:45 UTC

Is this the first time they've used Synergy for Scheduling?
And I'm out of GPU work.
AndrewM
ID: 1202929 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1202930 - Posted: 6 Mar 2012, 10:44:01 UTC - in response to Message 1202916.  


But the last time I was allocated any work was 01:16 UTC this morning. Every request since then has just got a simple 'project has no tasks available'

- which doesn't match the Server Status Page.


Same here. Last download @01:09 UTC. Only diff is I've been asking for CPU work too (and not getting any)... Are the stats on the Server Status Page 17hrs old? If so, maybe the servers have run dry and we just can't see it.


Most likely - resends are being issued so that bit (feeder/scheduler) isn't dead.

Seti Beta is still alive too, just reported 35 tasks there no problem, but can't do 79 tasks here,

Claggy
ID: 1202930 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1202933 - Posted: 6 Mar 2012, 11:02:23 UTC - in response to Message 1202929.  

Is this the first time they've used Synergy for Scheduling?
And I'm out of GPU work.

Good spot. I noticed a flicker - a change in that column when I refreshed the page - last night, but I can't remember which host was doing scheduling before. The WayBack Machine says Bane was doing it on 15 July 2011, but that probably doesn't help much...
ID: 1202933 · Report as offensive
Profile shizaru
Volunteer tester
Avatar

Send message
Joined: 14 Jun 04
Posts: 1130
Credit: 1,967,904
RAC: 0
Greece
Message 1202936 - Posted: 6 Mar 2012, 11:31:15 UTC - in response to Message 1202933.  

Is this the first time they've used Synergy for Scheduling?
And I'm out of GPU work.

Good spot. I noticed a flicker - a change in that column when I refreshed the page - last night, but I can't remember which host was doing scheduling before. The WayBack Machine says Bane was doing it on 15 July 2011, but that probably doesn't help much...


Bane [As of 4 Mar 2012 | 1:00:05 UTC]

Link won't be good for long...
SSP Bing cached
ID: 1202936 · Report as offensive
ChrisSibbald

Send message
Joined: 23 Jul 11
Posts: 18
Credit: 23,582,502
RAC: 0
Canada
Message 1202941 - Posted: 6 Mar 2012, 12:13:05 UTC - in response to Message 1202804.  

I was still having the issue (Internal server error) this morning on two of my 6 machines. Reporting in batches of 10 (via cc_config.xml parameter) permitted me to report all tasks. Thanks for the tip. Cheers, Chris
ID: 1202941 · Report as offensive
Sakletare
Avatar

Send message
Joined: 18 May 99
Posts: 132
Credit: 23,423,829
RAC: 0
Sweden
Message 1202947 - Posted: 6 Mar 2012, 12:46:18 UTC - in response to Message 1202929.  

Is this the first time they've used Synergy for Scheduling?

Perhaps they are already making a hole for the new servers by taking some of the slower servers out of the mix.
ID: 1202947 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1202963 - Posted: 6 Mar 2012, 13:24:59 UTC - in response to Message 1202947.  

Is this the first time they've used Synergy for Scheduling?

Perhaps they are already making a hole for the new servers by taking some of the slower servers out of the mix.

That could be, but it is 50/50 on bruno going belly up. Until we are told by the guys we won't really know. Should we starting a betting pool on what happened with the proceeds to go to the guys as a donation?
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1202963 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1202964 - Posted: 6 Mar 2012, 13:25:51 UTC - in response to Message 1202946.  

If you use BoincRescheduler you and your wing men will receive less credit for the WU, something like (1/3 or less credit per WU or even less). I don´t know if there are any way to avoid that.
ID: 1202964 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 9 · Next

Message boards : Number crunching : Panic Mode On (70) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.