Panic Mode On (58) Server problems?

Frizz
Volunteer tester
Joined: 17 May 99
Posts: 271
Credit: 5,852,934
RAC: 0
New Zealand
Message 1160735 - Posted: 9 Oct 2011, 21:45:45 UTC - in response to Message 1160732.  

OK, I understand. Thanks Claggy for your explanation.

The BOINC server system will always be a mystery for me :)
ID: 1160735 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1160743 - Posted: 9 Oct 2011, 22:00:58 UTC - in response to Message 1160735.  

OK, I understand. Thanks Claggy for your explanation.

The BOINC server system will always be a mystery for me :)

It's understandable :) Your question has been asked many times, and I've suggested a little notice on the server page explaining this (rather important) issue, but I guess it won't happen...
ID: 1160743 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1160753 - Posted: 9 Oct 2011, 22:47:21 UTC - in response to Message 1160661.  

Jeff Cobb just posted this in the down loading new work units thread

The server anakin crashed. So I built a 64-bit version of the transitioner and deployed it on synergy. Anakin is also a download server (the other being bane) via DNS. I do not want to restart the lab-wide DNS server remotely on a Sunday, so I hope this will sort itself out until someone can restart anakin (I am not back to work until Tuesday). DL traffic is maxed out in any case.

For those that like to fiddle with these things, the working download server - bane this time - is 208.68.240.13

That's the other way round from usual, so anyone who still has .18 in their hosts file will be stuck until Tuesday.
ID: 1160753 · Report as offensive
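
For anyone checking whether an old hosts-file override is what is blocking their downloads, here is a minimal Python sketch. It only scans hosts-file text for lines that pin a name to an address other than the working one (208.68.240.13, from the post above). The hostname `download.example.org` is a hypothetical placeholder, since the thread does not name the download host, and the sample writes ".18" out as 208.68.240.18 on the assumption that it shares the same prefix.

```python
import ipaddress


def stale_host_entries(hosts_text, hostname, working_ip):
    """Return (line_number, ip) pairs that pin hostname to an address other than working_ip."""
    stale = []
    for number, line in enumerate(hosts_text.splitlines(), start=1):
        # Drop trailing comments, then split into "address name [alias ...]".
        entry = line.split("#", 1)[0].split()
        if len(entry) < 2:
            continue
        ip, names = entry[0], entry[1:]
        try:
            ipaddress.ip_address(ip)
        except ValueError:
            continue  # not a valid address, so not a usable hosts entry
        if hostname.lower() in (name.lower() for name in names) and ip != working_ip:
            stale.append((number, ip))
    return stale


# Hypothetical sample: the hostname and the full .18 address are assumptions.
SAMPLE_HOSTS = """\
127.0.0.1 localhost
# pinned during an earlier outage
208.68.240.18 download.example.org
"""

print(stale_host_entries(SAMPLE_HOSTS, "download.example.org", "208.68.240.13"))
```

To check a real machine, read the contents of your own hosts file into a string and pass it in place of the sample. Any line reported is a candidate for removal or for updating to the working address.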
Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1160771 - Posted: 9 Oct 2011, 23:53:31 UTC - in response to Message 1160753.  
Last modified: 9 Oct 2011, 23:54:11 UTC

Jeff Cobb just posted this in the down loading new work units thread

The server anakin crashed. So I built a 64-bit version of the transitioner and deployed it on synergy. Anakin is also a download server (the other being bane) via DNS. I do not want to restart the lab-wide DNS server remotely on a Sunday, so I hope this will sort itself out until someone can restart anakin (I am not back to work until Tuesday). DL traffic is maxed out in any case.

For those that like to fiddle with these things, the working download server - bane this time - is 208.68.240.13

That's the other way round from usual, so anyone who still has .18 in their hosts file will be stuck until Tuesday.


Thanks Richard, always useful to know. I got work uploaded and downloaded, so no problem here.
ID: 1160771 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13732
Credit: 208,696,464
RAC: 304
Australia
Message 1160846 - Posted: 10 Oct 2011, 6:21:17 UTC - in response to Message 1160693.  
Last modified: 10 Oct 2011, 6:21:52 UTC


All I'm getting with my work requests is "Project has no tasks available". Hopefully it'll settle down over the next couple of hours.


Curious since cricket graph is active but not maxed out and there are plenty of results available (both MB and AP), but the numbers available are decreasing suggesting that they are being sent somewhere.

Every now & then I've been getting a few WUs, but most requests result in "No tasks sent" or "Project has no tasks available" messages.
Looking at the graphs, the traffic is maxed out but jagged, indicating it's not keeping up with the load all the time. And the rate at which the Ready to Send buffer is shrinking is very slow; you'd normally get a steeper slope.

But at least we're getting some work till they can get things sorted out on Tuesday.
Grant
Darwin NT
ID: 1160846 · Report as offensive
Kevin Olley

Joined: 3 Aug 99
Posts: 906
Credit: 261,085,289
RAC: 572
United Kingdom
Message 1160867 - Posted: 10 Oct 2011, 8:12:18 UTC

No panic.

I have noticed that I have got that old ghost problem again, ATM 111 of them.

I am sure that others are affected too.

If possible, could you turn "Resend Lost Tasks" back on before all these ghosts start wreaking havoc in your server cabinet :-)



Kevin


ID: 1160867 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Joined: 7 Mar 03
Posts: 22190
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1160872 - Posted: 10 Oct 2011, 8:38:06 UTC

Grant - remember the delivery pipe only holds 100 WU despite the delivery pool having about 900,000 WU available. The delivery pipe fills and empties very rapidly, but with so many hungry crunchers out here it spends a lot of time filling compared to having tasks available.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1160872 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13732
Credit: 208,696,464
RAC: 304
Australia
Message 1160877 - Posted: 10 Oct 2011, 9:08:05 UTC - in response to Message 1160872.  

Grant - remember the delivery pipe only holds 100 WU despite the delivery pool having about 900,000 WU available. The delivery pipe fills and empties very rapidly, but with so many hungry crunchers out here it spends a lot of time filling compared to having tasks available.

Yep, and when things are running normally the feeder reloads much faster than it is at the moment. The slow rate at which the Ready to Send buffer is shrinking & the slow rate at which Work in Progress is climbing show that is the case; things aren't running as well as they usually do. But at least they are running.
Grant
Darwin NT
ID: 1160877 · Report as offensive
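
The effect discussed in the two posts above (a small fixed-size delivery buffer in front of a huge pool, refilled by the feeder) can be illustrated with a toy model. This is only a sketch under stated assumptions, not how the BOINC feeder is implemented: the 100-slot size comes from the post, while the refill rate, the request rate, and the rule that each request takes exactly one task are invented for illustration.

```python
def simulate_feeder(slots=100, refill_per_tick=20, requests_per_tick=30, ticks=50):
    """Return the fraction of requests that found the buffer empty."""
    available = slots  # start with a full buffer
    refused = 0
    total = 0
    for _ in range(ticks):
        # The feeder tops the buffer up, but never beyond its fixed size,
        # no matter how large the pool behind it is.
        available = min(slots, available + refill_per_tick)
        for _ in range(requests_per_tick):
            total += 1
            if available > 0:
                available -= 1  # assumption: one task handed out per request
            else:
                refused += 1  # the "Project has no tasks available" case
    return refused / total


print(simulate_feeder())                    # refill slower than demand
print(simulate_feeder(refill_per_tick=30))  # refill keeps pace with demand
```

In this model the size of the pool never appears at all: once the buffer has drained, the share of refused requests depends only on how fast the feeder refills relative to how fast requests arrive, which is the point both posts are making.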
Raistmer
Volunteer developer
Volunteer tester
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1160947 - Posted: 10 Oct 2011, 15:39:04 UTC

Each server request takes minutes to complete...
ID: 1160947 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Joined: 7 Mar 03
Posts: 22190
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1160948 - Posted: 10 Oct 2011, 15:48:14 UTC

They may take minutes, but at least they do complete, eventually............
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1160948 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1160967 - Posted: 10 Oct 2011, 16:28:11 UTC

As long as things hang together in their cobbled state until tomorrow's outage, I'd say it's working pretty well.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1160967 · Report as offensive
Brkovip
Joined: 18 May 99
Posts: 274
Credit: 144,414,367
RAC: 0
United States
Message 1160973 - Posted: 10 Oct 2011, 16:39:05 UTC

I haven't been able to get tasks again. This trouble with the router is getting very old. I am unable to ping any of the servers but 128.32.18.150 setiathome.ssl.berkeley.edu. Everything else has been dead to the world from my end this whole last week.
ID: 1160973 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1160974 - Posted: 10 Oct 2011, 16:42:51 UTC - in response to Message 1160973.  

I haven't been able to get tasks again. This trouble with the router is getting very old. I am unable to ping any of the servers but 128.32.18.150 setiathome.ssl.berkeley.edu. Everything else has been dead to the world from my end this whole last week.

Well, the hope is that the new RAM for the router will arrive soon, and that access to the secure cage the router is in can be arranged so the upgrade can be done soon as well.
Then we all have to cross our fingers and see if the increased RAM does indeed solve the issue.
Otherwise, they are off on another troubleshooting session with HE, I suppose.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1160974 · Report as offensive
SupeRNovA
Volunteer tester
Joined: 25 Oct 04
Posts: 131
Credit: 12,741,814
RAC: 0
Bulgaria
Message 1160987 - Posted: 10 Oct 2011, 17:31:17 UTC

I have been downloading SETI units like mad for the past 24 hours. My cache is FULL and the connection to the server is PERFECT. I couldn't believe it last night when I was downloading at 200-300 kilobytes per second. The connection is still perfect, and when I send units back the server sends new ones.

Thank you !
ID: 1160987 · Report as offensive
W5DMG - Dave

Joined: 19 May 99
Posts: 155
Credit: 33,162,251
RAC: 0
United States
Message 1161005 - Posted: 10 Oct 2011, 18:52:24 UTC - in response to Message 1160987.  

Help, I cannot seem to get any CUDA work, maybe a total of 20 tasks in the past 2 weeks.
ID: 1161005 · Report as offensive
HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1161006 - Posted: 10 Oct 2011, 18:56:56 UTC - in response to Message 1161005.  

Help, I cannot seem to get any CUDA work, maybe a total of 20 tasks in the past 2 weeks.

Over the past few weeks there have been major issues with the connection and servers. Things seem to be starting to get back to normal. One of your computers has downloaded over 100 tasks to process on your CUDA device today.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[/url]
ID: 1161006 · Report as offensive
Frizz
Volunteer tester
Joined: 17 May 99
Posts: 271
Credit: 5,852,934
RAC: 0
New Zealand
Message 1161012 - Posted: 10 Oct 2011, 19:08:27 UTC

I can't up- or download.

But with ap_validate3 down for several days now, my AP units won't get validated anyway.

Frustrating ...

ID: 1161012 · Report as offensive
Belthazor
Volunteer tester
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1161026 - Posted: 10 Oct 2011, 19:51:44 UTC - in response to Message 1161012.  

I can't up- or download.


Which kind of error? What do you see in the log "ctrl+shift+E"?
ID: 1161026 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1161028 - Posted: 10 Oct 2011, 19:54:43 UTC - in response to Message 1161026.  

I can't up- or download.


Which kind of error? What do you see in the log "ctrl+shift+E"?

Frizz is running BOINC 6.10.60 on his hosts, so "ctrl+shift+E" won't work, but "ctrl+shift+M" will.

Claggy
ID: 1161028 · Report as offensive
MikeN

Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1161043 - Posted: 10 Oct 2011, 20:41:42 UTC

Jeff Cobb just posted this in the down loading new work units thread

The server anakin crashed. So I built a 64-bit version of the transitioner and deployed it on synergy. Anakin is also a download server (the other being bane) via DNS. I do not want to restart the lab-wide DNS server remotely on a Sunday, so I hope this will sort itself out until someone can restart anakin (I am not back to work until Tuesday). DL traffic is maxed out in any case.


Ever since Jeff rigged up the temporary fix yesterday, I have had a much better connection than normal. Downloads come through at 10x normal speed with fewer backoffs than usual. Also, the cricket graph has consistently been at maximum and there are plenty of WUs available for MB and AP. I currently have 48 APs downloaded, which is around 40 more than I usually get between my machines. Any chance that this temporary fix could be made permanent?
ID: 1161043 · Report as offensive
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.