Panic Mode On (93) Server Problems?

Message boards : Number crunching : Panic Mode On (93) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 . . . 24 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1619708 - Posted: 28 Dec 2014, 9:13:22 UTC - in response to Message 1619680.  

External symptoms are the same as for the last crash caused by the faulty/unidentifiable RAID card.
Scheduler requests work, but can't request new work as I have WUs to download, but they won't download. Uploads still going through.
Grant
Darwin NT
ID: 1619708 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1619710 - Posted: 28 Dec 2014, 9:21:33 UTC - in response to Message 1619708.  

External symptoms are the same as for the last crash caused by the faulty/unidentifiable RAID card.
Scheduler requests work, but can't request new work as I have WUs to download, but they won't download. Uploads still going through.

Work is still being allocated (though we're now down to extra replications of failed WUs, rather than newly split WUs) - if you happen to request it during one of the periodic retries of the stalled download. But no sign of download life from either server.
ID: 1619710 · Report as offensive
Highlander
Avatar

Send message
Joined: 5 Oct 99
Posts: 167
Credit: 37,987,668
RAC: 16
Germany
Message 1619719 - Posted: 28 Dec 2014, 10:15:47 UTC

Look on the bright side: Servers are now able to build a "Ready to send" buffer :-) (if no harm to the saving raid had happened, but more unlikly this is on 2 servers at the same time...)
- Performance is not a simple linear function of the number of CPUs you throw at the problem. -
ID: 1619719 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1619763 - Posted: 28 Dec 2014, 12:46:20 UTC - in response to Message 1619680.  

Yep, http://setiathome.berkeley.edu/sah_status.html is stuck at 6:00:06 UTC too, so we can't tell if the download servers have changed status. 208.68.240.13 which I think is vader does respond to pings, but 208.68.240.21 (georgem) does not.
                                                                   Joe

Seti Beta with it's new Server Status page is still updating, and has:

Download server boinc2.ssl.berkeley.edu Not Running

Database schema version: 20932
Remote daemon status as of 28 Dec 2014, 12:40:15 UTC


Claggy
ID: 1619763 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1619796 - Posted: 28 Dec 2014, 15:33:35 UTC

12/28/2014 6:42:56 AM | SETI@home | Sending scheduler request: To fetch work.
12/28/2014 6:42:56 AM | SETI@home | Requesting new tasks for CPU and NVIDIA
12/28/2014 6:42:58 AM | SETI@home | Scheduler request completed: got 0 new tasks
12/28/2014 6:42:58 AM | SETI@home | Project has no tasks available
12/28/2014 6:49:48 AM | SETI@home | Project has no tasks available
12/28/2014 6:54:56 AM | SETI@home | Project has no tasks available
12/28/2014 7:04:04 AM | SETI@home | Project has no tasks available
12/28/2014 7:19:12 AM | SETI@home | Project has no tasks available
12/28/2014 7:24:20 AM | SETI@home | Project has no tasks available
12/28/2014 7:32:53 AM | SETI@home | No tasks are available for AstroPulse v6
12/28/2014 7:32:53 AM | SETI@home | No tasks are available for AstroPulse v7
12/28/2014 7:38:01 AM | SETI@home | Project has no tasks available
12/28/2014 7:44:09 AM | SETI@home | Project has no tasks available
12/28/2014 7:49:17 AM | SETI@home | Project has no tasks available
12/28/2014 7:59:26 AM | SETI@home | Project has no tasks available
12/28/2014 8:17:09 AM | SETI@home | Project has no tasks available
12/28/2014 8:22:17 AM | SETI@home | Project has no tasks available

"It's Dead, Jim......."

"Sour Grapes make a bitter Whine." <(0)>
ID: 1619796 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1619799 - Posted: 28 Dec 2014, 15:52:57 UTC

Yeah it's dead i got a gazillion WU's waiting to download guess we have to wait till someone revives the server
I came down with a bad case of i don't give a crap
ID: 1619799 · Report as offensive
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 1619802 - Posted: 28 Dec 2014, 16:06:23 UTC - in response to Message 1619763.  

Uploads on Beta are now failing (transient HTTP error), they were working earlier when there were already problems on main ...

Tom

Yep, http://setiathome.berkeley.edu/sah_status.html is stuck at 6:00:06 UTC too, so we can't tell if the download servers have changed status. 208.68.240.13 which I think is vader does respond to pings, but 208.68.240.21 (georgem) does not.
                                                                   Joe

Seti Beta with it's new Server Status page is still updating, and has:

Download server boinc2.ssl.berkeley.edu Not Running

Database schema version: 20932
Remote daemon status as of 28 Dec 2014, 12:40:15 UTC


Claggy

ID: 1619802 · Report as offensive
Rymorea
Volunteer tester
Avatar

Send message
Joined: 14 Feb 14
Posts: 54
Credit: 3,840,646
RAC: 0
Turkey
Message 1619804 - Posted: 28 Dec 2014, 16:09:44 UTC

no download in last 24 hours problem continues
Seti@home Classic account User ID 955 member since 8 Sep 1999 classic CPU time 539,770 hours

ID: 1619804 · Report as offensive
Profile ReiAyanami
Avatar

Send message
Joined: 6 Dec 05
Posts: 116
Credit: 222,900,202
RAC: 174
Japan
Message 1619807 - Posted: 28 Dec 2014, 16:13:30 UTC
Last modified: 28 Dec 2014, 16:23:19 UTC

GPU WU lasted 10 hours since the last DL. Now left with only 36 CPU WU. ~9 hours to crunch. Good 3 weeks?
ID: 1619807 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1619809 - Posted: 28 Dec 2014, 16:24:28 UTC - in response to Message 1619763.  

Yep, http://setiathome.berkeley.edu/sah_status.html is stuck at 6:00:06 UTC too, so we can't tell if the download servers have changed status. 208.68.240.13 which I think is vader does respond to pings, but 208.68.240.21 (georgem) does not.
                                                                   Joe

Seti Beta with it's new Server Status page is still updating, and has:

Download server boinc2.ssl.berkeley.edu Not Running

Database schema version: 20932
Remote daemon status as of 28 Dec 2014, 12:40:15 UTC


Claggy

Not Running

Sorta like all those Macs that are only being Allowed about a dozen GPU APs to be cached.
Now they are All,
Not Running
ID: 1619809 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1619810 - Posted: 28 Dec 2014, 16:24:41 UTC


I came down with a bad case of i don't give a crap
ID: 1619810 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1619811 - Posted: 28 Dec 2014, 16:26:44 UTC

R.I.P.
ID: 1619811 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1619816 - Posted: 28 Dec 2014, 16:54:50 UTC - in response to Message 1619809.  
Last modified: 28 Dec 2014, 16:55:03 UTC

Sorta like all those Macs that are only being Allowed about a dozen GPU APs to be cached.

Can you get a full Cache of APv7 GPU work when you're running Stock?

Claggy
ID: 1619816 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1619818 - Posted: 28 Dec 2014, 17:12:58 UTC - in response to Message 1619816.  

Sorta like all those Macs that are only being Allowed about a dozen GPU APs to be cached.

Can you get a full Cache of APv7 GPU work when you're running Stock?

Claggy

Most of these people are running Stock, so, I would say No;
http://setiathome.berkeley.edu/results.php?hostid=7237623&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=7285978&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=7276096&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=7242334&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=2991797&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=7309734&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=3265309&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=7337211&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=6797736&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=6157711&offset=0&show_names=0&state=0&appid=20
http://setiathome.berkeley.edu/results.php?hostid=5754631&offset=0&show_names=0&state=0&appid=20
It appears I am able to cache at least 50 CPU APs, I haven't tried any more than that. The Problem is with the GPU APs. After reaching the limit of around 20 GPU APs, I had the server send 20 CPU APs back as GPU APs raising it to 40 GPU APs. The Server then Refused to send anymore GPU APs until the total dropped back to about a dozen GPU APs. This Suggests the server has been HARD WIRED to limit Mac GPU APs to around 10-20.

Now why would someone Hard Wire the server to limit certain Mac GPU APs to a fraction of the 100 per GPU?
My suggestion would be to at least change the cache priority to fill the GPU cache First. That way, maybe the server will send more GPU APs to the machines.
ID: 1619818 · Report as offensive
Profile KWSN Ekky Ekky Ekky
Avatar

Send message
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1619819 - Posted: 28 Dec 2014, 17:21:03 UTC

Woah! Someone tweaked something. Piles of downloads all happening at once!
Thank you very much whomsoever is rfesponsible.

ID: 1619819 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1619820 - Posted: 28 Dec 2014, 17:37:59 UTC

It´s Alive!
ID: 1619820 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1619821 - Posted: 28 Dec 2014, 17:45:54 UTC

Got some work but it can't download.

28/12/2014 18:42:39 | SETI@home | Temporarily failed download of ap_08oc14ad_B0_P0_00012_20141228_05222.wu: connect() failed
28/12/2014 18:42:39 | SETI@home | Backing off 00:11:27 on download of ap_08oc14ad_B0_P0_00012_20141228_05222.wu
ID: 1619821 · Report as offensive
Phil Burden

Send message
Joined: 26 Oct 00
Posts: 264
Credit: 22,303,899
RAC: 0
United Kingdom
Message 1619822 - Posted: 28 Dec 2014, 17:48:34 UTC - in response to Message 1619821.  

Got some work but it can't download.

28/12/2014 18:42:39 | SETI@home | Temporarily failed download of ap_08oc14ad_B0_P0_00012_20141228_05222.wu: connect() failed
28/12/2014 18:42:39 | SETI@home | Backing off 00:11:27 on download of ap_08oc14ad_B0_P0_00012_20141228_05222.wu


Likewise, 53 waiting to download, all getting backed off for 4 hrs or so.

p>
ID: 1619822 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1619823 - Posted: 28 Dec 2014, 17:49:31 UTC


I came down with a bad case of i don't give a crap
ID: 1619823 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1619825 - Posted: 28 Dec 2014, 17:55:23 UTC

download server 1 georgem Disabled

Hope nothing serious...
ID: 1619825 · Report as offensive
Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (93) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.