Double crash... vader still down...

Message boards : Technical News : Double crash... vader still down...

Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 991016 - Posted: 22 Apr 2010, 4:28:46 UTC

We had a couple of problems tonight. ptolemy, our main file server for user accounts, went down at about 5:05pm. Of course that's 5 minutes after Matt and Jeff left, so that left me as the default sysadmin. They're both more patient than I am and are less likely to just pull the plug out of the wall.

So I rebooted ptolemy, and it crashed again about 5 seconds after it came back up. And again. And again. Eventually I figured out that vader was trying to do a lot of writes to ptolemy and that was causing the crash.

I couldn't get vader to respond to anything, so I just pulled the plug out of the wall. I tried a few times to restart it, but it just hangs during the boot process. So our assimilators are down, among other things. We may run out of work at some point.


Hopefully Matt or Jeff will fix it tomorrow.

@SETIEric@qoto.org (Mastodon)

ID: 991016 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 991060 - Posted: 22 Apr 2010, 8:24:22 UTC - in response to Message 991016.  

Thanks for the update Eric,

Claggy
ID: 991060 · Report as offensive
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 991083 - Posted: 22 Apr 2010, 12:21:23 UTC

Thanks Eric.
ID: 991083 · Report as offensive
Profile Gary Charpentier (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 25 Dec 00
Posts: 30593
Credit: 53,134,872
RAC: 32
United States
Message 991095 - Posted: 22 Apr 2010, 13:44:59 UTC

Thanks for the update Eric. Any news is appreciated.
ID: 991095 · Report as offensive
Profile rebest Project Donor
Volunteer tester
Joined: 16 Apr 00
Posts: 1296
Credit: 45,357,093
RAC: 0
United States
Message 991112 - Posted: 22 Apr 2010, 14:52:14 UTC - in response to Message 991095.  

Thanks for the update Eric. Any news is appreciated.


AMEN!! Many thanks!!!

Join the PACK!
ID: 991112 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 991115 - Posted: 22 Apr 2010, 15:10:12 UTC
Last modified: 22 Apr 2010, 15:22:52 UTC


Eric, thanks for the update!

One of the two download servers is offline.
The Cricket graph shows normal traffic.

My BOINC can upload and report/request work,
but it can't download.


____________
[Optimized project applications, for to increase your PC performance (double RAC)!][Overview of abbreviations, which are used often in forum and their meaning.]
ID: 991115 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 991117 - Posted: 22 Apr 2010, 15:21:38 UTC - in response to Message 991115.  

ID: 991117 · Report as offensive
Profile Dave Barstow

Joined: 14 May 99
Posts: 76
Credit: 15,064,044
RAC: 0
Philippines
Message 991159 - Posted: 22 Apr 2010, 18:47:47 UTC - in response to Message 991016.  

Thanks for the Always Welcome update Eric!

You and I seem to think alike... kick/hit it a few times, if no positive result pull-the-plug and let someone else have a go at it... ;-)

Hopefully the 'Morning Crew' will have a larger hammer...
ID: 991159 · Report as offensive
DJStarfox

Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 991169 - Posted: 22 Apr 2010, 19:25:24 UTC - in response to Message 991016.  

Eric, I like the avatar icon. Did you make it or find it somewhere?
ID: 991169 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 991172 - Posted: 22 Apr 2010, 19:40:09 UTC - in response to Message 991117.  


Hmm.. strange..

I rebooted the PC and now downloads work.

Not so strange: DNS was supplying the IP addresses of both download servers, so if your host was trying to download from the dead one it could only fail. I had to run "ipconfig /flushdns" to get downloads.
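
A rough sketch of that failure mode, in Python. When a hostname resolves to several addresses, a client that keeps using one dead address can only fail, while a client that walks the whole list can fall through to a live server. The function names (`resolve_all`, `try_connect`, `first_reachable`) are made up for this illustration, and this is not a description of how BOINC or libcurl actually do it.

```python
import socket


def resolve_all(host, port=80):
    """Return every distinct IP address the resolver reports for host."""
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    addresses = []
    for _family, _type, _proto, _canonname, sockaddr in infos:
        ip = sockaddr[0]
        if ip not in addresses:
            addresses.append(ip)
    return addresses


def try_connect(ip, port=80, timeout=5.0):
    """Return True if a TCP connection to ip:port succeeds within timeout."""
    try:
        with socket.create_connection((ip, port), timeout=timeout):
            return True
    except OSError:
        return False


def first_reachable(addresses, connect=try_connect):
    """Try each address in order and return the first one that answers.

    Returns None when every address fails.
    """
    for ip in addresses:
        if connect(ip):
            return ip
    return None
```

Something like `first_reachable(resolve_all("example.org"))` would then pick a working server even when one of the listed addresses is down (the hostname here is only a placeholder).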

Both are up now, thanks to whoever solved that!
                                                              Joe
ID: 991172 · Report as offensive
Profile Dr. C.E.T.I.
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 991180 - Posted: 22 Apr 2010, 20:02:08 UTC


Thanks Eric - i simply leave well-enough alone

- and the mngr seems to keep care of iT all in the end . . .


BOINC Wiki . . .

Science Status Page . . .
ID: 991180 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14645
Credit: 200,643,578
RAC: 874
United Kingdom
Message 991241 - Posted: 22 Apr 2010, 23:06:44 UTC - in response to Message 991172.  


Hmm.. strange..

I rebooted the PC and now downloads work.

Not so strange: DNS was supplying the IP addresses of both download servers, so if your host was trying to download from the dead one it could only fail. I had to run "ipconfig /flushdns" to get downloads.

Both are up now, thanks to whoever solved that!
                                                              Joe

BOINC v6.10.24 and above solved that, with a libcurl bug-fix which properly implements round-robin DNS.
ID: 991241 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 991265 - Posted: 23 Apr 2010, 1:04:02 UTC - in response to Message 991241.  
Last modified: 23 Apr 2010, 1:20:13 UTC

Hmm.. strange..

I rebooted the PC and now downloads work.

Not so strange: DNS was supplying the IP addresses of both download servers, so if your host was trying to download from the dead one it could only fail. I had to run "ipconfig /flushdns" to get downloads.

Both are up now, thanks to whoever solved that!
                                                              Joe

BOINC v6.10.24 and above solved that, with a libcurl bug-fix which properly implements round-robin DNS.


Oh well..

I had download problems again.
So I rebooted, which didn't help (with BOINC V6.10.18).
Then I installed BOINC V6.10.43, but also no joy: no downloads were possible, although a lot of downloads were listed in the overview.

I like to edit the CPU name in the registry.. :-D
BOINC V6.10.43 (or some earlier version) no longer takes the CPU name from the registry the way V6.10.18 does; it takes the info from somewhere else. My edit was no longer shown in BOINC Manager.

So I downgraded to BOINC V6.10.18, deleted the # signs in front of the SETI@home server addresses in my hosts file, and BOINC now downloads without problems. :-)

I would like to update to BOINC V6.10.43+ if a newer version takes the CPU name from the registry again. ;-)
Why was it changed?


____________
[Optimized project applications, for to increase your PC performance (double RAC)!][Overview of abbreviations, which are used often in forum and their meaning.]
ID: 991265 · Report as offensive
Robert Skjöld

Joined: 26 Oct 09
Posts: 2
Credit: 380,626
RAC: 0
Sweden
Message 992029 - Posted: 26 Apr 2010, 12:44:56 UTC

2010-04-26 00:26:58	SETI@home	Started download of 28dc06aa.6699.2935.3.10.41
2010-04-26 00:26:58	SETI@home	Started download of 24ja07af.20341.85139.8.10.179
2010-04-26 00:27:00	SETI@home	Temporarily failed download of 28dc06aa.6699.2935.3.10.41: HTTP error
2010-04-26 00:27:00	SETI@home	Backing off 2 hr 22 min 6 sec on download of 28dc06aa.6699.2935.3.10.41
2010-04-26 00:27:00	SETI@home	Temporarily failed download of 24ja07af.20341.85139.8.10.179: HTTP error
2010-04-26 00:27:00	SETI@home	Backing off 1 hr 55 min 7 sec on download of 24ja07af.20341.85139.8.10.179
2010-04-26 00:27:00	SETI@home	Started download of 24ja07af.20341.85139.8.10.185
2010-04-26 00:27:00	SETI@home	Started download of 27ja07ag.13493.17659.6.10.42
2010-04-26 00:27:02	SETI@home	Temporarily failed download of 24ja07af.20341.85139.8.10.185: HTTP error
2010-04-26 00:27:02	SETI@home	Backing off 3 min 49 sec on download of 24ja07af.20341.85139.8.10.185
2010-04-26 00:27:02	SETI@home	Temporarily failed download of 27ja07ag.13493.17659.6.10.42: HTTP error
2010-04-26 00:27:02	SETI@home	Backing off 1 hr 27 min 8 sec on download of 27ja07ag.13493.17659.6.10.42
2010-04-26 00:27:12	SETI@home	Started download of 28dc06aa.29060.16746.5.10.18
2010-04-26 00:27:12	SETI@home	Started download of 27ja07ag.4588.20522.8.10.208
2010-04-26 00:27:13	SETI@home	Temporarily failed download of 28dc06aa.29060.16746.5.10.18: HTTP error
2010-04-26 00:27:13	SETI@home	Backing off 17 min 46 sec on download of 28dc06aa.29060.16746.5.10.18
2010-04-26 00:27:13	SETI@home	Temporarily failed download of 27ja07ag.4588.20522.8.10.208: HTTP error
2010-04-26 00:27:13	SETI@home	Backing off 1 min 51 sec on download of 27ja07ag.4588.20522.8.10.208
2010-04-26 00:27:13	SETI@home	Started download of 27ja07ag.4117.3344.10.10.121
2010-04-26 00:27:13	SETI@home	Started download of 11ja07ai.19083.22158.3.10.218
2010-04-26 00:27:14	SETI@home	Temporarily failed download of 27ja07ag.4117.3344.10.10.121: HTTP error
2010-04-26 00:27:14	SETI@home	Backing off 1 min 0 sec on download of 27ja07ag.4117.3344.10.10.121
2010-04-26 00:27:14	SETI@home	Temporarily failed download of 11ja07ai.19083.22158.3.10.218: HTTP error
2010-04-26 00:27:14	SETI@home	Backing off 1 min 0 sec on download of 11ja07ai.19083.22158.3.10.218
2010-04-26 00:27:32	SETI@home	Started download of 27ja07ag.2476.6207.13.10.123
2010-04-26 00:27:32	SETI@home	Started download of 27ja07ag.3797.22976.14.10.151
2010-04-26 00:27:33	SETI@home	Temporarily failed download of 27ja07ag.2476.6207.13.10.123: HTTP error
2010-04-26 00:27:33	SETI@home	Backing off 1 min 0 sec on download of 27ja07ag.2476.6207.13.10.123
2010-04-26 00:27:33	SETI@home	Temporarily failed download of 27ja07ag.3797.22976.14.10.151: HTTP error
2010-04-26 00:27:33	SETI@home	Backing off 1 min 0 sec on download of 27ja07ag.3797.22976.14.10.151
2010-04-26 00:28:15	SETI@home	Started download of 27ja07ag.4117.3344.10.10.121
2010-04-26 00:28:15	SETI@home	Started download of 11ja07ai.19083.22158.3.10.218
2010-04-26 00:28:17	SETI@home	Temporarily failed download of 27ja07ag.4117.3344.10.10.121: HTTP error
2010-04-26 00:28:17	SETI@home	Backing off 1 min 0 sec on download of 27ja07ag.4117.3344.10.10.121
2010-04-26 00:28:17	SETI@home	Temporarily failed download of 11ja07ai.19083.22158.3.10.218: HTTP error
2010-04-26 00:28:17	SETI@home	Backing off 1 min 0 sec on download of 11ja07ai.19083.22158.3.10.218
2010-04-26 00:29:17	SETI@home	Started download of 27ja07ag.4588.20522.8.10.208

This has been going on for approximately a week. Is it just me? It seems to have been fixed for most of you.

Any suggestions?
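
For anyone comparing logs like the one above, here is a small Python sketch that pulls the backoff interval out of each "Backing off" line and converts it to seconds. It assumes the message format is exactly what appears in this log (optional "hr" and "min" parts followed by "sec"); the helper name `backoff_seconds` is made up for the example.

```python
import re

# Matches e.g. "Backing off 2 hr 22 min 6 sec on download of <file>".
# The "hr" and "min" parts are optional, as in the log above.
BACKOFF_RE = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(\d+) sec on download of (\S+)"
)


def backoff_seconds(line):
    """Return (file_name, seconds) for a backoff line, or None otherwise."""
    match = BACKOFF_RE.search(line)
    if match is None:
        return None
    hours, minutes, seconds, name = match.groups()
    total = int(hours or 0) * 3600 + int(minutes or 0) * 60 + int(seconds)
    return name, total
```

Run over the log above, this makes it easy to see that the delays range from one minute to well over two hours.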
ID: 992029 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 992072 - Posted: 26 Apr 2010, 15:26:13 UTC - in response to Message 992029.  
Last modified: 26 Apr 2010, 15:26:32 UTC

This has been going on for approximately a week. Is it just me? It seems to have been fixed for most of you.

Any suggestions?

Read the thread you post in?

Two messages above/below yours (depending on how you read threads, if at all :-)) was Richard's message saying:

BOINC v6.10.24 and above solved that, with a libcurl bug-fix which properly implements round-robin DNS.

Since you're still on BOINC 6.10.18, you may want to upgrade to 6.10.43 (the latest recommended) from http://boinc.berkeley.edu/download_all.php.
ID: 992072 · Report as offensive
Profile Derald stafford
Volunteer tester

Joined: 16 May 99
Posts: 4
Credit: 33,025
RAC: 0
United States
Message 992090 - Posted: 26 Apr 2010, 17:08:14 UTC - in response to Message 992029.  

I've had the same problem since vader crashed. I can't upload or download anything through BOINC for SETI or Einstein, and the MilkyWay and Cosmology projects are running very slowly. A BOINC upgrade did not help.
ID: 992090 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65690
Credit: 55,293,173
RAC: 49
United States
Message 992104 - Posted: 26 Apr 2010, 18:11:48 UTC

I've been uploading and downloading. At the moment I just have a lot of reporting to do, but that'll clear up in time. Oh, and I'm using 6.10.48, so no worries on my part. I'll just wait, as I have plenty to do, and BOINC on my end will take care of the PC. :D

The last download was at 12:07am PDT on 4/26/2010 and the last upload was at 10:53am PDT on 4/26/2010. I don't know whether that's normal or not, but maybe the info will be useful somehow.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 992104 · Report as offensive
Urs Echternacht
Volunteer tester
Joined: 15 May 99
Posts: 692
Credit: 135,197,781
RAC: 211
Germany
Message 992119 - Posted: 26 Apr 2010, 19:26:19 UTC

In addition to what Ageless said, it could help to check "Skip image file verification" in the Preferences of 6.10.24+ BOINC clients. At least it made my waiting downloads come through.
_\|/_
U r s
ID: 992119 · Report as offensive
Profile QuietDad
Joined: 2 Oct 99
Posts: 83
Credit: 28,926,603
RAC: 59
United States
Message 992222 - Posted: 27 Apr 2010, 4:55:28 UTC - in response to Message 992119.  

Folks, in the Community menu, the description of THIS (Technical News) forum is:

Behind-the-scenes technical details and news updates (only SETI@home staff can start new threads)


Every time someone from Berkeley posts any news, it becomes a "but it's still broken... see!" thread, which is better off in the Number Crunching forum because there is usually a duplicate thread already started there.

With 30 years of IT technical support under my belt, I can understand why they stop posting here.
ID: 992222 · Report as offensive
DaveInRomeNY

Joined: 25 Sep 99
Posts: 1
Credit: 21,210,231
RAC: 9
United States
Message 992231 - Posted: 27 Apr 2010, 6:24:11 UTC - in response to Message 992029.  

I'm running BOINC 6.10.43 (MacBookPro, OSX 10.5.8) and get the same HTTP errors you are getting, also for about a week.
ID: 992231 · Report as offensive


 