Project servers down ?

Message boards : Number crunching : Project servers down ?

Profile CoolBlue87GT
Joined: 27 Dec 03
Posts: 59
Credit: 53,580
RAC: 0
United States
Message 511023 - Posted: 31 Jan 2007, 1:02:02 UTC

Starting at 3:13 this afternoon, I started getting error messages.

1/30/2007 3:13:33 PM||Access to reference site succeeded - project servers may be temporarily down.
.
. (the messages have continued all night)
.
1/30/2007 7:37:05 PM|SETI@home|Started download of file 10my00aa.14124.20705.204810.3.148
1/30/2007 7:37:30 PM||Project communication failed: attempting access to reference site
1/30/2007 7:37:30 PM|SETI@home|Temporarily failed download of 10my00aa.14124.20705.204810.3.148: http error
1/30/2007 7:37:30 PM|SETI@home|Backing off 59 minutes and 20 seconds on download of file 10my00aa.14124.20705.204810.3.148
1/30/2007 7:37:31 PM||Access to reference site succeeded - project servers may be temporarily down.

Any ideas ?
ID: 511023 · Report as offensive
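The log lines quoted in this post appear to be pipe-delimited: a timestamp, a project name (empty for client-wide messages), then the message text. Here is a minimal Python sketch for splitting such lines and counting the "Temporarily failed" entries. The names `LogEntry`, `parse_line` and `count_failures` are hypothetical, and the layout is inferred from the excerpt above, not taken from BOINC documentation.

```python
from collections import Counter
from typing import NamedTuple


class LogEntry(NamedTuple):
    timestamp: str  # kept as raw text; the format differs between platforms
    project: str    # empty string for client-wide messages
    message: str


def parse_line(line: str) -> LogEntry:
    """Split one 'timestamp|project|message' line into its three fields."""
    # maxsplit=2 keeps any '|' that might occur inside the message text.
    timestamp, project, message = line.split("|", 2)
    return LogEntry(timestamp.strip(), project.strip(), message.strip())


def count_failures(lines):
    """Count 'Temporarily failed' messages per transfer direction."""
    counts = Counter()
    for line in lines:
        entry = parse_line(line)
        if entry.message.startswith("Temporarily failed download"):
            counts["download"] += 1
        elif entry.message.startswith("Temporarily failed upload"):
            counts["upload"] += 1
    return counts


# Sample lines copied from the post above.
sample = [
    "1/30/2007 7:37:05 PM|SETI@home|Started download of file 10my00aa.14124.20705.204810.3.148",
    "1/30/2007 7:37:30 PM||Project communication failed: attempting access to reference site",
    "1/30/2007 7:37:30 PM|SETI@home|Temporarily failed download of 10my00aa.14124.20705.204810.3.148: http error",
]
print(count_failures(sample))
```

Counting failures per direction over a few hours gives a rough picture of whether uploads, downloads, or both are affected.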
Profile John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 511027 - Posted: 31 Jan 2007, 1:05:46 UTC

We are currently in the recovery aftermath of a planned outage, during which the database is worked on, compressed and backed up. The recovery period tends to be long before normal service (and WU U/Ls and new WU D/Ls) settle down!
It's good to be back amongst friends and colleagues



ID: 511027 · Report as offensive
Profile Gecko
Volunteer tester
Joined: 17 Nov 99
Posts: 454
Credit: 6,946,910
RAC: 47
United States
Message 511360 - Posted: 31 Jan 2007, 21:31:17 UTC

Is there a certain period of time that the WUs will "time-out" if the download doesn't complete and become "Ghosts"?
ID: 511360 · Report as offensive
Wander Saito
Volunteer tester

Joined: 7 Jul 03
Posts: 555
Credit: 2,136,061
RAC: 0
Brazil
Message 511391 - Posted: 31 Jan 2007, 23:02:08 UTC - in response to Message 511360.  

Is there a certain period of time that the WUs will "time-out" if the download doesn't complete and become "Ghosts"?


I guess you could call the WU's deadline just that. Hopefully it will be uploaded long before that, since deadlines are measured in weeks and most uploads complete within hours, even during long outages. But in the strict sense, they don't become ghost WUs; they only expire.

Regards,
Wander
ID: 511391 · Report as offensive
Profile Darth Dogbytes™
Volunteer tester

Joined: 30 Jul 03
Posts: 7512
Credit: 2,021,148
RAC: 0
United States
Message 511397 - Posted: 31 Jan 2007, 23:10:52 UTC
Last modified: 31 Jan 2007, 23:11:38 UTC

Kryten is having the vapours (quaint old English expression)...again.
Account frozen...
ID: 511397 · Report as offensive
Profile keyboards
Volunteer tester
Joined: 14 Jul 00
Posts: 66
Credit: 492,766
RAC: 0
United States
Message 511413 - Posted: 31 Jan 2007, 23:42:07 UTC - in response to Message 511397.  

Kryten is having the vapours (quaint old English expression)...again.


I think Kryten is having even more serious problems like cardiac arrest. BETA has been sending NO WORK messages (even though the server status shows 29,000+ WUs available) since early this morning and all my crunchers have now reached 24 hour deferral :-(
!!Stupidity should be PAINFUL!!
ID: 511413 · Report as offensive
Profile Pappa
Volunteer tester
Joined: 9 Jan 00
Posts: 2562
Credit: 12,301,681
RAC: 0
United States
Message 511440 - Posted: 1 Feb 2007, 0:58:05 UTC - in response to Message 511431.  

Currently, I would think that the name should be changed to "oblivious"

Kryten is having the vapours (quaint old English expression)...again.

More like Vapor lock. ;)

Could that new server Sidius do Kryten's job?

Or do we need a server named Vader?


Please consider a Donation to the Seti Project.

ID: 511440 · Report as offensive
Profile Dennis Lathem
Joined: 3 Dec 06
Posts: 27
Credit: 1,126,010
RAC: 0
United States
Message 511450 - Posted: 1 Feb 2007, 1:32:52 UTC

All 6 of my machines are having problems uploading AND downloading and this has been going on since last night. Something is SERIOUSLY wrong.

ID: 511450 · Report as offensive
Odysseus
Volunteer tester
Joined: 26 Jul 99
Posts: 1808
Credit: 6,701,347
RAC: 6
Canada
Message 511457 - Posted: 1 Feb 2007, 1:51:52 UTC - in response to Message 511450.  
Last modified: 1 Feb 2007, 1:52:16 UTC

All 6 of my machines are having problems uploading AND downloading and this has been going on since last night. Something is SERIOUSLY wrong.

See the new Technical News forum. Matt has had to reboot the server twice today … and the more downtime it gets, the more intense the pounding it receives (with concomitant delays & dropped connections) when it comes back up.
ID: 511457 · Report as offensive
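The pounding described above is what happens when many clients retry at the same moment after an outage. One common way to spread retries out is randomized exponential backoff. The sketch below is a generic illustration of that idea and is not BOINC's actual retry algorithm; the function name `backoff_delay` and the `base` and `cap` values are assumptions chosen for the example.

```python
import random
from typing import Optional


def backoff_delay(attempt: int, base: float = 60.0, cap: float = 4 * 3600.0,
                  rng: Optional[random.Random] = None) -> float:
    """Return a randomized retry delay in seconds for a 0-based attempt number."""
    rng = rng or random.Random()
    # The upper bound doubles with every failed attempt, up to a fixed cap.
    ceiling = min(cap, base * (2 ** attempt))
    # Picking a random point between base and ceiling keeps clients from
    # all retrying at the same instant once the server comes back.
    return rng.uniform(base, ceiling)


if __name__ == "__main__":
    rng = random.Random(2007)  # fixed seed only to make the demo repeatable
    for attempt in range(6):
        print(attempt, round(backoff_delay(attempt, rng=rng)), "seconds")
```

With this scheme the first retry always waits exactly `base` seconds, and later retries are spread over a widening window, so a recovering server sees a trickle of requests instead of a spike.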
Profile CoolBlue87GT
Joined: 27 Dec 03
Posts: 59
Credit: 53,580
RAC: 0
United States
Message 513489 - Posted: 5 Feb 2007, 6:01:06 UTC
Last modified: 5 Feb 2007, 6:01:36 UTC

I'm still getting the same error.

2/5/2007 12:00:42 AM|SETI@home|Started upload of file 18my00aa.12610.8289.473568.3.148_0_0
2/5/2007 12:01:04 AM||Project communication failed: attempting access to reference site
2/5/2007 12:01:04 AM|SETI@home|Temporarily failed upload of 18my00aa.12610.8289.473568.3.148_0_0: http error
2/5/2007 12:01:04 AM|SETI@home|Backing off 1 minutes and 0 seconds on upload of file 18my00aa.12610.8289.473568.3.148_0_0
2/5/2007 12:01:05 AM||Access to reference site succeeded - project servers may be temporarily down.
2/5/2007 12:02:05 AM|SETI@home|Started upload of file 18my00aa.12610.8289.473568.3.148_0_0
2/5/2007 12:02:35 AM|SETI@home|Finished upload of file 18my00aa.12610.8289.473568.3.148_0_0
2/5/2007 12:02:35 AM|SETI@home|Throughput 1073 bytes/sec
2/5/2007 12:53:46 AM|SETI@home|Started download of file 17my00aa.2347.29281.748586.3.255
2/5/2007 12:54:08 AM||Project communication failed: attempting access to reference site
2/5/2007 12:54:08 AM|SETI@home|Temporarily failed download of 17my00aa.2347.29281.748586.3.255: http error
2/5/2007 12:54:08 AM|SETI@home|Backing off 2 hours, 40 minutes and 59 seconds on download of file 17my00aa.2347.29281.748586.3.255
2/5/2007 12:54:09 AM||Access to reference site succeeded - project servers may be temporarily down.
2/5/2007 12:54:35 AM|SETI@home|Started download of file 18my00aa.25368.28514.548578.3.44
2/5/2007 12:54:36 AM|SETI@home|Started download of file 07jn00aa.20200.19362.136088.3.178
2/5/2007 12:54:57 AM||Project communication failed: attempting access to reference site
2/5/2007 12:54:57 AM|SETI@home|Temporarily failed download of 18my00aa.25368.28514.548578.3.44: http error
2/5/2007 12:54:57 AM|SETI@home|Backing off 46 minutes and 57 seconds on download of file 18my00aa.25368.28514.548578.3.44
2/5/2007 12:54:58 AM||Access to reference site succeeded - project servers may be temporarily down.
2/5/2007 12:54:58 AM|SETI@home|Temporarily failed download of 07jn00aa.20200.19362.136088.3.178: http error
2/5/2007 12:54:58 AM|SETI@home|Backing off 2 hours, 8 minutes and 59 seconds on download of file 07jn00aa.20200.19362.136088.3.178

Glad I have 7 WUs stored. Any ideas when things will be back to "normal"?
ID: 513489 · Report as offensive
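The "Backing off ..." lines in the log above give the wait as words, which makes them awkward to compare. A small Python sketch, assuming the message wording shown in this thread, converts them to seconds; the helper name `backoff_seconds` is hypothetical.

```python
import re

_UNIT_SECONDS = {"hour": 3600, "minute": 60, "second": 1}
# Matches "2 hours", "1 minutes", "59 seconds", and so on.
_PART = re.compile(r"(\d+)\s+(hour|minute|second)s?")


def backoff_seconds(message: str) -> int:
    """Convert 'Backing off 2 hours, 40 minutes and 59 seconds on ...' to seconds."""
    if not message.startswith("Backing off"):
        raise ValueError("not a backoff message")
    # The duration ends at ' on ', which precedes 'download of file ...'.
    duration = message.split(" on ", 1)[0]
    return sum(int(n) * _UNIT_SECONDS[unit] for n, unit in _PART.findall(duration))


msg = ("Backing off 2 hours, 40 minutes and 59 seconds on download of file "
       "17my00aa.2347.29281.748586.3.255")
print(backoff_seconds(msg))  # 9659
```

Applied to the log above, the waits range from 60 seconds for the upload to 9659 seconds for one of the downloads.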
tombew

Send message
Joined: 12 Apr 00
Posts: 111
Credit: 12,182,261
RAC: 0
United States
Message 513548 - Posted: 5 Feb 2007, 11:12:31 UTC - in response to Message 513489.  

Have you tried stopping and restarting BOINC?
ID: 513548 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 513549 - Posted: 5 Feb 2007, 11:17:29 UTC

And if that doesn't clear it, restarting the PC?
ID: 513549 · Report as offensive
Profile CoolBlue87GT
Joined: 27 Dec 03
Posts: 59
Credit: 53,580
RAC: 0
United States
Message 513555 - Posted: 5 Feb 2007, 11:48:26 UTC - in response to Message 513548.  

Have you tried stopping and restarting BOINC?


I'll give that a shot
ID: 513555 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Joined: 18 May 99
Posts: 20034
Credit: 40,757,560
RAC: 67
United Kingdom
Message 513559 - Posted: 5 Feb 2007, 12:06:38 UTC

You could also try Pappa's suggestion in msg 512228

Andy
ID: 513559 · Report as offensive
Profile CoolBlue87GT
Joined: 27 Dec 03
Posts: 59
Credit: 53,580
RAC: 0
United States
Message 513564 - Posted: 5 Feb 2007, 12:29:46 UTC - in response to Message 513548.  
Last modified: 5 Feb 2007, 12:30:11 UTC

Have you tried stopping and restarting BOINC?


Okay, I stopped & rebooted. That worked.

The three WUs that were waiting are now downloaded.

Thanks for the hints.
ID: 513564 · Report as offensive
Profile Dennis Lathem
Joined: 3 Dec 06
Posts: 27
Credit: 1,126,010
RAC: 0
United States
Message 513626 - Posted: 5 Feb 2007, 15:07:44 UTC

I have three machines at a location that were unable to download new work. I shut down BOINC on all three machines and rebooted each machine. Now BOINC is back up and getting work.

However, one of the laptops downloaded work and began to crunch. It is still generating a large number of computation errors. I really don't know what could be causing this. This particular laptop is one I got from my daughter when I bought her a new one. This machine had a problem with shutting down, but now that I have cleaned all the dust out of the system and did a nuke and pave (clean OS install) this problem has gone away.

In part, I have been using SETI crunching to test the laptop extensively. It is performing work, but it has a much higher incidence of computational errors than any of my other five machines.

The only program running on this machine is BOINC. Here is the listing for this machine. Any ideas would be appreciated.

http://setiathome.berkeley.edu/show_host_detail.php?hostid=3055662
ID: 513626 · Report as offensive
Profile Pooh Bear 27
Volunteer tester
Joined: 14 Jul 03
Posts: 3224
Credit: 4,603,826
RAC: 0
United States
Message 513634 - Posted: 5 Feb 2007, 15:36:31 UTC

Dennis,

Stop using the screen saver. That machine has a shared video memory, and this can cause the issues you are having. You can look for updated video drivers, and make sure you have the latest patches to DirectX 9.0c.

Not using the screen saver will also save crunching time. I use a blank screen saver on those machines I HAVE to use a screen saver on. Those 3D ones can cause issues.



My movie https://vimeo.com/manage/videos/502242
ID: 513634 · Report as offensive
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9555
Credit: 3,308,926
RAC: 2
United States
Message 513637 - Posted: 5 Feb 2007, 15:45:55 UTC

Hey Dennis,

Pooh Bear hit the spot. Looking at some of your work units, the exit code -1073741819 usually means something is up with the video drivers.

There was another exit code, -1072365552, that I couldn't find anything on. A lot of the discussion here about laptops is that they overheat quickly and throttle down the speed of the CPU. Make sure you have good cooling, and you may want to use a program to limit the amount of CPU usage, or try version 5.8.x BOINC; you can also set how much CPU to use in your General preferences.

Good luck!

Jeremy
ID: 513637 · Report as offensive
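The exit codes in this post are signed decimal numbers, while Windows status codes are usually looked up as unsigned 32-bit hexadecimal values. A short Python sketch does the conversion; the function name `to_status_hex` is made up for the example. The first code comes out as 0xC0000005, which is the Windows access violation status, a general invalid-memory-access fault. The second is shown only as a conversion; its meaning is not established in this thread.

```python
def to_status_hex(exit_code: int) -> str:
    """Render a signed 32-bit exit code as unsigned hexadecimal."""
    # Masking with 0xFFFFFFFF reinterprets the two's-complement value as unsigned.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


print(to_status_hex(-1073741819))  # 0xC0000005
print(to_status_hex(-1072365552))  # 0xC0150010
```

Searching for the hexadecimal form tends to turn up more results than the decimal form.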
Profile Gecko
Volunteer tester
Joined: 17 Nov 99
Posts: 454
Credit: 6,946,910
RAC: 47
United States
Message 513653 - Posted: 5 Feb 2007, 17:03:02 UTC

Haven't been able to D/L WUs for Apple w/ Intel in 2 days.
Get this message:

Mon Feb 5 08:51:24 2007||Project communication failed: attempting access to reference site
Mon Feb 5 08:51:24 2007||Access to reference site succeeded - project servers may be temporarily down.
Mon Feb 5 08:51:24 2007|SETI@home|Temporarily failed download of 20ap00aa.5520.2320.134656.3.21: http error
Mon Feb 5 08:51:24 2007|SETI@home|Backing off 1 minutes and 0 seconds on download of file 20ap00aa.5520.2320.134656.3.21
Mon Feb 5 08:51:24 2007|SETI@home|Temporarily failed download of 20ap00aa.5520.2320.134656.3.56: http error
Mon Feb 5 08:51:24 2007|SETI@home|Backing off 1 minutes and 0 seconds on download of file 20ap00aa.5520.2320.134656.3.56


Server status shows fine on Berkeley's end.
What's up w/ this?

I've re-started BOINC and also have reset project (HATE to do this!)
Same thing.

Anyone w/ Intel Macs having similar problem?
BTW, I'm running 5.4.11 stable release.
ID: 513653 · Report as offensive
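The Mac log above uses a different timestamp layout ("Mon Feb 5 08:51:24 2007") from the Windows logs earlier in the thread ("1/30/2007 7:37:05 PM"). To compare logs from both platforms, a Python sketch along these lines could try each layout in turn. The format strings are inferred from the two excerpts only, the name `parse_timestamp` is hypothetical, and the day and month names assume an English locale.

```python
from datetime import datetime

# Layouts inferred from the log excerpts in this thread.
_FORMATS = (
    "%m/%d/%Y %I:%M:%S %p",  # e.g. 1/30/2007 7:37:05 PM
    "%a %b %d %H:%M:%S %Y",  # e.g. Mon Feb 5 08:51:24 2007
)


def parse_timestamp(text: str) -> datetime:
    """Parse a log timestamp in either of the two layouts seen above."""
    for fmt in _FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue  # try the next layout
    raise ValueError(f"unrecognized timestamp: {text!r}")


print(parse_timestamp("1/30/2007 7:37:05 PM"))
print(parse_timestamp("Mon Feb 5 08:51:24 2007"))
```

The timestamps are local times with no zone information, so logs from different machines would still need a known offset before they can be lined up.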
Profile Dennis Lathem
Joined: 3 Dec 06
Posts: 27
Credit: 1,126,010
RAC: 0
United States
Message 513662 - Posted: 5 Feb 2007, 17:37:31 UTC - in response to Message 513634.  

Dennis,

Stop using the screen saver. That machine has a shared video memory, and this can cause the issues you are having. You can look for updated video drivers, and make sure you have the latest patches to DirectX 9.0c.

Not using the screen saver will also save crunching time. I use a blank screen saver on those machines I HAVE to use a screen saver on. Those 3D ones can cause issues.



Thanks for the suggestion, however, I do not use any screen saver. I have the screen set to power off after a few minutes. I also have it on a forced fan cooling rack.

When I did the nuke and pave on the machine I made sure each and every driver was updated. The machine does indeed have that shared video memory stuff, and it is sucking up 128MB of the 1GB installed.

Funny, nothing else appears to have any errors showing up. I will look into the new version to pull down the CPU load on this machine a bit. Perhaps that will help. I have a large work unit about to be finished; I hope it is free of errors.

ID: 513662 · Report as offensive


 
©2026 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.