Panic Mode On (26) Server problems

Message boards : Number crunching : Panic Mode On (26) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 13 · Next

AuthorMessage
Luke
Volunteer developer
Avatar

Send message
Joined: 31 Dec 06
Posts: 2546
Credit: 817,560
RAC: 0
New Zealand
Message 950324 - Posted: 28 Nov 2009, 0:49:27 UTC

All my uploads and downloads have finally cleared. And a 20% RAC jump to go with it... since I was lucky enough to be able to report a AP task with the rest.
- Luke.
ID: 950324 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 950326 - Posted: 28 Nov 2009, 1:00:29 UTC - in response to Message 950323.  

Or.....

I had 39 WU's stuck in download for a couple of hours giving a mixture of these reports even though the network traffic seemed to be dropping off. A restart of BM caused them all to download immediately without any problem.


Same here.
Had a whole group of downloads allocated, but all of them were "Download Pending". Selected a few & did the "Retry now", but they timed out after a couple of seconds. After that time out, after the next attempt at downloading they rejoined all the others with "Download Pending" status.
I checked the project properties, and there were no deferal times there. Tried the "Do nework communication" option, still all "Download Pending".
Exited & restarted BOINC and after about 4-6 "Wrong file size" error messages, they all downloaded one ofter the other as fast as they ever have.

May be a bug with the manager? My present one is
v6.6.41
Running the AKv8 optimised app. No CUDA. No projects other than Seti.
Grant
Darwin NT
ID: 950326 · Report as offensive
Luke
Volunteer developer
Avatar

Send message
Joined: 31 Dec 06
Posts: 2546
Credit: 817,560
RAC: 0
New Zealand
Message 950327 - Posted: 28 Nov 2009, 1:02:55 UTC - in response to Message 950326.  

Or.....

I had 39 WU's stuck in download for a couple of hours giving a mixture of these reports even though the network traffic seemed to be dropping off. A restart of BM caused them all to download immediately without any problem.


Same here.
Had a whole group of downloads allocated, but all of them were "Download Pending". Selected a few & did the "Retry now", but they timed out after a couple of seconds. After that time out, after the next attempt at downloading they rejoined all the others with "Download Pending" status.
I checked the project properties, and there were no deferal times there. Tried the "Do nework communication" option, still all "Download Pending".
Exited & restarted BOINC and after about 4-6 "Wrong file size" error messages, they all downloaded one ofter the other as fast as they ever have.

May be a bug with the manager? My present one is
v6.6.41
Running the AKv8 optimised app. No CUDA. No projects other than Seti.


There are a few new features and changes related to the Transfers tab (UL's/DL's) in Boinc 6.10.18.

Perhaps it's worth a shot upgrading...

- Luke.
ID: 950327 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 950337 - Posted: 28 Nov 2009, 1:48:35 UTC - in response to Message 950327.  

There are a few new features and changes related to the Transfers tab (UL's/DL's) in Boinc 6.10.18.

You must have different info than all the rest of us. ;-)

There were changes made in the project-wide upload and download back-off code. But these happened in 6.6.38 and 6.6.39

Grant is using 6.6.41, so he's well covered.
ID: 950337 · Report as offensive
Luke
Volunteer developer
Avatar

Send message
Joined: 31 Dec 06
Posts: 2546
Credit: 817,560
RAC: 0
New Zealand
Message 950339 - Posted: 28 Nov 2009, 2:07:58 UTC - in response to Message 950337.  

There are a few new features and changes related to the Transfers tab (UL's/DL's) in Boinc 6.10.18.

You must have different info than all the rest of us. ;-)

There were changes made in the project-wide upload and download back-off code. But these happened in 6.6.38 and 6.6.39

Grant is using 6.6.41, so he's well covered.


Oops, read the 6.10.18 thread wrong then. My mistake...
- Luke.
ID: 950339 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65748
Credit: 55,293,173
RAC: 49
United States
Message 950340 - Posted: 28 Nov 2009, 2:50:31 UTC

Must wait as the Force says the Turkeys, Er servers, are busy gobbling. :D
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 950340 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 950342 - Posted: 28 Nov 2009, 2:52:00 UTC


I guess there is something wrong in Berkeley with the new network switch.

I made a 'reboot' of my DSL router (OFF/ON), for to be sure.
Before I made a reboot of the PC, no joy.

After reboot of DSL router at my home and second reboot of the PC, BOINC DLed ~ 50 WUs. Then again wrong DL size errors.
Again a reboot of the PC, again some WUs DLed. Again, wrong DL size errors.
Again reboot, no joy.. only wrong DL size errors.

Is there now with the new network switch in Berkeley a 'jumping' DL IP ?

I use BOINC V6.6.37 and never saw this in past.

ID: 950342 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65748
Credit: 55,293,173
RAC: 49
United States
Message 950348 - Posted: 28 Nov 2009, 3:28:54 UTC - in response to Message 950342.  


I guess there is something wrong in Berkeley with the new network switch.

I made a 'reboot' of my DSL router (OFF/ON), for to be sure.
Before I made a reboot of the PC, no joy.

After reboot of DSL router at my home and second reboot of the PC, BOINC DLed ~ 50 WUs. Then again wrong DL size errors.
Again a reboot of the PC, again some WUs DLed. Again, wrong DL size errors.
Again reboot, no joy.. only wrong DL size errors.

Is there now with the new network switch in Berkeley a 'jumping' DL IP ?

I use BOINC V6.6.37 and never saw this in past.

And that's precisely why I didn't reset both of My routers. :D
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 950348 · Report as offensive
Profile jrusling
Avatar

Send message
Joined: 8 Sep 02
Posts: 37
Credit: 4,764,889
RAC: 0
United States
Message 950350 - Posted: 28 Nov 2009, 3:39:49 UTC - in response to Message 950342.  
Last modified: 28 Nov 2009, 3:42:54 UTC

Just on the off chance, I rebooted my machine and the all of the pending downloads completed. I had already tried just stopping and restarting BOINC and SETI.

I just noticed others posting that it appears that things have cleared up. I guess that it was just coincidence that it cleared while I was rebooting.
http://boincstats.com/signature/-1/user/18390/sig.png
ID: 950350 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 950357 - Posted: 28 Nov 2009, 4:04:53 UTC


It wasn't really a 'reset' of the DSL router.
It was a 'reboot'.. I switched only OFF - waited one, two minutes and again - ON.

Because I didn't had joy..
I switched OFF again the DSL router, waited now ~ 10 minutes. Again ON.
Then reboot of PC.
BOINC DLed WUs, every 2nd DL wrong size error.
After ~ 30 WUs DLed, only DL wrong size errors.

It's not at my side.

I don't have time for to reboot after ~ 30 DLs.



ID: 950357 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 950381 - Posted: 28 Nov 2009, 5:23:56 UTC - in response to Message 950357.  


There's a glitch of some type in the system- the number of results being returned is about half of what it would normally be & the network traffic dropped off way to quickly given the length of the outage.
And i've got another Work Unit ready to download, but it keeps timing out even after restarting BOINC.
Grant
Darwin NT
ID: 950381 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 950393 - Posted: 28 Nov 2009, 5:45:22 UTC


I updated both PCs with BOINC V6.10.18, but no joy.

Oh well..

ID: 950393 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 950415 - Posted: 28 Nov 2009, 7:14:01 UTC - in response to Message 950381.  

There's a glitch of some type in the system- the number of results being returned is about half of what it would normally be & the network traffic dropped off way to quickly given the length of the outage.
And i've got another Work Unit ready to download, but it keeps timing out even after restarting BOINC.

The trick is:

Exit BOINC.
do ipconfig /flushdns
Wait a minute.
Restart BOINC.
Instead of Retry download, do Advanced->Do network communications.

Others rebooting their machine(s) is about the same thing as that.
ID: 950415 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 950420 - Posted: 28 Nov 2009, 7:47:06 UTC - in response to Message 950415.  

There's a glitch of some type in the system- the number of results being returned is about half of what it would normally be & the network traffic dropped off way to quickly given the length of the outage.
And i've got another Work Unit ready to download, but it keeps timing out even after restarting BOINC.

The trick is:

.....

That did it.
Hopefully they'll have it sorted before i need to do it.
:-)
Grant
Darwin NT
ID: 950420 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65748
Credit: 55,293,173
RAC: 49
United States
Message 950425 - Posted: 28 Nov 2009, 8:18:19 UTC - in response to Message 950420.  

There's a glitch of some type in the system- the number of results being returned is about half of what it would normally be & the network traffic dropped off way to quickly given the length of the outage.
And i've got another Work Unit ready to download, but it keeps timing out even after restarting BOINC.

The trick is:

.....

That did it.
Hopefully they'll have it sorted before i need to do it.
:-)

Ditto and Good Night all. (yawns)
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 950425 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 950440 - Posted: 28 Nov 2009, 9:35:13 UTC


Ohh.. well..

What help it if I need to do it after every ~ 30 DLs?
My GPU cruncher make ~ 600 'normal' AR WUs/day.
He idle since days..

So 20x reboot or flush/day?
The hardware (if reboot) will be happy about.

Maybe it's time for to switch OFF some days.. maybe to Wednesday/Thursday next week..

Ohh.. nice weekend.

ID: 950440 · Report as offensive
Profile [DPC]NGS~R.Stanneveld
Volunteer tester
Avatar

Send message
Joined: 20 Nov 05
Posts: 9
Credit: 1,225,991
RAC: 0
Netherlands
Message 950444 - Posted: 28 Nov 2009, 10:27:07 UTC - in response to Message 950415.  
Last modified: 28 Nov 2009, 10:28:14 UTC

Exit BOINC.
do ipconfig /flushdns
Wait a minute.
Restart BOINC.
Instead of Retry download, do Advanced->Do network communications.

Well that did the trick just there :P
Drinking koffie now and everything seems fine.
ID: 950444 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 950448 - Posted: 28 Nov 2009, 10:39:02 UTC - in response to Message 950444.  

Exit BOINC.
do ipconfig /flushdns
Wait a minute.
Restart BOINC.
Instead of Retry download, do Advanced->Do network communications.

Well that did the trick just there :P
Drinking koffie now and everything seems fine.

Well, that did not the trick here... on both machines :(
ID: 950448 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 950450 - Posted: 28 Nov 2009, 10:42:33 UTC - in response to Message 950448.  
Last modified: 28 Nov 2009, 10:45:27 UTC

Exit BOINC.
do ipconfig /flushdns
Wait a minute.
Restart BOINC.
Instead of Retry download, do Advanced->Do network communications.

Well that did the trick just there :P
Drinking koffie now and everything seems fine.

Well, that did not the trick here... on both machines :(


In addition to the above trick, on my main machine I had to make a hosts file entry ( located in \windows\system32\drivers\etc ). My secondary machine already had such an entry (leftover & forgotten from last time...), so didn't have the problems downloading. The entry I added was:

208.68.240.18 boinc2.ssl.berkeley.edu
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 950450 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 950456 - Posted: 28 Nov 2009, 11:06:06 UTC - in response to Message 950450.  

Does that mean Vader is on the fritz again? Has anybody checked exactly which comms are failing? (I'm away from my main machines at the moment, and this little P4 has got plenty of work).

See 911175 and thereabouts.
ID: 950456 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 13 · Next

Message boards : Number crunching : Panic Mode On (26) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.