Panic Mode On (62) Server problems?

Message boards : Number crunching : Panic Mode On (62) Server problems?
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 11 · Next

AuthorMessage
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4277
Credit: 53,974,336
RAC: 5,579
United States
Message 1175192 - Posted: 2 Dec 2011, 16:44:01 UTC

Time to restart this thread.

ID: 1175192 · Report as offensive
Miklos M.

Send message
Joined: 5 May 99
Posts: 875
Credit: 20,222,998
RAC: 944
United States
Message 1175216 - Posted: 2 Dec 2011, 20:25:22 UTC

I wonder why when the settings are identical, one of my computers is getting a few ap units and the other one is getting none?
ID: 1175216 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,278,688
RAC: 190
United Kingdom
Message 1175223 - Posted: 2 Dec 2011, 20:51:30 UTC - in response to Message 1175216.  

I wonder why when the settings are identical, one of my computers is getting a few ap units and the other one is getting none?

One is running Boinc 6.10.x and the other is running Boinc 6.12.x, to build up any sort of Astropulse cache Boinc has to ask for work frequently, Boinc 6.12.x just doesn't ask frequently enough.

Claggy
ID: 1175223 · Report as offensive
mikeej42

Send message
Joined: 26 Oct 00
Posts: 109
Credit: 791,863,759
RAC: 0
United States
Message 1175233 - Posted: 2 Dec 2011, 21:56:48 UTC - in response to Message 1175223.  


One is running Boinc 6.10.x and the other is running Boinc 6.12.x, to build up any sort of Astropulse cache Boinc has to ask for work frequently, Boinc 6.12.x just doesn't ask frequently enough.

Claggy[/quote]
If I have a bunch of systems that are headless servers (no displays or GPUs) is there any reason to not to downgrade to 6.10.x?
ID: 1175233 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,278,688
RAC: 190
United Kingdom
Message 1175234 - Posted: 2 Dec 2011, 22:03:12 UTC - in response to Message 1175233.  


One is running Boinc 6.10.x and the other is running Boinc 6.12.x, to build up any sort of Astropulse cache Boinc has to ask for work frequently, Boinc 6.12.x just doesn't ask frequently enough.

Claggy

If I have a bunch of systems that are headless servers (no displays or GPUs) is there any reason to not to downgrade to 6.10.x?

No, i don't think so,

Claggy
ID: 1175234 · Report as offensive
mikeej42

Send message
Joined: 26 Oct 00
Posts: 109
Credit: 791,863,759
RAC: 0
United States
Message 1175237 - Posted: 2 Dec 2011, 22:08:23 UTC - in response to Message 1175234.  

Okay thanks. I downgraded one one host and it actually was able to get 5 ap task downloads completed that I have been clicking retry for 3 days.
ID: 1175237 · Report as offensive
tbret
Volunteer tester
Avatar

Send message
Joined: 28 May 99
Posts: 3378
Credit: 276,625,387
RAC: 112,052
United States
Message 1175308 - Posted: 3 Dec 2011, 6:07:43 UTC - in response to Message 1175237.  

Okay thanks. I downgraded one one host and it actually was able to get 5 ap task downloads completed that I have been clicking retry for 3 days.



Yeah, something "spooky" is happening at whatever distance you are from Berkeley.

But, maybe it's just the act of restoring everything to defaults and clearing calculated values that got all funkified in an asynchronistic dance of data death. Without the twirling tassles... or so I've heard.
ID: 1175308 · Report as offensive
Kevin Olley

Send message
Joined: 3 Aug 99
Posts: 762
Credit: 83,343,497
RAC: 167,370
United Kingdom
Message 1175318 - Posted: 3 Dec 2011, 7:21:27 UTC

For the first time in a while I have got enough GPU WU's to last me over 24 Hrs.

Almost up to full cache and less than half are shorties.



Kevin


ID: 1175318 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6005
Credit: 83,677,689
RAC: 25,533
Russia
Message 1175419 - Posted: 3 Dec 2011, 17:31:49 UTC

Looks like it's now not possible to get enough work w/o manual intervention.
I didn't hit "update" button last few days - how few times was out of work with many tasks wating download and big project backoffs...

NB, after hitting update few times I get quite good download speeds. So, there is some bandwidth available, it's just a server dropping connections!
Bad network config again, apparently...
ID: 1175419 · Report as offensive
kittyman Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 50057
Credit: 931,297,612
RAC: 226,932
United States
Message 1175421 - Posted: 3 Dec 2011, 17:35:59 UTC - in response to Message 1175419.  

Everything's tied up pretty tight now...
Incredible shorty storm. Over 98,000 results/hour coming back to the servers as of the last status update.

Meeouch.
Happy is the person who shares their life with a cat. (Or two or three or........) =^.^=

Have made friends here.
Most were cats.
ID: 1175421 · Report as offensive
Kevin Olley

Send message
Joined: 3 Aug 99
Posts: 762
Credit: 83,343,497
RAC: 167,370
United Kingdom
Message 1175431 - Posted: 3 Dec 2011, 18:24:33 UTC - in response to Message 1175421.  

Everything's tied up pretty tight now...
Incredible shorty storm. Over 98,000 results/hour coming back to the servers as of the last status update.

Meeouch.


Looks like I shouted to soon, now over 75% shorties on GPU's.


Kevin


ID: 1175431 · Report as offensive
Miklos M.

Send message
Joined: 5 May 99
Posts: 875
Credit: 20,222,998
RAC: 944
United States
Message 1175443 - Posted: 3 Dec 2011, 19:22:58 UTC

I got two ap units last night. So, my 6 cores are crunching Einstein, until they get fed ap's.
ID: 1175443 · Report as offensive
kittyman Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 50057
Credit: 931,297,612
RAC: 226,932
United States
Message 1175444 - Posted: 3 Dec 2011, 19:23:56 UTC

Possible panic looming?

I have noticed over the last hour or so multiple rigs going through scheduler request failures and backoffs when trying to report work.

Might be bad juju.
Happy is the person who shares their life with a cat. (Or two or three or........) =^.^=

Have made friends here.
Most were cats.
ID: 1175444 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,278,688
RAC: 190
United Kingdom
Message 1175445 - Posted: 3 Dec 2011, 19:24:30 UTC - in response to Message 1175444.  
Last modified: 3 Dec 2011, 19:24:58 UTC

Possible panic looming?

I have noticed over the last hour or so multiple rigs going through scheduler request failures and backoffs when trying to report work.

Might be bad juju.

Same here,

Claggy
ID: 1175445 · Report as offensive
Profile Khangollo
Avatar

Send message
Joined: 1 Aug 00
Posts: 245
Credit: 36,410,524
RAC: 0
Slovenia
Message 1175446 - Posted: 3 Dec 2011, 19:27:30 UTC
Last modified: 3 Dec 2011, 19:28:21 UTC

Servers, once again, DDoS-ed themselves to death, didn't they?
All I'm getting are HTTP timeouts on any kind of request and when scheduler succeeds, it's no work available.
Quack quack.
ID: 1175446 · Report as offensive
Tutankhamon
Volunteer tester
Avatar

Send message
Joined: 1 Nov 08
Posts: 7253
Credit: 45,236,425
RAC: 5,172
Sweden
Message 1175448 - Posted: 3 Dec 2011, 19:32:37 UTC - in response to Message 1175444.  

Possible panic looming?

I have noticed over the last hour or so multiple rigs going through scheduler request failures and backoffs when trying to report work.

Might be bad juju.


Yup, same here. I'm going down to the basement, starting up my alcohol distiller. Time to become unconscious, before the servers go belly up.

LOL
Too much hormone treated meat.
Too much Monsanto veggies.
Too old and outdated constitution.
A crazy problem, as you Yanks use to say......

There is no God, and God never existed.
ID: 1175448 · Report as offensive
Starman
Avatar

Send message
Joined: 15 May 99
Posts: 202
Credit: 72,913,063
RAC: 9,722
Canada
Message 1175452 - Posted: 3 Dec 2011, 19:40:50 UTC

Having trouble getting a good mix of CPU and GPU WU. My one rig hasn't seen a GPU WU for well over a week. While my main cruncher has lots of GPU work, but is struggling to get enough CPU W/C. It's get's a handfull to last a few hours and then runs dry again.
Gigabyte Z170X-UD5
i7-6700K
32 MB Corsair Vengeance LPX 2400mhz
Samsung 850 Pro SSD 512GB
WD Caviar Black 4.0TB
WD Caviar Black 2.0TB
Corsair HX850i
Corsair H80iGT
MSI R9-380 Gaming 4G
Visiontek HD7870 2G
Corsair Obsidian 450D
ID: 1175452 · Report as offensive
Profile SciManStev Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6343
Credit: 111,444,788
RAC: 58,732
United States
Message 1175462 - Posted: 3 Dec 2011, 19:55:25 UTC

Yep, can't report here either, and on my way to empty again....

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1175462 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1175468 - Posted: 3 Dec 2011, 21:03:24 UTC - in response to Message 1175462.  

No problem here. My little two-banger+one is getting a little bit greedy. I've got over three hundred in progress and it just got five more. A nice mix of CPU and GPU work to keep me busy. Just finished my last two APs though. Guess we won't get anymore of them until this shorty storm passes.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1175468 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 10189
Credit: 136,368,949
RAC: 87,567
Australia
Message 1175488 - Posted: 3 Dec 2011, 22:40:02 UTC - in response to Message 1175468.  
Last modified: 3 Dec 2011, 23:00:58 UTC

Inbound network traffic has dropped off a lot, although still high. But my uploads are all accumulating.
I'm guessing all the present inbound traffic is Scheduler requests- no uploads are going through.


Edit- inbound traffic has picked up a bit, but not back to where it was. Some uploads have gone through, but the backlog continues to grow.
Grant
Darwin NT
ID: 1175488 · Report as offensive
1 · 2 · 3 · 4 . . . 11 · Next

Message boards : Number crunching : Panic Mode On (62) Server problems?


 
©2018 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.