Panic Mode On (74) Server problems?

Message boards : Number crunching : Panic Mode On (74) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 11 · Next

AuthorMessage
Profile James Sotherden
Avatar

Send message
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1227613 - Posted: 5 May 2012, 4:58:33 UTC

Im having problems with downloads not wanting to download.
[/quote]

Old James
ID: 1227613 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65689
Credit: 55,293,173
RAC: 49
United States
Message 1227622 - Posted: 5 May 2012, 5:20:40 UTC - in response to Message 1227613.  

Im having problems with downloads not wanting to download.

Me too and currently I have 3 that are either in slow mode or are simply stuck.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1227622 · Report as offensive
Profile Area 51
Avatar

Send message
Joined: 31 Jan 04
Posts: 965
Credit: 42,193,520
RAC: 0
United Kingdom
Message 1228425 - Posted: 6 May 2012, 20:26:44 UTC

No issues for me - until now. The cricket's got an unhealthy gap forming.....
ID: 1228425 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1228445 - Posted: 6 May 2012, 21:15:36 UTC - in response to Message 1228425.  
Last modified: 6 May 2012, 21:57:16 UTC

Network traffic has dropped off, Ready to Send still has plenty to go, the last dozen or so requests for work have got "Project has no tasks available" messages.
There was a spike in database activity for a while there- almost 3,000 queries per second, usually it's aronound 600. Hopefully once it settles down again we'll be able to get more work.
Grant
Darwin NT
ID: 1228445 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1228453 - Posted: 6 May 2012, 21:34:41 UTC - in response to Message 1228445.  


Network traffic has dropped off, Ready to Send still has plenty to go, the last dozen or so requests for work have got "Project has no tasks available" messages.
There was a spike in databse activity for a while there- almost 3,000 queries per second, usually it's aronound 600. Hopefully once it settles down again we'll be able to get more work.

Probably generating the daily stats dump.
It's at 636 now.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1228453 · Report as offensive
Profile Area 51
Avatar

Send message
Joined: 31 Jan 04
Posts: 965
Credit: 42,193,520
RAC: 0
United Kingdom
Message 1228458 - Posted: 6 May 2012, 21:47:00 UTC

....false alarm - back up to full tilt.
ID: 1228458 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1229469 - Posted: 9 May 2012, 9:25:24 UTC - in response to Message 1220675.  


Hmm, there's a blockage somewhere.
Plenty of work available, but network traffic isn't maxed out. After an outage, and with the limits raised further (or even removed) it should remain maxed out probably until the next outage.
Grant
Darwin NT
ID: 1229469 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1229475 - Posted: 9 May 2012, 9:51:14 UTC - in response to Message 1229469.  


Hmm, there's a blockage somewhere.
Plenty of work available, but network traffic isn't maxed out. After an outage, and with the limits raised further (or even removed) it should remain maxed out probably until the next outage.

My last bunch came down without problems and so fast I went to see why...

Server Status Page shows no AP are being split. AP account for quite a bit of the bandwidth as we've seen in the past when there were none. Good news for filling the caches all round.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1229475 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1229476 - Posted: 9 May 2012, 9:57:37 UTC - in response to Message 1229475.  


Hmm, there's a blockage somewhere.
Plenty of work available, but network traffic isn't maxed out. After an outage, and with the limits raised further (or even removed) it should remain maxed out probably until the next outage.

My last bunch came down without problems and so fast I went to see why...

Server Status Page shows no AP are being split. AP account for quite a bit of the bandwidth as we've seen in the past when there were none. Good news for filling the caches all round.

09/05/2012 10:46:59 | SETI@home | Scheduler request completed: got 36 new tasks

And not a single shorty among them.
ID: 1229476 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1229478 - Posted: 9 May 2012, 10:04:58 UTC - in response to Message 1229475.  

Server Status Page shows no AP are being split. AP account for quite a bit of the bandwidth as we've seen in the past when there were none. Good news for filling the caches all round.

True, but considering the network traffic before the outage & the raising of the limits, i wouldn't expect the network traffic to drop off for at least 3 days.
And when network traffic isn't at maximum resuests for work rarely result in "Project has no tasks available" messages, but 80% of my scheduler requests are resulting in that message indcates something isn't quite right.
Grant
Darwin NT
ID: 1229478 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1229549 - Posted: 9 May 2012, 15:14:34 UTC

My guess is that we are again seeing some kind of scheduler/feeder limitation.
I agree that even with no AP using bandwidth, MB alone has shown the capability of fully saturating the bandwidth.

On the other hand, NOT saturating the bandwidth may actually be making better use of it......
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1229549 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1229581 - Posted: 9 May 2012, 16:33:38 UTC - in response to Message 1229549.  

My guess is that we are again seeing some kind of scheduler/feeder limitation.
I agree that even with no AP using bandwidth, MB alone has shown the capability of fully saturating the bandwidth.

On the other hand, NOT saturating the bandwidth may actually be making better use of it......

My machines are no longer uploading/requesting tasks 1 or 2 at a time. As they seem to have filled to their cache settings. So we may be looking at a normal bandwidth graph again. Which is how it would often look in the days before limits sans AP or shorties.
Not to say all requests are being fulfilled. Just that there are not so many transfers in progress to keep the bandwidth pegged 24/7.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1229581 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1230054 - Posted: 10 May 2012, 16:36:54 UTC
Last modified: 10 May 2012, 16:37:54 UTC

With the increased limits and the scheduler/feeder not having tasks available all the time....

The dang Boinc scheduler bug is kicking up again.

My #1 rig, not banging up against the limits anymore, is getting plenty of work for the GPU, but the scheduler is once again letting the CPUs go idle, not sending them a drop of work because the GPU cache is not full yet.
So the CPUs are twiddling their thumbs.

Dang it, DA....please quit starving the slower resources completely just because the fastest ones do not have their caches full!!!
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1230054 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1230066 - Posted: 10 May 2012, 17:00:42 UTC - in response to Message 1230054.  

With the increased limits and the scheduler/feeder not having tasks available all the time....

The dang Boinc scheduler bug is kicking up again.

My #1 rig, not banging up against the limits anymore, is getting plenty of work for the GPU, but the scheduler is once again letting the CPUs go idle, not sending them a drop of work because the GPU cache is not full yet.
So the CPUs are twiddling their thumbs.

Dang it, DA....please quit starving the slower resources completely just because the fastest ones do not have their caches full!!!

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1230066 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1230068 - Posted: 10 May 2012, 17:07:14 UTC - in response to Message 1230066.  

With the increased limits and the scheduler/feeder not having tasks available all the time....

The dang Boinc scheduler bug is kicking up again.

My #1 rig, not banging up against the limits anymore, is getting plenty of work for the GPU, but the scheduler is once again letting the CPUs go idle, not sending them a drop of work because the GPU cache is not full yet.
So the CPUs are twiddling their thumbs.

Dang it, DA....please quit starving the slower resources completely just because the fastest ones do not have their caches full!!!

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.


I don't believe this has ANYTHING to do with the Boinc client.
The host continually asks for GPU 'AND' CPU tasks. But is repeatedly ONLY sent GPU work.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1230068 · Report as offensive
Profile shizaru
Volunteer tester
Avatar

Send message
Joined: 14 Jun 04
Posts: 1130
Credit: 1,967,904
RAC: 0
Greece
Message 1230070 - Posted: 10 May 2012, 17:11:00 UTC - in response to Message 1230068.  

I don't believe ANYTHING that has to do with the Boinc client.


There, I fixed it:)
ID: 1230070 · Report as offensive
Profile red-ray
Avatar

Send message
Joined: 24 Jun 99
Posts: 308
Credit: 9,029,848
RAC: 0
United Kingdom
Message 1230072 - Posted: 10 May 2012, 17:13:57 UTC - in response to Message 1230068.  
Last modified: 10 May 2012, 17:52:33 UTC

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.

No, I have 7.0.25 on my QX6700 and it's got the same problem, so having V7 does not help with this server issue.

I would like to see a bigger fifo so fewer requests are needed to replenish the cache.
ID: 1230072 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1230073 - Posted: 10 May 2012, 17:15:03 UTC - in response to Message 1230072.  

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.

No, I have 7.0.25 on my QX6700 and it's got the same problem.

It's not the client....
It's the what the scheduler logic does with the client request.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1230073 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14644
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1230078 - Posted: 10 May 2012, 17:33:06 UTC - in response to Message 1230073.  

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.

No, I have 7.0.25 on my QX6700 and it's got the same problem.

It's not the client....
It's the what the scheduler logic does with the client request.

And by scheduler, Mark means the scheduler that runs on the server - that is indeed where this particular problem lies.
ID: 1230078 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1230081 - Posted: 10 May 2012, 17:34:51 UTC - in response to Message 1230078.  
Last modified: 10 May 2012, 17:51:47 UTC

I thought there was talk about that being corrected in the v7 client, but then there is the odd high/low work fetch system it uses.

No, I have 7.0.25 on my QX6700 and it's got the same problem.

It's not the client....
It's the what the scheduler logic does with the client request.

And by scheduler, Mark means the scheduler that runs on the server - that is indeed where this particular problem lies.

Thank you, Richard.

Of my top 3 rigs, 2 are now running GPU only due to this bug.
The only reason the 3rd is not is that the CPU is running on cached AP work with the manually installed AP app. Otherwise, it would be in the same boat.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1230081 · Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 11 · Next

Message boards : Number crunching : Panic Mode On (74) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.