Panic Mode On (71) Server problems?

Profile arkayn
Volunteer tester
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1203912 - Posted: 8 Mar 2012, 23:30:27 UTC

Now if only my video cards would not keep dying on me.

ID: 1203912 · Report as offensive
Profile ivan
Volunteer tester
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1203916 - Posted: 8 Mar 2012, 23:52:45 UTC - in response to Message 1203912.  

Now if only my video cards would not keep dying on me.

Yep, I know that feelin'! [copyright an old ad for French parfum...]
Actually, people have been complaining that they can't get enough CPU WUs -- I seem to have the opposite problem: my faster machines keep asking for new CPU jobs, not GPU ones, and of course the scheduler says, "Nope, you've reached a limit!" E.g. the Supermicro has 2x4-core Xeons, not running hyperthreading, a C1060 and a GTX 460 so it should be getting an (8x50)+(2x400) limit, but it's struggling to get past 666 jobs in the queue because it's always asking for, and being denied, CPU jobs.
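
For reference, the arithmetic behind that expected limit, as a minimal Python sketch. The 50-per-core and 400-per-GPU figures are the ones quoted above; the function and constant names are made up for illustration and are not anything from the scheduler itself.

```python
# Hypothetical helper: per-core and per-GPU limits are the figures quoted in the post.
CPU_LIMIT_PER_CORE = 50
GPU_LIMIT_PER_DEVICE = 400


def expected_task_limit(cpu_cores: int, gpus: int) -> int:
    """Total in-progress task limit implied by the quoted per-resource limits."""
    return cpu_cores * CPU_LIMIT_PER_CORE + gpus * GPU_LIMIT_PER_DEVICE


# The Supermicro: 2 x 4-core Xeons (no hyperthreading) plus a C1060 and a GTX 460.
supermicro_limit = expected_task_limit(cpu_cores=8, gpus=2)
print(supermicro_limit)        # 1200
print(supermicro_limit - 666)  # 534 tasks short of the expected limit
```
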
Answers on a postcard, please...
ID: 1203916 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1203918 - Posted: 9 Mar 2012, 0:03:13 UTC - in response to Message 1203916.  
Last modified: 9 Mar 2012, 0:03:32 UTC

Answers on a postcard, please...

Just deselect 'Use CPU' in your project preferences until you've got lots of GPU tasks.

Claggy
ID: 1203918 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65785
Credit: 55,293,173
RAC: 49
United States
Message 1203959 - Posted: 9 Mar 2012, 1:27:28 UTC - in response to Message 1203916.  

Now if only my video cards would not keep dying on me.

Yep, I know that feelin'! [copyright an old ad for French parfum...]
Actually, people have been complaining that they can't get enough CPU WUs -- I seem to have the opposite problem: my faster machines keep asking for new CPU jobs, not GPU ones, and of course the scheduler says, "Nope, you've reached a limit!" E.g. the Supermicro has 2x4-core Xeons, not running hyperthreading, a C1060 and a GTX 460 so it should be getting an (8x50)+(2x400) limit, but it's struggling to get past 666 jobs in the queue because it's always asking for, and being denied, CPU jobs.
Answers on a postcard, please...

Maybe that's Yer problem Ivan... ;)
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1203959 · Report as offensive
Profile cliff
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1204017 - Posted: 9 Mar 2012, 4:44:07 UTC

The answer to what's going on lies in Technical News, in the latest post from Matt about 3 hrs ago :-)

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1204017 · Report as offensive
Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1204024 - Posted: 9 Mar 2012, 5:10:45 UTC - in response to Message 1203912.  

Now if only my video cards would not keep dying on me.


no ... you want them to die ... it's the excuse you need to get new ones ...

ID: 1204024 · Report as offensive
Profile Belthazor
Volunteer tester
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1204030 - Posted: 9 Mar 2012, 5:52:09 UTC

Bane is dead! R.I.P.
You were a nice scheduler anyway. I'll miss you!
ID: 1204030 · Report as offensive
Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1204053 - Posted: 9 Mar 2012, 6:21:13 UTC - in response to Message 1204035.  


the only bane in my life is the fracking limits ... i would like to see these die as well ...
ID: 1204053 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13753
Credit: 208,696,464
RAC: 304
Australia
Message 1204056 - Posted: 9 Mar 2012, 6:25:52 UTC - in response to Message 1204053.  

the only bane in my life is the fracking limits ... i would like to see these die as well ...

Not going to happen till the DCF problem is sorted.
Grant
Darwin NT
ID: 1204056 · Report as offensive
Profile ivan
Volunteer tester
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1204093 - Posted: 9 Mar 2012, 10:06:00 UTC - in response to Message 1203959.  

Now if only my video cards would not keep dying on me.

Yep, I know that feelin'! [copyright an old ad for French parfum...]
Actually, people have been complaining that they can't get enough CPU WUs -- I seem to have the opposite problem: my faster machines keep asking for new CPU jobs, not GPU ones, and of course the scheduler says, "Nope, you've reached a limit!" E.g. the Supermicro has 2x4-core Xeons, not running hyperthreading, a C1060 and a GTX 460 so it should be getting an (8x50)+(2x400) limit, but it's struggling to get past 666 jobs in the queue because it's always asking for, and being denied, CPU jobs.
Answers on a postcard, please...

Maybe that's Yer problem Ivan... ;)

Devilishly difficult, innit?
ID: 1204093 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1204097 - Posted: 9 Mar 2012, 10:55:17 UTC

Well I'm empty and idle now. 10 days worth of APs have come and gone. Onward to waiting for the AP splitters to be updated and turned back on.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1204097 · Report as offensive
kittyman
Volunteer tester
Joined: 9 Jul 00
Posts: 51469
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1204098 - Posted: 9 Mar 2012, 10:59:06 UTC - in response to Message 1204097.  

Well I'm empty and idle now. 10 days worth of APs have come and gone. Onward to waiting for the AP splitters to be updated and turned back on.

Get some MB to get your Johnny on, and then wait.

"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1204098 · Report as offensive
musicplayer

Joined: 17 May 10
Posts: 2430
Credit: 926,046
RAC: 0
Message 1204212 - Posted: 9 Mar 2012, 17:40:06 UTC

I got a two-second glimpse of something in one of my tasks.

Apparently BoincLogX said 13.50 for the Gaussian score.

But checking SMV, I only get -2.613185 for the same score.

The task in question was 10ja12aa.24707.17823.8.10_132_0_0.

Running by means of CUDA only.

But I may have been wrong.
ID: 1204212 · Report as offensive
Profile cliff
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1204230 - Posted: 9 Mar 2012, 18:23:17 UTC
Last modified: 9 Mar 2012, 18:27:09 UTC

Well, it's beginning to take the biscuit. I have zero GPU tasks and loads of CPU tasks, and the server status page says there is tons of work available. However, nearly every request for work gets the response 'no work available'.

Does that mean that of all the tasks shown on the status page not one is a GPU task, that they are all CPU ones?

Over the past couple of hours of manual updates I've had 4 GPU tasks; if it weren't for e@h my GPUs would be totally idle.

Would someone kindly give that sched server a tiny kick in the rear end and free the apparent logjam?
[edit]
Obliged to the prompt person at Berkeley, I just got 2 GPU tasks
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1204230 · Report as offensive
Profile cliff
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1204238 - Posted: 9 Mar 2012, 18:31:41 UTC - in response to Message 1204236.  

Ahh well, at least I got 2 WUs in the last minute or so,
but I'd have thought you would just love to have a lady jump all over you:-)

Cheers,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1204238 · Report as offensive
LadyL
Volunteer tester
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1204257 - Posted: 9 Mar 2012, 19:09:40 UTC
Last modified: 9 Mar 2012, 19:13:03 UTC

I thought I'd put that into the FAQ. It's certainly been discussed often enough. OK, for the umpteenth time...

a) There are no 'CPU' or 'GPU' tasks. They are all the same. Some of them will be marked .vlar, and those will not go out to CUDA GPUs on account of bringing down most machines.

b) The feeder holds 100 tasks. It refills at regular intervals. The scheduler looks in the feeder to see whether the type of task you are requesting (AP/MB) is available. If it isn't, or if there is only VLAR on a CUDA GPU request, or if the feeder is already empty, you get 'no tasks available'; if there were only VLAR on a GPU request you get 'no tasks sent'.
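
A rough Python sketch of the logic in a) and b), for anyone who finds code easier to follow. This is not the actual BOINC scheduler or feeder code: the `Task` class, `handle_request`, and the 'tasks sent' reply are invented for illustration, and the sketch ignores refilling and removing sent tasks. The description above gives two different replies for the VLAR-only case; the sketch assumes the last one ('no tasks sent') applies.

```python
from dataclasses import dataclass

FEEDER_SLOTS = 100  # feeder capacity quoted in the post


@dataclass
class Task:
    name: str
    app: str            # "AP" or "MB"
    vlar: bool = False  # VLAR tasks are not sent to CUDA GPUs


def handle_request(feeder, app, resource):
    """Return (reply, tasks) for one work request; resource is 'cpu' or 'cuda'."""
    # Only the tasks currently sitting in the feeder slots are visible.
    matching = [t for t in feeder[:FEEDER_SLOTS] if t.app == app]
    if not matching:
        # Feeder empty, or nothing of the requested type in it right now.
        return "no tasks available", []
    # Tasks are the same for CPU and GPU, except that VLAR is withheld from CUDA.
    eligible = [t for t in matching if not (t.vlar and resource == "cuda")]
    if not eligible:
        # Assumption: the VLAR-only case on a GPU request yields this reply.
        return "no tasks sent", []
    return "tasks sent", eligible


feeder = [Task("wu1.vlar", "MB", vlar=True)]
print(handle_request(feeder, "MB", "cuda")[0])  # no tasks sent
print(handle_request(feeder, "MB", "cpu")[0])   # tasks sent
print(handle_request(feeder, "AP", "cpu")[0])   # no tasks available
```

With both CPU and GPU requests drawing on the same 100 slots, whichever resource asks first can take what is there, which is why switching one of them off for a while helps the other fill up.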

If you have trouble filling the cache for one resource but have plenty on the other, disable work fetch on the website for the resource that has plenty, so only one of them will ask. Don't forget to enable it again when you have enough.

P.S. You two can be thankful he found it funny. I didn't.
I'm not the Pope. I don't speak Ex Cathedra!
ID: 1204257 · Report as offensive
Profile RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 1204310 - Posted: 9 Mar 2012, 21:27:36 UTC - in response to Message 1204258.  

Something still seems wrong. RAC has been flat or falling for the last four days, even on machines which didn't run out of work during the last outage.
ID: 1204310 · Report as offensive
Profile cliff
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1204317 - Posted: 9 Mar 2012, 22:02:41 UTC - in response to Message 1204257.  

Thanks for the reminder. Sorry if I upset you with my jest; it was not so intended.

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1204317 · Report as offensive
AndrewM
Volunteer tester

Joined: 5 Jan 08
Posts: 369
Credit: 34,275,196
RAC: 0
Australia
Message 1204319 - Posted: 9 Mar 2012, 22:09:33 UTC

wuid=945484895

Another symptom perhaps. This task appears to have received double credit. Is that possible?
AndrewM
ID: 1204319 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1204341 - Posted: 9 Mar 2012, 23:24:13 UTC - in response to Message 1204319.  

wuid=945484895

Another symptom perhaps. This task appears to have received double credit. Is that possible?

WU true angle range is : 0.256601

That's a rare WU, up near the peak for the amount of processing needed. Those take much longer to crunch and earn proportionally more credit; that one had a ~59 day deadline, for instance.
Joe
ID: 1204341 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.