Panic Mode On (84) Server Problems?

Message boards : Number crunching : Panic Mode On (84) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 20 · Next

AuthorMessage
Profile Vipin Palazhi
Avatar

Send message
Joined: 29 Feb 08
Posts: 286
Credit: 167,386,578
RAC: 0
India
Message 1378190 - Posted: 7 Jun 2013, 18:14:30 UTC - in response to Message 1378149.  

Few of my crunchers have fallen back on Einstein. And looks as though the beer companies in Juan's neighborhood would be a happy lot.
______________

ID: 1378190 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22720
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1378210 - Posted: 7 Jun 2013, 18:39:29 UTC

I just hope they stuff some more tapes in the slot soon...
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1378210 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1378211 - Posted: 7 Jun 2013, 18:45:45 UTC - in response to Message 1378130.  

Totaly out of MB work and no one left on cache, my RAC now will fall as an asteroid!

Panic Mode ON...

I belive is better to go to buy some more beer, this will be a long weekend...

You do have AP on GPU enabled, no?
I now have 943 of them....no rest for my GPUs.


You might want to try putting some parameters in your ap_cmdline_win_x86_SSE2_OpenCL_NV.txt file, the following is a good start:

-unroll 10 -ffa_block 6144 -ffa_block_fetch 1536

Claggy
ID: 1378211 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1378212 - Posted: 7 Jun 2013, 18:46:15 UTC - in response to Message 1378210.  

I just hope they stuff some more tapes in the slot soon...

One just loaded.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1378212 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378266 - Posted: 7 Jun 2013, 20:25:59 UTC - in response to Message 1378212.  

I just hope they stuff some more tapes in the slot soon...

One just loaded.

Unfortunately nothing is being split.
Current result creation rate is 0.2658 (ie 0), and has been for several hours. Almost all of my requsts for work are for the last 5+ hours are resulting in "project has no tasks avalable" messages, with just the odd one or two getting work- often only 1 WU.
I think something's jammed up somewhere.
Grant
Darwin NT
ID: 1378266 · Report as offensive
Oddbjornik Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 220
Credit: 349,610,548
RAC: 1,728
Norway
Message 1378267 - Posted: 7 Jun 2013, 20:26:23 UTC - in response to Message 1378211.  
Last modified: 7 Jun 2013, 20:26:55 UTC


You might want to try putting some parameters in your ap_cmdline_win_x86_SSE2_OpenCL_NV.txt file, the following is a good start:

-unroll 10 -ffa_block 6144 -ffa_block_fetch 1536

Claggy


I've got a GTX 680 and a Quadro K2000M, each of which has loads and loads of memory.

I run two tasks simultaneously on both.

Is there any way I can put all that memory to use? I've set
-unroll 12 -ffa_block 8192 -ffa_block_fetch 4096
on both, as recommended in the readme.

Is there anything to gain from increasing those values further?
ID: 1378267 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1378284 - Posted: 7 Jun 2013, 21:31:46 UTC - in response to Message 1378212.  
Last modified: 7 Jun 2013, 21:35:57 UTC

I just hope they stuff some more tapes in the slot soon...

One just loaded.

Still no new AP or MB work, now all my 3 fastest hosts are running empty of GPU work. The good thing is my power drain drops more than 1.5 kw at least. So i could save some money to buy more beers.
ID: 1378284 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378285 - Posted: 7 Jun 2013, 21:31:52 UTC - in response to Message 1378266.  

Unfortunately nothing is being split.
Current result creation rate is 0.2658 (ie 0), and has been for several hours. Almost all of my requsts for work are for the last 5+ hours are resulting in "project has no tasks avalable" messages, with just the odd one or two getting work- often only 1 WU.
I think something's jammed up somewhere.


One good thing about the extra run time is that the limited cache lasts longer.
Even with that, at the present rate i'll be out of work before nightfall if the splitters don't start splitting again.
Grant
Darwin NT
ID: 1378285 · Report as offensive
Sir Mick
Volunteer tester

Send message
Joined: 18 Mar 13
Posts: 38
Credit: 106,756,204
RAC: 0
United States
Message 1378291 - Posted: 7 Jun 2013, 22:00:27 UTC

No work since this morning, any problems?
ID: 1378291 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378293 - Posted: 7 Jun 2013, 22:02:51 UTC - in response to Message 1378291.  

No work since this morning, any problems?

The splitters aren't splitting.
Grant
Darwin NT
ID: 1378293 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378311 - Posted: 7 Jun 2013, 22:41:37 UTC - in response to Message 1378285.  

One good thing about the extra run time is that the limited cache lasts longer.
Even with that, at the present rate i'll be out of work before nightfall if the splitters don't start splitting again.

Looks like i'll be out of GPU work by lunch time.
Won't help the plummeting RAC at all.
Grant
Darwin NT
ID: 1378311 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 37592
Credit: 261,360,520
RAC: 489
Australia
Message 1378314 - Posted: 7 Jun 2013, 22:50:56 UTC - in response to Message 1378311.  

One good thing about the extra run time is that the limited cache lasts longer.
Even with that, at the present rate i'll be out of work before nightfall if the splitters don't start splitting again.

Looks like i'll be out of GPU work by lunch time.
Won't help the plummeting RAC at all.

My 660's have been out of SETI work for 3hrs now and my 550Ti's just switched over to their backup project. :-(

Cheers.
ID: 1378314 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378325 - Posted: 7 Jun 2013, 23:00:41 UTC - in response to Message 1378314.  
Last modified: 7 Jun 2013, 23:00:56 UTC

If someone in the loop could let the staff know there's a problem with the splitters before they knock off for the weekend it would be appreciated.
Grant
Darwin NT
ID: 1378325 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378357 - Posted: 8 Jun 2013, 0:09:50 UTC - in response to Message 1378325.  
Last modified: 8 Jun 2013, 0:43:19 UTC

Just about out of GPU work. Didn't even make to close to lunch.

The splitters certainly are having issues, on the Server Status page sometimes all but one shows as green, then it's mostly red. Even when they are running 10/s is the most i've seen in the last 24hours or so. Even with the reduced load of v7 20/s seems to be the minimum to meet demand & still build up (although very slowly) a ready-to-send buffer.


Ever since the MB splitters were converted to PFB their output has been seriously limited. With several of them now repeatedly failing that limited output is making things even worse.



EDIT- i should have posted earlier.
The *very* second i hit OK for this post, i got 59 WUs for one of my systems GPUs. Another couple of hundred WUs all round & i should be good till tomorrow.
Grant
Darwin NT
ID: 1378357 · Report as offensive
spitfire_mk_2
Avatar

Send message
Joined: 14 Apr 00
Posts: 563
Credit: 27,306,885
RAC: 0
United States
Message 1378358 - Posted: 8 Jun 2013, 0:14:45 UTC

I did an upgrade to BOINC 7.0.64 earlier today. Imagine my surprise when I could not get any work afterward. However, I did got 4 AP tasks a few hours later.

(there is always Einstein work if you feel neglected by seti)
ID: 1378358 · Report as offensive
tbret
Volunteer tester
Avatar

Send message
Joined: 28 May 99
Posts: 3380
Credit: 296,162,071
RAC: 40
United States
Message 1378816 - Posted: 9 Jun 2013, 6:25:01 UTC - in response to Message 1378358.  

I did an upgrade to BOINC 7.0.64 earlier today. Imagine my surprise when I could not get any work afterward. However, I did got 4 AP tasks a few hours later.

(there is always Einstein work if you feel neglected by seti)


I haven't read through this whole thread carefully, so ignore me if I say something stupid. I'd better amend that: Ignore me if I say something even more stupid than you would expect.

You did go to Preferences and enable v7 work, didn't you?
ID: 1378816 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1378820 - Posted: 9 Jun 2013, 6:34:16 UTC - in response to Message 1378358.  

I did an upgrade to BOINC 7.0.64 earlier today. Imagine my surprise when I could not get any work afterward. However, I did got 4 AP tasks a few hours later.

If you upgraded from BOINC v6 you need to swap the values around in the cache settings. They changed in v7.

Grant
Darwin NT
ID: 1378820 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1378958 - Posted: 9 Jun 2013, 14:54:54 UTC - in response to Message 1378820.  
Last modified: 9 Jun 2013, 14:58:43 UTC

I did an upgrade to BOINC 7.0.64 earlier today. Imagine my surprise when I could not get any work afterward. However, I did got 4 AP tasks a few hours later.

If you upgraded from BOINC v6 you need to swap the values around in the cache settings. They changed in v7.

No, the cache settings changed between BOINC v5 and v6.
There was a change early in BOINC v7 that affected flops/APR, but not cache settings.
Yeah, here is is, and it mostly applies to Anonymous Platform apps.
Donald
Infernal Optimist / Submariner, retired
ID: 1378958 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1378984 - Posted: 9 Jun 2013, 16:04:52 UTC - in response to Message 1378958.  
Last modified: 9 Jun 2013, 16:05:44 UTC

If you upgraded from BOINC v6 you need to swap the values around in the cache settings. They changed in v7.

No, the cache settings changed between BOINC v5 and v6.

No, in BOINC 6 the amount of work BOINC asked for was minimum + additional, and it'd ask for it at any time that work was reported. In BOINC 7, we have the minimum + additional amount, but BOINC only asks for work after it's gotten below the minimum mark.

This means that when you have left your old BOINC 6 values for "Connect to" + "Additional work" at, example given 0.1 and 1.0, that BOINC 7.0 will ask for 1.1 days worth of work and ONLY renew this cache when it's fallen under the 0.1 days worth of work limit. Which means that it can happen that your BOINC runs empty, because 7.0 won't request new work before it has dropped below the 'minimum work' setting and will only ask for work up to the 'and additional' setting --and that only from the project that has the highest priority (worst REC to resource share ratio). Only if that project doesn't have work it will ask other projects in order of priority.

By changing the values around, 1.0 and 0.1, the minimum is 1.0 days. So when BOINC finds the cache falls under that value, it'll ask for new work and top it off to 1.1 or more days of work.
ID: 1378984 · Report as offensive
spitfire_mk_2
Avatar

Send message
Joined: 14 Apr 00
Posts: 563
Credit: 27,306,885
RAC: 0
United States
Message 1379014 - Posted: 9 Jun 2013, 17:31:19 UTC - in response to Message 1378816.  

I did an upgrade to BOINC 7.0.64 earlier today. Imagine my surprise when I could not get any work afterward. However, I did got 4 AP tasks a few hours later.

(there is always Einstein work if you feel neglected by seti)


I haven't read through this whole thread carefully, so ignore me if I say something stupid. I'd better amend that: Ignore me if I say something even more stupid than you would expect.

You did go to Preferences and enable v7 work, didn't you?

Everything is working fine. A few hours after I posted, I got a bunch of seti 7 work for cpu and gpu.


Also, regarding upgrading. I upgraded from 7.0.25 to 7.0.64. So I was running BOINC 7 all along.
ID: 1379014 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 20 · Next

Message boards : Number crunching : Panic Mode On (84) Server Problems?


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.