This computer has reached a limit on tasks in progress??

Message boards : Number crunching : This computer has reached a limit on tasks in progress??
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1011093 - Posted: 3 Jul 2010, 0:44:16 UTC - in response to Message 1011090.  
Last modified: 3 Jul 2010, 0:44:30 UTC


Network traffic has dropped right off & there is plenty of work ready to send, so I guess almost everyone has finally hit the task limit.


Good, maybe they'll lift the limit so we can build a cache.

Doesn't look like it...
It would appear that they may have increased the limit...say, from 20 to maybe 50 or something, but folks are still reporting hitting the in progress limit.
So they haven't opened up the spigot all the way just yet.
Maybe tomorrow?
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1011093 · Report as offensive
Rasputin
Volunteer tester

Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 1011102 - Posted: 3 Jul 2010, 0:55:35 UTC - in response to Message 1011083.  

Has anyone here generated any -12 errors? I have, 6 in the last 7 days... Out of thousands of good results...


I had 11 -12 errors with the 1200+ uploaded today. That's really unusual for my box; 1 or 2 errors a day in about 400 WUs is normal. And usually they're caused by running out of memory (because I forgot to snooze SETI when watching videos or playing a game), not by -12 errors.
ID: 1011102 · Report as offensive
Rasputin
Volunteer tester

Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 1011104 - Posted: 3 Jul 2010, 0:58:22 UTC - in response to Message 1011093.  


Network traffic has dropped right off & there is plenty of work ready to send, so I guess almost everyone has finally hit the task limit.


Good, maybe they'll lift the limit so we can build a cache.

Doesn't look like it...
It would appear that they may have increased the limit...say, from 20 to maybe 50 or something, but folks are still reporting hitting the in progress limit.
So they haven't opened up the spigot all the way just yet.
Maybe tomorrow?


Hopefully. I just hope they can do it remotely because nobody will come in to work on the weekend. Especially a holiday weekend.
ID: 1011104 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1011105 - Posted: 3 Jul 2010, 1:03:57 UTC - in response to Message 1011104.  

Hopefully. I just hope they can do it remotely because nobody will come in to work on the weekend. Especially a holiday weekend.

Bull#h!t.
They've done that on many occasions when the system hasn't been able to be restarted remotely, in spite of attitudes such as yours.
Grant
Darwin NT
ID: 1011105 · Report as offensive
Lint trap
Joined: 30 May 03
Posts: 871
Credit: 28,092,319
RAC: 0
United States
Message 1011107 - Posted: 3 Jul 2010, 1:07:31 UTC - in response to Message 1011083.  

Has anyone here generated any -12 errors? I have, 6 in the last 7 days... Out of thousands of good results...



Last -12 I reported was 26th of June. Reported one in May, too.

Martin
ID: 1011107 · Report as offensive
forward

Joined: 1 Jun 99
Posts: 53
Credit: 2,658,479
RAC: 0
United States
Message 1011108 - Posted: 3 Jul 2010, 1:07:59 UTC

Will they take Monday off for the 4th Holiday? If they do I hope the project stays up through the weekend.
ID: 1011108 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65746
Credit: 55,293,173
RAC: 49
United States
Message 1011109 - Posted: 3 Jul 2010, 1:13:03 UTC
Last modified: 3 Jul 2010, 1:17:33 UTC

Let's see: no work for GPUs (or any, for that matter), -12 errors, anonymous platform, 2.3 DLL files, anybody?
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1011109 · Report as offensive
Rasputin
Volunteer tester

Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 1011112 - Posted: 3 Jul 2010, 1:17:19 UTC - in response to Message 1011105.  

Hopefully. I just hope they can do it remotely because nobody will come in to work on the weekend. Especially a holiday weekend.

Bull#h!t.
They've done that on many occasions when the system hasn't been able to be restarted remotely, in spite of attitudes such as yours.


Chill out.. I'm not saying anything bad about the staff. I'm going by what Matt said about going on a three-week vacation starting today, and he also mentioned a while back (not long ago, don't feel like going through all his posts) that basically we shouldn't expect them to come in on weekends. Hell, maybe it was Eric that mentioned it. My memory isn't that great anymore.

I have great respect for all of the staff at SETI and I'm well aware they work their a**es off for this project and have come in numerous weekends.



ID: 1011112 · Report as offensive
perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1011113 - Posted: 3 Jul 2010, 1:22:23 UTC - in response to Message 1011109.  

If you're asking whether we are using the 2.3 DLLs, yes, I am. As I said, I only had the one -12 error over the outage. I now have the new limit of 60 WUs spread between my two CPUs and one 9500GT GPU. So you don't have to ask, I'm running optimized without the flops count added.

@forward...They probably will take Monday off but hopefully they will be able to raise the quotas remotely if they haven't got it set to raise automatically.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1011113 · Report as offensive
Ianab
Volunteer tester

Joined: 11 Jun 08
Posts: 732
Credit: 20,635,586
RAC: 5
New Zealand
Message 1011125 - Posted: 3 Jul 2010, 1:59:50 UTC - in response to Message 1011088.  
Last modified: 3 Jul 2010, 2:01:00 UTC

Unfortunately, some of us have hosts that will process more than 20 WUs at a time. Case in point is a dual Intel X5680 system that I have with 2 Fermi cards running two instances per card. That is a total of 28 tasks at a time that could be processed if this limitation was lifted. Also, I am building a system with quad sockets, where each CPU has 8 cores and Hyper-Threading, and two Fermi cards to boot. That will be 68 tasks processing at a time.

If this behavior is temporary and expected, that is fine, but if this is the way it is going to be, I am seriously considering other options for my resources. This system is being built as a mega-cruncher and has a significant price tag associated with it. If it can't be used to its full potential, I won't bother.

I'm not saying I'm dumping S@H or anything like that, but I do want to make it known that there are individuals with systems that are more than able to process more than 20 WUs at a time and a higher limit should be implemented.

Todd
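
For anyone checking the task counts quoted above, here is a rough sketch of the arithmetic. It assumes one CPU task per logical core, that the Xeon X5680 is a 6-core part with Hyper-Threading, and that the planned rig also runs two instances per Fermi card (the post doesn't say so, but the 68 figure implies it). The function name is made up for illustration.

```python
def concurrent_tasks(sockets, cores_per_socket, threads_per_core, gpus, instances_per_gpu):
    """Count tasks that can run at once: one per logical CPU core plus the GPU instances."""
    cpu_tasks = sockets * cores_per_socket * threads_per_core
    gpu_tasks = gpus * instances_per_gpu
    return cpu_tasks + gpu_tasks


# Dual X5680 (assumed 6 cores, 2 threads each) with 2 Fermi cards, 2 instances per card.
current_rig = concurrent_tasks(2, 6, 2, 2, 2)

# Planned quad-socket box: 8 cores per CPU with Hyper-Threading, 2 Fermi cards
# (2 instances per card is an assumption).
planned_rig = concurrent_tasks(4, 8, 2, 2, 2)

print(current_rig, planned_rig)
```

Under those assumptions the totals come out to the 28 and 68 given in the post, both above the original limit of 20.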


I can understand the frustration, but it's still a big improvement on the usual "turn on the tap" recovery from a long outage.

The usual process, if you remember, is that the server and network link get hammered to death for 12 hours or more, uploads fail, and therefore you can't download any new work anyway.

At least this way the completed units all uploaded, and SOME new work was sent out to most crunchers. A big improvement over the chaos that most people expected. A few have got caught out with only CPU or GPU, but at least they have some work, and can report / request more without the usual hassles.

I see the limit has been raised to 60 in progress now, with a corresponding jump in the network traffic, so I assume things are starting to get back to normal.

Hopefully full caches for everyone before Tuesday?

Ian
ID: 1011125 · Report as offensive
Ghery S. Pettit
Joined: 7 Nov 99
Posts: 325
Credit: 28,109,066
RAC: 82
United States
Message 1011126 - Posted: 3 Jul 2010, 2:04:02 UTC - in response to Message 1011093.  


Network traffic has dropped right off & there is plenty of work ready to send, so I guess almost everyone has finally hit the task limit.


Good, maybe they'll lift the limit so we can build a cache.

Doesn't look like it...
It would appear that they may have increased the limit...say, from 20 to maybe 50 or something, but folks are still reporting hitting the in progress limit.
So they haven't opened up the spigot all the way just yet.
Maybe tomorrow?


Looks like the new limit is 60. Two machines have gotten up to that many WUs. The third already had more (good cache going into the shutdown), so it's not getting any more for a while.
ID: 1011126 · Report as offensive
Geek@Play
Volunteer tester
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1011127 - Posted: 3 Jul 2010, 2:05:02 UTC - in response to Message 1011109.  
Last modified: 3 Jul 2010, 2:17:07 UTC

Let's see: no work for GPUs (or any, for that matter), -12 errors, anonymous platform, 2.3 DLL files, anybody?


@ VW Bobier

Since your CPU has 4 cores, BOINC will not ask for work as long as you have more than 2 × cores = 8 WUs waiting to upload. As soon as that drops to 8 or fewer, BOINC will start requesting work.
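
Put as code, that rule of thumb might look something like the sketch below. It only restates the rule as described in this post; it is not taken from the BOINC client source, and the function name is made up for illustration.

```python
def will_request_work(pending_uploads: int, n_cores: int) -> bool:
    """Return True if the client would ask for work under the rule described above.

    Assumed rule: no work requests while more than 2 * n_cores results
    are still waiting to upload.
    """
    upload_backlog_limit = 2 * n_cores
    return pending_uploads <= upload_backlog_limit


# A 4-core host: 9 results stuck in upload blocks requests, 8 does not.
blocked = will_request_work(pending_uploads=9, n_cores=4)
allowed = will_request_work(pending_uploads=8, n_cores=4)
print(blocked, allowed)
```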

I saw in another posting by you that you were still uploading cpu work.
Boinc....Boinc....Boinc....Boinc....
ID: 1011127 · Report as offensive
rebest (Project Donor)
Volunteer tester
Joined: 16 Apr 00
Posts: 1296
Credit: 45,357,093
RAC: 0
United States
Message 1011131 - Posted: 3 Jul 2010, 2:12:51 UTC

As of right now, my three rigs have work. I ran Rescheduler on my CUDA rig so I now have CUDA work as well. For now, that's all that's needed and I'm happy.

Now if the servers were to crash tonight, I'd only have about 12 hours of work on the CUDA rig even though my cache is set for 5 days. I've got the voodoo going.

Hopefully, whatever new load control experiment is being run proves successful so we can stock up on work prior to the scheduled outage next Tuesday. It did make for a successful restart after 3 days of being down.



Join the PACK!
ID: 1011131 · Report as offensive
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1011133 - Posted: 3 Jul 2010, 2:23:45 UTC - in response to Message 1011131.  
Last modified: 3 Jul 2010, 2:24:00 UTC

As of right now, my three rigs have work. I ran Rescheduler on my CUDA rig so I now have CUDA work as well. For now, that's all that's needed and I'm happy.

Now if the servers were to crash tonight, I'd only have about 12 hours of work on the CUDA rig even though my cache is set for 5 days. I've got the voodoo going.

Hopefully, whatever new load control experiment is being run proves successful so we can stock up on work prior to the scheduled outage next Tuesday. It did make for a successful restart after 3 days of being down.


Well...good news is the splitters just started back up and are building ready to send again.
And the waiting for assimilation total is starting to come down.

So I guess I'll just have to tell the kitties to start clicking their ruby slippers again...............
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1011133 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1011137 - Posted: 3 Jul 2010, 2:36:19 UTC - in response to Message 1011133.  


Network traffic has dropped off again so it looks like everyone's hit the task limit once again.
Grant
Darwin NT
ID: 1011137 · Report as offensive
Rasputin
Volunteer tester

Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 1011141 - Posted: 3 Jul 2010, 2:50:33 UTC - in response to Message 1011137.  

Ya know, everybody has a certain amount of pride that might not let them admit when they're wrong.. But you did jump on me and accuse me of something I didn't do.. The least you could say would be "Sorry" or even a simple "my mistake".



ID: 1011141 · Report as offensive
JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 1011145 - Posted: 3 Jul 2010, 3:07:53 UTC

AFAIK, this project has never asked for more than "spare" CPU cycles to get the job done. If this has changed, please let me know. Building more and more powerful systems then complaining when work is not available to feed them seems against the original concept of the project.

It's my opinion that we should be happy when the administrators of the project allow us to crunch for them. Isn't that why we all originally joined the project? That was the way it was when I joined back in 1999 :)
ID: 1011145 · Report as offensive
Aurora Borealis
Volunteer tester
Joined: 14 Jan 01
Posts: 3075
Credit: 5,631,463
RAC: 0
Canada
Message 1011153 - Posted: 3 Jul 2010, 3:29:35 UTC

Looking at the bandwidth usage on Cricket and reading the posts, my uneducated guess is that they have a script running that is monitoring the server loads and was designed to spoon feed everyone. As the load balances out, I suspect it will incrementally increase the cache limits until they are removed completely.

That would be my solution to the ridiculous pounding we've seen after outages in the past. This approach ensures that everyone gets some work while the big crunchers with the large caches are prevented from saturating the system and gumming up the works. They've tried various load balancing schemes in the past that didn't work, but this one seems to be working.
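
To make the guess concrete, here is a toy sketch of what such a script could do. Everything in it is hypothetical: the function names, the 0.8 load ceiling, the step of 20 and the load readings are invented for illustration, and nothing here reflects how the project's servers are actually configured.

```python
def next_task_limit(current_limit, server_load, load_ceiling=0.8, step=20):
    """Raise the per-host in-progress limit only while the servers have headroom."""
    if server_load > load_ceiling:
        return current_limit  # servers still busy: hold the limit where it is
    return current_limit + step  # load has settled: let hosts fetch a bit more


def ramp(start_limit, load_samples):
    """Apply next_task_limit once per load sample and record each resulting limit."""
    limits = []
    limit = start_limit
    for load in load_samples:
        limit = next_task_limit(limit, load)
        limits.append(limit)
    return limits


# Made-up load readings (fraction of capacity), starting from a limit of 20.
history = ramp(20, [0.95, 0.70, 0.90, 0.60])
print(history)
```

The sketch only covers the incremental raises; removing the limit altogether once things settle would be a further step that is not modelled here.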

Boinc V7.2.42
Win7 i5 3.33G 4GB, GTX470
ID: 1011153 · Report as offensive
soft^spirit
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1011155 - Posted: 3 Jul 2010, 3:33:55 UTC

New limit on SETI tasks = 42?
Janice
ID: 1011155 · Report as offensive
W-K 666 (Project Donor)
Volunteer tester
Joined: 18 May 99
Posts: 19062
Credit: 40,757,560
RAC: 67
United Kingdom
Message 1011162 - Posted: 3 Jul 2010, 7:15:38 UTC

This is after the short break for the database server crash; the replica is still offline.

And the title msg is still there, so no changes there.
ID: 1011162 · Report as offensive
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.