Please rise the limits... just a little...

Message boards : Number crunching : Please rise the limits... just a little...



juan BFP
Volunteer tester
Joined: 16 Mar 07
Posts: 5847
Credit: 330,522,201
RAC: 7,709
Panama
Message 1344751 - Posted: 9 Mar 2013, 23:05:06 UTC

Now that the Windows TCP settings bug is fixed, why not raise the limit a little?

Maybe just to 100 WU per GPU rather than per host (maintaining the 100 limit on CPU work). That would let us ride out the scheduled Tuesday maintenance outages more easily on our fastest hosts.


ID: 1344751
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 7475
Credit: 90,922,762
RAC: 45,435
Australia
Message 1344753 - Posted: 9 Mar 2013, 23:12:24 UTC - in response to Message 1344751.


100 per core, 600 per GPU would be a nice start.


Grant
Darwin NT

ID: 1344753
Keith Myers (Project Donor)
Volunteer tester
Joined: 29 Apr 01
Posts: 771
Credit: 107,317,168
RAC: 118,883
United States
Message 1344795 - Posted: 10 Mar 2013, 2:02:15 UTC - in response to Message 1344753.


100 per core, 600 per GPU would be a nice start.

Ditto.

ID: 1344795
rob smith (Project Donor)
Volunteer tester
Joined: 7 Mar 03
Posts: 13310
Credit: 154,295,083
RAC: 114,532
United Kingdom
Message 1344864 - Posted: 10 Mar 2013, 9:08:24 UTC

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.
With the way the weekly outages have gone of late, there is actually little need to increase the per-processor limit. What would be good, however, is for BOINC to correctly identify multi-processor Nvidia cards as being more than one processor. Why the chuff chuff does BOINC decide that my GTX690 is only a single processor, when it is reported as [2] on the accounts page??


Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?

ID: 1344864
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 7475
Credit: 90,922,762
RAC: 45,435
Australia
Message 1344871 - Posted: 10 Mar 2013, 9:17:04 UTC - in response to Message 1344864.

Why the chuff chuff does BOINC decide that my GTX690 is only a single processor, when it is reported as [2] on the accounts page??

Because it is a single device.
Just as a 16-core CPU with Hyperthreading (so 32 available processing units) is a single CPU.
Grant
Darwin NT

ID: 1344871
rob smith (Project Donor)
Volunteer tester
Joined: 7 Mar 03
Posts: 13310
Credit: 154,295,083
RAC: 114,532
United Kingdom
Message 1344880 - Posted: 10 Mar 2013, 9:46:44 UTC

But it isn't - it's two devices on one board, which may or may not be connected by an internal SLI link (mine are unlinked). One part of the system reports it as TWO devices - take a look at the details for yourself: http://setiathome.berkeley.edu/show_host_detail.php?hostid=6890059
It is interesting to note that GPUGRID treats the GTX690 as being TWO GPUs, as witnessed by the fact that a few minutes ago it was running one instance of a GPUGRID task plus three S@H tasks at the same time, and will run 6 S@H tasks with a setting of 0.33/GPU - if it were a single GPU it would not be capable of either of these sets of operations.


Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?

ID: 1344880
Fred E.
Volunteer tester
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1344900 - Posted: 10 Mar 2013, 10:40:48 UTC
Last modified: 10 Mar 2013, 10:41:23 UTC

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.

I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.

ID: 1344900
Richard Haselgrove (Project Donor)
Volunteer tester
Joined: 4 Jul 99
Posts: 11136
Credit: 83,558,634
RAC: 41,853
United Kingdom
Message 1344904 - Posted: 10 Mar 2013, 10:47:54 UTC - in response to Message 1344900.

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.

I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.

No it doesn't, I'm afraid. At the moment, with the project running absolutely flat out, it means that somebody else grabs the tasks and runs them for you.

ID: 1344904
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 7475
Credit: 90,922,762
RAC: 45,435
Australia
Message 1344909 - Posted: 10 Mar 2013, 11:03:22 UTC - in response to Message 1344880.

But it isn't

But it is.
Just as I pointed out in my first post - 8 or 16 or 32 cores in a CPU still counts as 1 CPU.
2 or 4 or 8 GPUs on a single board still counts as a single video card.


I'm not sure what happens in the case of 2 CPUs or 2 physical video cards. If the limitation is per device, then you'd get 100 WUs for each CPU & each video card. If the limitation is per system, then you'd still be limited to 100 WUs for all CPUs & 100 WUs for all video cards, no matter how many the system has.
Grant
Darwin NT

ID: 1344909
Fred E.
Volunteer tester
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1344911 - Posted: 10 Mar 2013, 11:08:35 UTC

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.


I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.

No it doesn't, I'm afraid. At the moment, with the project running absolutely flat out, it means that somebody else grabs the tasks and runs them for you.

Just don't see it that way. If I can't get work during an outage, they can't either. And if I still have B project work left over when the work flow resumes and someone runs tasks I would have run, one set gets run instead of two. Larger caches would allow the project to run flat out, which it is not doing when we're out of work.

Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.

ID: 1344911
Chris S (Crowdfunding Project Donor)
Volunteer tester
Joined: 19 Nov 00
Posts: 38176
Credit: 21,269,499
RAC: 27,988
United Kingdom
Message 1344913 - Posted: 10 Mar 2013, 11:14:48 UTC

At the moment, with the project running absolutely flat out,

That is the whole point that Richard sensibly makes. They are doing their best, and yes there doesn't seem to be enough work for everybody. You have to remember that the infrastructure of this project was scoped out 10 years ago when we didn't have quad and hex core processors, nor GPU cards all crunching away. The fact that they have kept pace with technology as well as they have done, on a limited budget, is a credit to them.

ID: 1344913
Wiggo "Socialist"
Joined: 24 Jan 00
Posts: 10503
Credit: 135,221,461
RAC: 37,292
Australia
Message 1344915 - Posted: 10 Mar 2013, 11:25:47 UTC - in response to Message 1344909.

But it isn't

But it is.
Just as I pointed out in my first post - 8 or 16 or 32 cores in a CPU still counts as 1 CPU.
2 or 4 or 8 GPUs on a single board still counts as a single video card.


I'm not sure what happens in the case of 2 CPUs or 2 physical video cards. If the limitation is per device, then you'd get 100 WUs for each CPU & each video card. If the limitation is per system, then you'd still be limited to 100 WUs for all CPUs & 100 WUs for all video cards, no matter how many the system has.

It doesn't matter whether you have 1 CPU or 2, you still only get 100 WUs, and the same goes for a GPU-capable machine: that number also stays at 100 WUs no matter how many GPUs you have. That means that unless you run some really early version of BOINC, the limit for any machine crunching on both CPU and GPU is 200 WUs.

Cheers.

ID: 1344915
juan BFP
Volunteer tester
Joined: 16 Mar 07
Posts: 5847
Credit: 330,522,201
RAC: 7,709
Panama
Message 1344928 - Posted: 10 Mar 2013, 12:12:30 UTC - in response to Message 1344915.
Last modified: 10 Mar 2013, 12:17:24 UTC


It doesn't matter whether you have 1 CPU or 2, you still only get 100 WUs, and the same goes for a GPU-capable machine: that number also stays at 100 WUs no matter how many GPUs you have. That means that unless you run some really early version of BOINC, the limit for any machine crunching on both CPU and GPU is 200 WUs.

That's exactly why I suggest a "little increase" in the GPU WU limit. 100 CPU WUs is enough for half a day of work, even on the fastest CPUs (if I'm wrong, someone please show why), but 100 GPU WUs is not, even on a single-690 host. Normally a WU (not a shortie, of course) crunches in less than 7 minutes; that's about 34 WUs per hour, so a 100 WU cache lasts less than 3 hours - not enough for the 3-6 hour outages. A limit of 200 per GPU would give us enough work on double or triple GPU hosts, even for a long normal outage - not for unscheduled ones, of course - but it would be a good beginning and would not put too much new load on the databases.

I noticed something else (outside this thread's focus): the Cricket graphs show almost 100% bandwidth utilization, but all the AP splitters are down and MB splitting is in "slow" mode (only 3 splitters are working), and we have a lot of ready-to-send MB units (more than 300K), yet everything appears to work fine. Let's see what happens when the AP splitters return to duty.
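As a sanity check on the arithmetic above, here is a minimal sketch (an editor's illustration, not from the thread). The figure of 4 concurrent GPU tasks is an assumption chosen to reproduce the ~34 WU/hour estimate for a GTX690 host; adjust it for your own setup.

```python
# Back-of-the-envelope check of the cache-duration arithmetic above.
# ASSUMPTION: 4 GPU tasks running at once (e.g. 2 GPUs x 2 tasks each),
# picked to reproduce the ~34 WU/hour figure; not a thread-confirmed value.

def cache_hours(cache_size, minutes_per_wu, concurrent_tasks):
    """Hours a work-unit cache lasts at a given crunch rate."""
    wu_per_hour = concurrent_tasks * 60.0 / minutes_per_wu
    return cache_size / wu_per_hour

rate = 4 * 60.0 / 7.0
print(f"crunch rate: {rate:.1f} WU/hour")                 # 34.3
print(f"100 WU cache: {cache_hours(100, 7, 4):.1f} h")    # 2.9
print(f"200 WU cache: {cache_hours(200, 7, 4):.1f} h")    # 5.8
```

On these assumptions a doubled per-GPU allotment would roughly cover a 5-6 hour outage, which matches the estimate in the post.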

ID: 1344928
bill
Joined: 16 Jun 99
Posts: 861
Credit: 28,721,888
RAC: 66
United States
Message 1345099 - Posted: 10 Mar 2013, 21:46:38 UTC - in response to Message 1344904.

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.

I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.



No it doesn't, I'm afraid. At the moment, with the project running absolutely flat out, it means that somebody else grabs the tasks and runs them for you.

Emphasis added by me.

That last part should be in a FAQ or a sticky or something.

ID: 1345099
kittyman (Project Donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 45894
Credit: 814,749,104
RAC: 122,258
United States
Message 1345105 - Posted: 10 Mar 2013, 21:55:37 UTC - in response to Message 1345099.

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.

I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.



No it doesn't, I'm afraid. At the moment, with the project running absolutely flat out, it means that somebody else grabs the tasks and runs them for you.

Emphasis added by me.

That last part should be in a FAQ or a sticky or something.

The fact remains that when I cannot cache enough work on my fastest crunchers to ride out even the weekly outage, less work gets done for the project.
Granted, this may not have a tremendous impact on the project overall, but it is a fact that when I run out of WUs, I am not crunching Seti on my best resources. The CPUs never run out of work due to their slower speed. But the multiple-GPU rigs burn through things pretty quickly.
Especially when the 100-task allotment consists of shorties.
Kitties make wonderful traveling companions on your journey through life.

Have made a few friends in this life.
Most were cats.

ID: 1345105
bill
Joined: 16 Jun 99
Posts: 861
Credit: 28,721,888
RAC: 66
United States
Message 1345107 - Posted: 10 Mar 2013, 21:56:01 UTC - in response to Message 1344911.

While an increase in the number of WUs distributed to an individual cruncher might sound like a good idea, it wouldn't help anything apart from our egos.


I disagree. Every time I run out of work during the outages, I load work from a "B" project, and that crunch time is lost to SETI. I suspect the faster crunchers run out more than I do. That lost crunch time means that fewer results flow to the science databases, and that hurts the project. It is wasted capacity. Give us enough to withstand a 48-hour outage.

No it doesn't, I'm afraid. At the moment, with the project running absolutely flat out, it means that somebody else grabs the tasks and runs them for you.

Just don't see it that way. If I can't get work during an outage, they can't either. And if I still have B project work left over when the work flow resumes and someone runs tasks I would have run, one set gets run instead of two. Larger caches would allow the project to run flat out, which it is not doing when we're out of work.


But the project is already running flat out. How does increasing the limits make the project run any more flat out than it already is?

At what point does the increased number of work units in the field cause the database to crash? Isn't that why the limits were put in?

ID: 1345107
kittyman (Project Donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 45894
Credit: 814,749,104
RAC: 122,258
United States
Message 1345109 - Posted: 10 Mar 2013, 21:58:12 UTC - in response to Message 1345107.



At what point does the increased number of work units in the field cause the database to crash? Isn't that why the limits were put in?

That's the million dollar question.
I have been told that actual DB usage is only at something like 60% of capacity. The limit may be the server's capacity to process and maintain the DB.
Kitties make wonderful traveling companions on your journey through life.

Have made a few friends in this life.
Most were cats.

ID: 1345109
bill
Joined: 16 Jun 99
Posts: 861
Credit: 28,721,888
RAC: 66
United States
Message 1345110 - Posted: 10 Mar 2013, 22:03:30 UTC - in response to Message 1345105.

Yes, but the project never asked anybody to build such super crunchers, and the project is not time critical. If it takes an extra year to find those little green men, that is not a problem for the project.

Yes, it would be nice to have all the WUs you want, but if it causes even bigger problems, why do it?

ID: 1345110
bill
Joined: 16 Jun 99
Posts: 861
Credit: 28,721,888
RAC: 66
United States
Message 1345112 - Posted: 10 Mar 2013, 22:07:08 UTC - in response to Message 1345109.



At what point does the increased number of work units in the field cause the database to crash? Isn't that why the limits were put in?

That's the million dollar question.
I have been told that the actual DB usage is only at, I think it was like 60% of capacity. The limit may be on the server's capacity to process and maintain the DB.


So that problem has to be fixed first, wherever it resides at the project. What can we do from this side of the servers to help fix it?

ID: 1345112
bill
Joined: 16 Jun 99
Posts: 861
Credit: 28,721,888
RAC: 66
United States
Message 1345118 - Posted: 10 Mar 2013, 22:16:54 UTC - in response to Message 1344928.

Much snippage by me.

That's exactly why I suggest a "little increase"


What is needed is the point (the number) at which the project grinds to a halt because of too many WUs in the field.

Then divide that by the number of crunchers, add a suitable fudge factor for a safety margin, and we'll have a limit that can be imposed until the underlying problem is fixed.

Has anybody run the numbers?
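The recipe described above can be sketched in a few lines. Every input figure below is a made-up placeholder - the real in-field ceiling and active-host count are exactly the unknowns being asked about.

```python
# Sketch of the proposed limit calculation. ASSUMPTION: all input
# figures are hypothetical placeholders, not real SETI@home numbers.

def per_host_limit(max_wu_in_field, active_hosts, safety=0.8):
    """Per-host cap: an equal share of the ceiling, scaled by a safety margin."""
    return int(max_wu_in_field / active_hosts * safety)

# e.g. a (hypothetical) 20M in-field ceiling spread over 150k active hosts:
print(per_host_limit(20_000_000, 150_000))   # 106
```

With a 20% safety margin, each host's cap is a bit below its equal share of the ceiling, leaving headroom for hosts that join mid-week.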

ID: 1345118
©2016 University of California

SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.