Heads Up: Quorum Change |
Message boards : SETI@home Staff Blog : Heads Up: Quorum Change
| Author | Message |
|---|---|
|
Hello, generous donators of CPU time. | |
| ID: 543372 · | |
|
Good news Matt. Thanks. | |
| ID: 543378 · | |
Hello, generous donators of CPU time. - Matt Never let it be said again we aren't recognized. :) (Unless I say it.) | |
| ID: 543551 · | |
|
Hi Matt ... thanks very much for the Good news! | |
| ID: 543586 · | |
|
Thanks Matt, | |
| ID: 543611 · | |
|
Thanks, Matt, for the news. Less redundancy means more science done with the same amount of computation. So, ALFA data crunching is just around the corner? Einstein will be proud if Seti data runs out, for me. | |
| ID: 543801 · | |
|
This is a good opportunity for me to get on my "change control soapbox". Would someone give a good (technical, quantitative) reason for the change? Would someone give a good (technical, quantitative) analysis of the risk involved? It just seems like good project management to go to the effort to engineer the change (and good PR to publish the results). | |
| ID: 543818 · | |
|
Reducing quorum means less redundancy, which in turn means we need to create more work to keep up with "demand." In theory, we still have a bunch of "standard" SETI@home data to work with, so we shouldn't run out... but they are on DLT IV tapes which are slow to read, and we don't have a carousel or robot to feed the drive over nights or weekends. Currently I can throw one or two tapes in a day (four days a week) and that's just enough. Reducing quorum to 3 will increase splitter demand by 25%. And we may soon reduce it to 2, meaning splitter demand will increase by 50%. We can't feed our (only) DLT drive fast enough. Another DLT drive won't help - neither our network nor the file server can handle the bandwidth. Plus we're very close to being finished with this data. | |
| ID: 543844 · | |
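The splitter-demand arithmetic in the post above can be sketched roughly as follows. This is only an illustration under stated assumptions: volunteers return a fixed number of results per day, and each workunit currently needs 4 results (the post does not state the starting value). The function names and the 1200 results/day figure are invented for the example. Under these assumptions, workunit consumption rises by about 33% at 3 results per workunit and by 100% at 2; the 25% and 50% figures match the drop in results needed per workunit instead.

```python
from fractions import Fraction


def workunits_per_day(results_per_day, results_per_workunit):
    """Workunits completed per day if every workunit needs a fixed number of results."""
    return Fraction(results_per_day, results_per_workunit)


def demand_increase(old_results_per_wu, new_results_per_wu, results_per_day=1200):
    """Fractional increase in workunits consumed per day (0.5 means +50%).

    results_per_day is a made-up figure; it cancels out of the ratio.
    """
    before = workunits_per_day(results_per_day, old_results_per_wu)
    after = workunits_per_day(results_per_day, new_results_per_wu)
    return float(after / before - 1)


if __name__ == "__main__":
    # Assumed starting point: 4 results per workunit.
    for new in (3, 2):
        print(f"4 -> {new} results per workunit: "
              f"{demand_increase(4, new):.0%} more workunits needed")
```

Because the daily result count cancels out, the percentages depend only on the old and new number of results per workunit.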
|
Thanks, Matt; this is enlightening. You have 1 tape drive that feeds the splitters. If the downstream demand increases due to reduced redundancy then someone needs to feed the drive at an increasing rate, which will lead to gaps during which the drive is dormant. You also have 'a bunch' of tapes left to analyze. So reduced redundancy will increase their processing rate, but potentially expose the project to gaps in WU availability. | |
| ID: 543882 · | |
|
I forget the exact numbers, but if the redundancy level was 1, the reliability of the data was over 99% (i.e. less than 1% of the data was corrupt or bogus). Each level of redundancy above that is an improvement, but with diminishing returns. | |
| ID: 543912 · | |
Thanks, Matt; this is enlightening. You have 1 tape drive that feeds the splitters. If the downstream demand increases due to reduced redundancy then someone needs to feed the drive at an increasing rate, which will lead to gaps during which the drive is dormant. You also have 'a bunch' of tapes left to analyze. So reduced redundancy will increase their processing rate, but potentially expose the project to gaps in WU availability. Remember that the new data is being shipped from Arecibo on nice, fast SATA hard drives, not tape. | |
| ID: 543917 · | |
Thanks, Matt; this is enlightening. You have 1 tape drive that feeds the splitters... Ouch! No redundancy there for a failed drive or a jammed tape... But why is the redundancy criterion being modified? The premise since Classic is that Seti needed a high level of redundancy to be believed. (I seem to remember questioning this premise a couple of years ago and getting blasted and later ignored by the community.) What model tells us that we can live with 3 or 2 in a quorum? The risk to the project is false positives as well as missing positives. (I am not aware of what kind of probability analyses have been performed on this topic.) I guess the question is: what is the probability (possibility) of getting two similarly incorrect results to fool the quorum checking? Could two similar systems similarly overclocked or similarly overheated return results in the same incorrect way? A secondary question is how long would it take with the current redundancy criterion to finish processing the 'backlog' of data tapes, assuming the current computing capacity as provided by the volunteers? That is, what is the aggregate number of WUs on the tapes, how many have been processed, how many can 'we' process per day? I realize that the tapes are being re-analyzed with more stringent criteria, leading to either more WUs or more compute time, or both. All very good questions. I think that the strategy of improving the efficiency and holding recrunching of classic WUs in reserve is a good idea. I must admit that I too have the question as to whether the quorum logic and the real world can work well enough together for a minimum quorum of two. But then again, the main risk is that of false positives and for those positives, those few WUs can be reanalysed for confirmation. The other risk is that of false negatives and whether that can open the door for users to cheat in some way... Happy crunchin', Martin ____________ Mandriva Linux A user friendly OS! See new freedom Mageia2 The Future is what We make IT (GPLv3) | |
| ID: 543921 · | |
|
PhonAcq This is a good opportunity for me to get on my "change control soapbox". Would someone give a good (technical, quantitative) reason for the change? Would someone give a good (technical, quantitative) analysis of the risk involved? It just seems like good project management to go to the effort to engineer the change (and good PR to publish the results). ____________ Please consider a Donation to the Seti Project. | |
| ID: 544096 · | |
... Small, but the same as it has always been. A canonical result is one of a pair that is "strongly similar". Any additional results which are at least "weakly similar" are also granted credit. The "strongly similar" criteria are sufficiently tight to ensure reliable results are entered in the Master Science database, the "weakly similar" criteria loose enough that those using different platforms or CPU generations which may occasionally not be in complete agreement will still be rewarded. Could two similar systems similarly overclocked or similarly overheated return results in the same incorrect way? Highly unlikely, but of course possible. Joe | |
| ID: 544144 · | |
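The two-tier check Joe describes (a "strongly similar" pair yields the canonical result, and anything at least "weakly similar" to it still earns credit) might be sketched like this. Everything concrete here is hypothetical: real results are sets of detected signals, not single numbers, and the tolerances `STRONG_TOL` and `WEAK_TOL`, the function `validate`, and the host names are invented purely for illustration. This is not the project's validator.

```python
from itertools import combinations

# Hypothetical tolerances; the real "strongly/weakly similar" criteria
# are not given in this thread.
STRONG_TOL = 0.01
WEAK_TOL = 0.10


def validate(results):
    """Pick a canonical result and the hosts that earn credit.

    results maps a host name to a single number standing in for that
    host's returned result (a simplification made for this sketch).
    Returns (canonical_host, credited_hosts), or (None, []) when no
    pair of results is strongly similar.
    """
    for a, b in combinations(results, 2):
        if abs(results[a] - results[b]) <= STRONG_TOL:
            canonical = a  # one member of the strongly similar pair
            credited = [
                host for host, value in results.items()
                if abs(value - results[canonical]) <= WEAK_TOL
            ]
            return canonical, credited
    return None, []


if __name__ == "__main__":
    returned = {"host_a": 1.000, "host_b": 1.005, "host_c": 1.08, "host_d": 2.0}
    print(validate(returned))
```

In this toy run `host_a` and `host_b` form the strongly similar pair, `host_c` is only weakly similar but is still credited, and `host_d` is rejected.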
This has been the best conversation from the Seti Staff and identifiable progress in years... And all excellent motivation too, all round. Somewhere you missed that MultiBeam provides more data than 10X the users that are currently crunching... This does not mention that the change to the number of machines (Quorum) has been in testing (Seti Beta) for months... So now it is getting ready for one of the next steps in Seti Evolution!... The "x10" data throughput gives very good impetus to look at speeding up the analysis, provided... This is a good opportunity for me to get on my "change control soapbox". Would someone give a good (technical, quantitative) reason for the change? ... (technical, quantitative) analysis of the risk involved?... ... that the Science isn't diluted. Given that WU verification is still done by Berkeley themselves for those 'interesting' WUs found by us, and that the real 'proof' is in subsequently finding a consistent ET beacon, then I consider that this change is 'affordable' and all good and better for progress. Especially so for doing something useful and timely with the new flood of data. It would still be good to see some statistics and/or numbers for what the quorum change 'means' for the 'reliability' of the Science results. Congratulations to Matt, Eric(?), Jeff(?) and All for pushing this far for getting the ALFA data onstream! I'm sure we're all looking forward to seeing the new data! Happy crunchin', Martin | |
| ID: 544265 · | |
I forget the exact numbers, but if the redundancy level was 1, the reliability of the data was over 99% (i.e. less than 1% of the data was corrupt or bogus). Each level of redundancy above that is an improvement, but with diminishing returns. Oooops, missed that comment. That answers one part of the quorum change questions. The current data being analyzed by SETI@home has already been analyzed before, but with older versions of the client that did far less analysis (so workunits wouldn't take weeks to finish). There's a scientific reason to do so, So still useful as data to use to search again but deeper if we run out of the more sensitive new data: but we're chomping at the bit to get to the new data because it's "deeper" in both frequency space and in sensitivity. Plus RFI will be easier to reject because of the multiple beams, etc. So reducing redundancy will help us push through the remaining old data. This following very important point (to some) is harking back to old discussions about having 'golden calibrations' so that credits can be more accurately awarded... One drawback of less redundancy is more "jagged" credit for workunits done (since they won't be averaged over the claimed credit of many users). Over the long term this will average out, but users who pay close attention will get annoyed by a few data points that make it seem they're being "short changed." Some PR will be necessary to smooth that over. Happy crunchin', Martin | |
| ID: 544269 · | |
|
I see that the change is already in process. I've gotten several WUs that are the 3/2 format. | |
| ID: 544316 · | |
I see that the change is already in process. I've gotten several WUs that are the 3/2 format. Yay, a few here too, :D. Here's to a leaner, tidier database. Nice smooth outage and recovery this week too. Keep up the great work ( but remember to take a little rest too!) Jason ____________ "It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change." Charles Darwin | |
| ID: 544331 · | |
|
Redundancy level of one--- Does that mean that there are two results that agree? If so, and the probability that those two results are truly correct is 99 percent--- Does that mean that each unchecked result has a 90 percent chance of being correct, and that three checks would imply a certainty of 99.9 percent? Of course it would seem that any false positive found in the results would easily be disproven by more results. | |
| ID: 544360 · | |
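One way to read the numbers in the question above, as a rough sketch only: assume every result is wrong independently with the same probability p. That assumption is made here for illustration and is not confirmed anywhere in the thread; similarly overclocked or overheated hosts could fail in correlated ways. Under it, the chance that all n results are wrong is p^n, so p = 0.1 gives 99% for two results and 99.9% for three. The function names are invented for this example.

```python
def all_wrong_probability(p_single_error, n_results):
    """Probability that n independent results are all wrong.

    Assumes each result is wrong with the same probability and that
    errors are independent (an assumption of this sketch).
    """
    return p_single_error ** n_results


def confidence(p_single_error, n_results):
    """Probability that at least one of the n results is correct."""
    return 1.0 - all_wrong_probability(p_single_error, n_results)


if __name__ == "__main__":
    # p = 0.1 corresponds to the "90 percent chance of being correct" reading.
    for n in (1, 2, 3):
        print(f"{n} result(s): {confidence(0.1, n):.1%}")
```

If anything this overstates the risk of a bad result being accepted, since wrong results would also have to agree with each other closely enough to pass validation.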
With flop-counting, most machines claim exactly the same credit. It'll only be an issue when a current BOINC client gets paired with an older BOINC client, and I think the lower score is usually thrown out, so it'll likely raise credit a tiny bit on average. | |
| ID: 544383 · | |
| Copyright © 2013 University of California |