Message boards :
Technical News :
Up (Dec 07 2010)
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · Next
Author | Message |
---|---|
Dave Send message Joined: 29 Mar 02 Posts: 778 Credit: 25,001,396 RAC: 0 |
LOL nice 1 - starting to get a bit of VFM... |
APCyberax Send message Joined: 6 Jun 01 Posts: 29 Credit: 24,078,024 RAC: 48 |
got some WU for my pc. shame my server missed them all as thats now idle. i've left it with no tasks now so i can get right back to the search :) Thanks guys for your hard work. Hope you had some money left in the donations for a drink. Its on us. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
Hope you had some money left in the donations for a drink. Its on us. Tomorrow we'll quadruple the buffer space from where they are now. Mwha ha ha ha. Sounds like they've started already - and as a donor, I don't begrudge them a drop. Cheers, Matt! |
droctagon Send message Joined: 11 Jul 10 Posts: 1 Credit: 8,543,805 RAC: 0 |
Thanks for the info guys, I'm finally back in the project, at full speed :). Cheers! |
KZ3AB Send message Joined: 1 Mar 00 Posts: 6 Credit: 4,084,338 RAC: 0 |
Hot damn! I finally received some numbers to crunch. Thankx all the hard work. |
David S Send message Joined: 4 Oct 99 Posts: 18352 Credit: 27,761,924 RAC: 12 |
No need for an announcement. The BOINC software was written to take care of sites that have no work for an extended period. Look at LHC@home. The thing is, there was no reason to set it to NNT. As stated, BOINC is designed to automatically handle times of no work available. I set both Seti and Einstein to NNT when I started having trouble, but that was because I didn't really know what my problem was (and in fact I still don't know if Seti will also be a problem, but tonight I have to go down to the basement to do some laundry, so I'll start Seti then). That said, I am glad an announcement was made, almost as much as I'm glad the project is back up! David David Sitting on my butt while others boldly go, Waiting for a message from a small furry creature from Alpha Centauri. |
Whitesnake Send message Joined: 28 Aug 99 Posts: 12 Credit: 9,923,878 RAC: 0 |
Thanks to every body for this very good job. ap.wu start again Avec toutes ces dernières avancées sur les systèmes et l'équation de Drake remis à jour. Et cela donne une toute autre mesure des choses. |
rflulling Send message Joined: 2 Mar 02 Posts: 6 Credit: 627,423 RAC: 0 |
What incredible timing. I was just talking to Cisco engineer yesterday. He was telling me about a really kick but server they have developed that has more ram slots than you can shake a stick at. The idea was a low cost system that let an administrator load all the slots with lower cost memory (smaller sizes) then upgrade later as costs fall. I really don't know much about the hardware but when he started tell me about the blades, the first thing that came to mind was SETI and it's update status. The Cisco Engineer said that most likely SETI has already spoken with a Cisco rep. I'll trust that's true, but if not, you guys might be missing out. Maybe next upgrade? He said the hardware was 100% compatible with all existing architecture without any alterations needed. The hardware also favors Virtualized environments where the Hyper Visor watches over all. Anyway, I am totally stoked to see the servers back up and running. My work unit average has been dropping hard for weeks. Looking forward to the Rocket trail take off. |
Matt Lebofsky Send message Joined: 1 Mar 99 Posts: 1444 Credit: 957,058 RAC: 0 |
One more minor update: We continue to beat up on oscar - long story short we're finding our biggest hurdle in utilizing the server to its maximum potential is probably the stripe size on the raid subsystem (which is set to the factory default as it's hard to predict these bottlenecks until everything is turned on). I think I can adjust it live - and we'll try these sort of tests/updates early next week. In the meantime, more testing with what we got... - Matt -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
Gary Charpentier Send message Joined: 25 Dec 00 Posts: 30981 Credit: 53,134,872 RAC: 32 |
One more minor update: We continue to beat up on oscar - long story short we're finding our biggest hurdle in utilizing the server to its maximum potential is probably the stripe size on the raid subsystem (which is set to the factory default as it's hard to predict these bottlenecks until everything is turned on). I think I can adjust it live - and we'll try these sort of tests/updates early next week. In the meantime, more testing with what we got... If you can change it live, that's quite a trick. Thanks for the progress report. |
DanLM Send message Joined: 2 Mar 04 Posts: 4 Credit: 516,599 RAC: 0 |
Thank you for all the work. I had just built three new FreeBSD(old) machines for seti when the project went off line. Lol, timing is everything. I thought I had done something wrong with setting up seti because I had no wu's for those machines. But all is well for both the seti project and all the machines I have crunching numbers. Again, thank you. |
soft^spirit Send message Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
Matt. if you need to take anything down, no worries. It is if it crashes unexpectedly we start to cringe. We spend a lot of time and energy re-arranging those 0's and 1's.. hate to see them scattered on the floor. Janice |
Swibby Bear Send message Joined: 1 Aug 01 Posts: 246 Credit: 7,945,093 RAC: 0 |
Okay, I know I'll get a lot of flack for this question, but since I don't know the WU system flow, I'll ask it. From looking at the Server Status Page, I believe that there are (at least) two problems in evidence: 1.) Low WU creation rate, and 2.) Slow assimiliation rate. My logic tells me that the BOINC database server (Carolyn) is where the WUs are assigned among all the client computers (and probably affects the creation rates), while the Science database (Oscar) only comes into play at the time of assimiliation. So, am I correct in the belief that tinkering with Oscar's DB settings would affect the assimiliation speed, and tinkering with Carolyn's DB settings would affect the WU creation rate (and deletions, purges, etc)? I realize that the internal file server, Ptolemy, is probably in the mix too, but, to my knowledge, that server has not yet been changed over to it's replacement, which will be Thumper. So, no settings on that machine have been altered as yet. Thanks in advance for enlightening me. Whit |
soft^spirit Send message Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
Actually I do not see a slow creation or assimilation rate. The "tapes" are only going to put out work units so fast. The assimilation is not seriously backlogged, the queries are not out of line on Carolyn, and Jocelyn seems to be keeping up just fine. The results in the field are going up at a fair rate, and turn around time is not out of line. Can they do better? Probably, but give them a chance to tune it. The biggest thing is, we are working again, have not crashed, do not seem to be over-stressing the servers yet. Once they get it all tuned, Oscar should be able to handle anything thumper did with lots of room to spare. The most impressive change I have seen is we are getting our units through the transfers MUCH faster than I have seen. This seems to be saving bandwidth. Further questions are probably best in number crunching. We pick everything apart in there. Janice |
David S Send message Joined: 4 Oct 99 Posts: 18352 Credit: 27,761,924 RAC: 12 |
Siran, I've spent most of a day and a half of "work" time reading your thread at Einstein. I'll make a more detailed response there when I finish it (almost done). However, let me say here that I finally started BOINC last night. I sat and watched it get no tasks for about half an hour before I gave up and went upstairs. My status shows I got three tasks 10 minutes after that, one 5 minutes later, and two more in the wee hours of the morning. None has been returned yet, but I just checked my radioreference.com feed and it's still online after over 16 hours of crunching Seti. That's not a definitive proof of no problem, though. I'll be satisfied when it exceeds 36 hours of uptime. One thing jumped out at me in your Einstein thread: you use Avira as your antivirus. I use Avira on this one computer, because it came preinstalled. I do, however, have an unused license for Bitdefender (which I use on my 2 laptops), so maybe I'll install that. I have a theory that Seti sort of regulates Einstein. That is, the Einstein problem builds up over time, and when BOINC switches from Einstein to Seti, it clears up whatever the problem is so it never builds up to a large enough degree to cause a freeze/crash. I think tomorrow morning, I'll try letting it have another Einstein unit to test this theory (assuming it hasn't crashed on Seti before then). This theory is based on the crashes first starting to happen at about the time Seti went offline. (Otoh.... I never had a problem during the regular 3-day Seti outages, so maybe this theory is not even worth the effort I've put into typing it.) David David Sitting on my butt while others boldly go, Waiting for a message from a small furry creature from Alpha Centauri. |
Swibby Bear Send message Joined: 1 Aug 01 Posts: 246 Credit: 7,945,093 RAC: 0 |
Actually I do not see a slow creation or assimilation rate. The "tapes" are only going to put out work units so fast. The assimilation is not seriously backlogged, the queries are not out of line on Carolyn, and Jocelyn seems to be keeping up just fine. The results in the field are going up at a fair rate, and turn around time is not out of line. Can they do better? Probably, but give them a chance to tune it. The biggest thing is, we are working again, have not crashed, do not seem to be over-stressing the servers yet. Sorry to disagree, but the current WU creation rate of 10 or 12 per second is much slower than the 30 to 40 per second that we used to see. As for assimiliation, it has always been slow, unable to keep up with validated work, and catch-up assimiliation took about a week after SETI went to sleep for the month. Look at the Scarecrow graph. Pending assimiliations are growing unabated, and SETI isn't even going full speed yet. The benefit we are getting from the slow WU creation (and the longer period between "connect" (5 minutes instead of 10 seconds)) is that the internet link is not saturated, and downloads are proceeding extremely well. Probably, there are no (or very few) ghosts being created. But no one answered the main thrust of my question. All you did was pick apart my contention that there is some slowness to work out. My question was, "My logic tells me that the BOINC database server (Carolyn) is where the WUs are assigned among all the client computers (and probably affects the creation rates), while the Science database (Oscar) only comes into play at the time of assimiliation. So, am I correct in the belief that tinkering with Oscar's DB settings would affect the assimiliation speed, and tinkering with Carolyn's DB settings would affect the WU creation rate (and deletions, purges, etc)?" Thanks again for your responses. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
The benefit we are getting from the slow WU creation (and the longer period between "connect" (5 minutes instead of 10 seconds)) is that the internet link is not saturated, and downloads are proceeding extremely well. Probably, there are no (or very few) ghosts being created. Don't underestimate the value of this particular benefit. If I worked in the lab, I would have spent the last month trying to find a way to slow things down so that 200,000 hosts don't max out the bandwidth just hammering for more work. BOINC-equipped computers make an incredible botnet, and there is no real difference between hungry hosts and a DoS attack. It seems like holding down the work creation rate is doing that nicely. It could just be a happy accident, or it could be intentional. Either way, it seems to be working. ... and I think there is more to the answer to your other questions than just pointing at Carolyn or Oscar. |
Fred J. Verster Send message Joined: 21 Apr 04 Posts: 3252 Credit: 31,903,643 RAC: 0 |
|
Swibby Bear Send message Joined: 1 Aug 01 Posts: 246 Credit: 7,945,093 RAC: 0 |
The Berkeley Boys have found the magic cure! They put Oscar on WU splitting duty, freeing up Vader to concentrate his efforts on the assimiliation task. The result: Splitters are cranking out about 30+ WU per second and the Ready-To-Send queue is building nicely. And the Waiting-to-Assimilate queue is dropping like a rock. Paradise at last! Now you are getting your money's worth! Congratulations yet again !!! (Now I have server envy) |
Josef W. Segur Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
I particularly admire their timing. With production limited to about 60000 MB and 570 AP tasks per hour for more than 2 days, many hosts reached the in progress limit. So turning up the creation rate did not lead to hordes of hosts trying to download 300 Mbps through a 100 Mbps pipe. If things go well over the weekend, I suppose the in progress limits will be boosted Monday. The system reaction will be interesting to watch. Joe |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.