Panic Mode On (47) Server problems?
Author | Message |
---|---|
arkayn Joined: 14 May 99 Posts: 4438 Credit: 55,006,323 RAC: 0 |
We know that one of the servers is having difficulties: http://setiathome.berkeley.edu/forum_thread.php?id=64259 Please continue the venting. |
Iona Joined: 12 Jul 07 Posts: 790 Credit: 22,438,118 RAC: 0 |
Good for you. My cache will run out in a day or so (if I keep the PCs running, doing nothing else but S@H) and therefore, in addition to your requests, I will also demand a refund! Don't take life too seriously, as you'll never come out of it alive! |
Gary Charpentier Joined: 25 Dec 00 Posts: 30919 Credit: 53,134,872 RAC: 32 |
I understand Bruno decided to play nice. |
perryjay Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0 |
Posted by Jeff Cobb over in Tech News..... The folks at Overland really came through! They give us amazing support. One of their engineers logged into the server and worked his magic. The RAID and filesystem have come back to life. We'll let the RAID resync and do a final reboot to clear some flags and then restart work generation. PROUD MEMBER OF Team Starfire World BOINC |
KB7RZF Joined: 15 Aug 99 Posts: 9549 Credit: 3,308,926 RAC: 2 |
I ran dry a day or 2 ago, but I wound up getting 2 resend WU's. LOL But, I've got plenty of other projects to crunch for. Hehehe |
Miep Joined: 23 Jul 99 Posts: 2412 Credit: 351,996 RAC: 0 |
I ran dry a day or 2 ago, but I wound up getting 2 resend WU's. LOL But, I've got plenty of other projects to crunch for. Hehehe Yes, we can see that :D - is there something you DON'T crunch? I seem to have another day or so, before I can test how well the backup project mechanism works in 6.12.28. Carola ------- I'm multilingual - I can misunderstand people in several languages! |
Miep Joined: 23 Jul 99 Posts: 2412 Credit: 351,996 RAC: 0 |
This is totally unacceptable. My caches will run out in 10 days, and I tell you people that if this problem isn't solved in the coming 25 years, I will leave this project forever. I wouldn't do that, if I was you, the counting down bit. What with the trees in the forest, I'd be afraid what exactly I was counting down to... Carola ------- I'm multilingual - I can misunderstand people in several languages! |
KB7RZF Joined: 15 Aug 99 Posts: 9549 Credit: 3,308,926 RAC: 2 |
I ran dry a day or 2 ago, but I wound up getting 2 resend WU's. LOL But, I've got plenty of other projects to crunch for. Hehehe LOL There's a few newer projects that have come out that I haven't attached to. All of these in my sig have been ones I've crunched for during a team's project of the month or just because they looked interesting. SETI will always be home, but I figured it's always nice to share. So, I shared. LOL |
Gary Charpentier Joined: 25 Dec 00 Posts: 30919 Credit: 53,134,872 RAC: 32 |
This is totally unacceptable. My caches will run out in 10 days, and I tell you people that if this problem isn't solved in the coming 25 years, I will leave this project forever. When it is working again, do you suspend your count, or does it reset? |
Donald L. Johnson Joined: 5 Aug 02 Posts: 8240 Credit: 14,654,533 RAC: 20 |
One positive thing that's come out of this is that the file deleters and db purge are getting some catch-up time. Results and Work Units waiting for db purge are both under 100K and dropping. Donald Infernal Optimist / Submariner, retired |
Donald L. Johnson Joined: 5 Aug 02 Posts: 8240 Credit: 14,654,533 RAC: 20 |
This is totally unacceptable. My caches will run out in 10 days, and I tell you people that if this problem isn't solved in the coming 25 years, I will leave this project forever. So, if the project keeps going, we will have to deal with you for another 40+ years? Works for me. Donald Infernal Optimist / Submariner, retired |
Jason Safoutin Joined: 8 Sep 05 Posts: 1386 Credit: 200,389 RAC: 0 |
I ran dry a day or 2 ago, but I wound up getting 2 resend WU's. LOL But, I've got plenty of other projects to crunch for. Hehehe Agreed. SETI is my favorite project and will always be my top one. I used to participate in other projects, but never was too interested in any of them. The problems happen often, but that won't stop me from crunching here ever. "By faith we understand that the universe was formed at God's command, so that what is seen was not made out of what was visible". Hebrews 11.3 |
Gary Charpentier Joined: 25 Dec 00 Posts: 30919 Credit: 53,134,872 RAC: 32 |
So, if the project keeps going, we will have to deal with you for another 40+ years? Eric almost has that Fountain of Youth formula ET sent decoded ... |
Cosmic_Ocean Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
There's life on the cricket graph starting around 05:00 UTC. Upload server is still disabled though. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving up) |
tbret Joined: 28 May 99 Posts: 3380 Credit: 296,162,071 RAC: 40 |
Instead of first asking a question that would expose how ignorant I am about this, let me get ahead of that and proclaim and admit that I am really ignorant about a lot of things. One of the many is a RAID array. Now, it's hard to be as ignorant as I am, especially since I once configured a RAID using an Adaptec controller that I seem to remember paying more for than the laptop I'm using to type this post. But that was back in the days when expensive motherboards would have two VESA Local Bus slots on them. Okay, so now that everyone's up-to-date on how out-of-date I am --- Just how big are these RAID arrays SETI is using (in GBs)? That question would probably answer my next question which is "Why are they using them?" I can't imagine that our upload / download activity, confined by the pipe into the lab, would need nearly 900MB/s and if it does then I can't imagine how many physical drives there would have to be in the array to handle it for more than half of a day at a time. Can someone give me a clue as to why you'd want to run a "striped" array (I understand redundancy) on this project in big, bold, conceptual strokes that even I can understand? I was just transferring an ISO file via wireless at 11.5MB/s across my den (I know that's one big file as opposed to 5,000 22k files). I'm thinking RAM makes more sense and so I want to know why I'm wrong; just as a sort-of "welcome to reality in the 21st century" lesson for me. |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13835 Credit: 208,696,464 RAC: 304 |
Just how big are these RAID arrays SETI is using (in GBs)? That question would probably answer my next question which is "Why are they using them?" RAID stands for Redundant Array of Inexpensive Disks. Note the redundant. RAID 0 is just for speed; there is no redundancy. If 1 disk dies, all data is lost. RAID 1 is mirroring: one disk maintains a copy of another disk. RAID 5 and 6 are the ones that really matter: data is spread across multiple disks. With RAID 5, if one disk dies no data is lost. It can be rebuilt from the redundant data stored on the other disks in the array. With RAID 6, 2 disks in the array can die & still no data is lost. Grant Darwin NT |
Tim Joined: 19 May 99 Posts: 211 Credit: 278,575,259 RAC: 0 |
Imagine a hard disk failure now…… at our computers with so many tasks to upload :-) |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
Just how big are these RAID arrays SETI is using (in GBs)? Looking at the Server Status Page, the line for 'Results out in the field' right now shows 5,857,809 for SETI (Multibeam), and 159,686 (Astropulse). Those are 'tasks' in modern terminology. Sometimes two people will be working on the same workunit, but often (more often for MB), one task will be complete and have been returned for validation, but the other will still be active. For the sake of argument, let's say that represents 5 million MB workunits, and 100 thousand AP workunits. That data has to be held at SETI, on those RAID arrays, until the results are validated - that's so a replacement copy can be sent out if validation is inconclusive or a worker times out. MB data files are 367 KiB in storage requirements, and AP are 8 MiB. Multiply that lot out, and I get the answer to be... Two thousand five hundred gigabytes. When a RAID has to be rebuilt (as is going on at the moment), every single one of those bytes has to be read, and where appropriate written back to make the new redundant copy. If the RAID arrays held less data, the process would be quicker, and we could get back to work sooner. That's why I ask people not to hold so many tasks in their caches. |
Jason Safoutin Joined: 8 Sep 05 Posts: 1386 Credit: 200,389 RAC: 0 |
I managed to get a cache of 59 WU's just now. This was the first time I tried to download work since the issues started. I noticed one took just a few seconds to crunch. I wonder if there will be many of those. Still not able to upload anything though. "By faith we understand that the universe was formed at God's command, so that what is seen was not made out of what was visible". Hebrews 11.3 |
Jason Safoutin Joined: 8 Sep 05 Posts: 1386 Credit: 200,389 RAC: 0 |
As I understand it, the reason for using RAID configurations is because there is redundancy built in, i.e. a backup. RAID disks are hot swappable, so if a hard drive fails you simply take it out and throw it away and plug a new one in. Then the RAID array will copy whatever it needs to the new disk. Agree with you 100%. "By faith we understand that the universe was formed at God's command, so that what is seen was not made out of what was visible". Hebrews 11.3 |
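
As a footnote to the RAID discussion above, here is a rough Python sketch of two points from the thread: Richard Haselgrove's storage estimate (5 million MB workunits at 367 KiB plus 100 thousand AP workunits at 8 MiB) and Grant's description of RAID 5 rebuilding a lost disk from redundant data. The function names and the toy four-byte "disks" are made up for illustration. The sketch assumes simple XOR parity, which is the usual RAID 5 scheme, and says nothing about how the SETI servers are actually configured.

```python
from functools import reduce

KIB = 1024
MIB = 1024 ** 2
GIB = 1024 ** 3


def storage_estimate_gib(mb_workunits, ap_workunits,
                         mb_size_bytes=367 * KIB, ap_size_bytes=8 * MIB):
    """Total size of the outstanding workunit files, expressed in GiB."""
    total_bytes = mb_workunits * mb_size_bytes + ap_workunits * ap_size_bytes
    return total_bytes / GIB


def xor_blocks(blocks):
    """Byte-wise XOR of equal-length blocks (the parity operation assumed here)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# Round numbers from the thread: 5 million MB and 100 thousand AP workunits.
estimate = storage_estimate_gib(5_000_000, 100_000)
print(f"Estimated storage: {estimate:,.0f} GiB")

# Toy stripe (hypothetical data): three data blocks plus one parity block.
data = [b"SETI", b"HOME", b"RAID"]
parity = xor_blocks(data)

# Pretend the second disk died and rebuild its block from the survivors.
rebuilt = xor_blocks([data[0], data[2], parity])
print(rebuilt)
```

Counted in binary units, the estimate lands close to Richard's round figure of 2,500 gigabytes. The XOR of the surviving blocks gives back the missing one, which is why a rebuild has to read every byte on the remaining disks.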
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.