Blips and Bursts (Aug 07 2008)

Message boards : Technical News : Blips and Bursts (Aug 07 2008)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · Next

AuthorMessage
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 799938 - Posted: 19 Aug 2008, 16:11:54 UTC

No stats on Boinc Synergy, good leap yesterday however today credit is 0
ID: 799938 · Report as offensive
Bob Merrill

Send message
Joined: 7 Jun 99
Posts: 120
Credit: 8,531,677
RAC: 19
United States
Message 800227 - Posted: 20 Aug 2008, 22:27:02 UTC - in response to Message 794321.  

Towards the end of the afternoon yesterday we put in a new scheduler to fix a bug with "anonymous platforms" and the way they handle Astropulse workunits. This is working fine as far as I know, but at first there were some brief issues with uploads in general (human error when installing new scheduler).

Today got our new NAS machine into the closet. We're close to removing the old NetApp filer, which still works great after so many years, but the drives are too small and we can't afford support on this system, and buying new replacement drives is prohibilitively expensive. Plus the thing is just physically huge - a whole rack taking up a third of our closet for only 3 TB raw space. We're replacing it with a 3U system that will ultimately have about 7 TB raw space. Getting that into the closet meant I was able to fire up another server-to-be today in our prep lab and get that configured.

Traffic-wise we're still trying to get a feel for our demand and our bottlenecks. Eric wrote a script that is busy deleting antique workunits/results that exist on disk but not in the database (not sure why the antique deleter built into BOINC isn't working...). This will clear up additional much needed room but this is pretty much all we can do short of getting a whole new workunit storage server.

Looks like web code was updated just now, breaking a thing or two. I think Dave's addressing that stuff. I've been mostly catching up on several behind-the-scenes programming projects today.

- Matt



This was posted on the 7th, it is now the 20th. I know Matt is on vacation, and I would not begurdgd him time off for the world. But, isn't there any one else at SAH that could post a short update while is is off? There have been problem up the yeng yang and no info as to what is going on except for a few short notes on the start page. Its no fun being in the dark!!!!
ID: 800227 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 800274 - Posted: 21 Aug 2008, 1:18:36 UTC - in response to Message 800227.  
Last modified: 21 Aug 2008, 1:19:35 UTC

Towards the end of the afternoon yesterday we put in a new scheduler to fix a bug with "anonymous platforms" and the way they handle Astropulse workunits. This is working fine as far as I know, but at first there were some brief issues with uploads in general (human error when installing new scheduler).

Today got our new NAS machine into the closet. We're close to removing the old NetApp filer, which still works great after so many years, but the drives are too small and we can't afford support on this system, and buying new replacement drives is prohibilitively expensive. Plus the thing is just physically huge - a whole rack taking up a third of our closet for only 3 TB raw space. We're replacing it with a 3U system that will ultimately have about 7 TB raw space. Getting that into the closet meant I was able to fire up another server-to-be today in our prep lab and get that configured.

Traffic-wise we're still trying to get a feel for our demand and our bottlenecks. Eric wrote a script that is busy deleting antique workunits/results that exist on disk but not in the database (not sure why the antique deleter built into BOINC isn't working...). This will clear up additional much needed room but this is pretty much all we can do short of getting a whole new workunit storage server.

Looks like web code was updated just now, breaking a thing or two. I think Dave's addressing that stuff. I've been mostly catching up on several behind-the-scenes programming projects today.

- Matt



This was posted on the 7th, it is now the 20th. I know Matt is on vacation, and I would not begurdgd him time off for the world. But, isn't there any one else at SAH that could post a short update while is is off? There have been problem up the yeng yang and no info as to what is going on except for a few short notes on the start page. Its no fun being in the dark!!!!


I ain't at Berkeley but... There was a rough patch about a week ago... folks worked late and gave up their weekend to make things right. Everything is running better than expected... WUs are being sent out... Results are being received... Everyone is sleeping better at night as a result. Oh yeah... I haven't received an AP WU since 6 Aug... So I figure that the fix worked.

BOINC On...On...
JDWhale
ID: 800274 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 800395 - Posted: 21 Aug 2008, 12:17:14 UTC

two things... first, when they started up ap they had at least 3 splitters going. during the mess, they turned 2 off and then settled on running just 2 splitters, which is the case today. so throttling the ap wu creation may have helped to stablize things.

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well. personally, there are problems with ap and the credit granting that are being hotly discussed. so I don't want to waste any computer time until those issues settle down a bit. maybe in a month or two I will enable receiving ap's. there are plenty of other guinea pigs I suspect.

not everything is right yet with ap because I still have at least two ghost wu's assigned to me eventhough they don't exist on my clients. that should cure itself in september, or earlier if the staff decide to fix up the backend, I suppose.
ID: 800395 · Report as offensive
baid

Send message
Joined: 22 May 03
Posts: 3
Credit: 15,640,896
RAC: 0
Czech Republic
Message 800406 - Posted: 21 Aug 2008, 13:08:29 UTC - in response to Message 800395.  

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well.


And where exactly is that option?
ID: 800406 · Report as offensive
Aurora Borealis
Volunteer tester
Avatar

Send message
Joined: 14 Jan 01
Posts: 3075
Credit: 5,631,463
RAC: 0
Canada
Message 800408 - Posted: 21 Aug 2008, 13:33:05 UTC - in response to Message 800406.  
Last modified: 21 Aug 2008, 13:34:39 UTC

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well.


And where exactly is that option?

Your account SETI@home preferences. Don't forget to update the project in Boinc manager.

Boinc V7.2.42
Win7 i5 3.33G 4GB, GTX470
ID: 800408 · Report as offensive
Ingleside
Volunteer developer

Send message
Joined: 4 Feb 03
Posts: 1546
Credit: 15,832,022
RAC: 13
Norway
Message 800409 - Posted: 21 Aug 2008, 13:34:43 UTC - in response to Message 800406.  
Last modified: 21 Aug 2008, 13:38:42 UTC

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well.


And where exactly is that option?

It's the project-specific preferences, SETI@home preferences, the same place you'll set resource-share, default computer-location, show/hide computers, send/don't send email...

It's possible to configure upto 4 different Venues, in case you'll want to have different applications on various computers.
"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
ID: 800409 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 800473 - Posted: 21 Aug 2008, 18:10:23 UTC - in response to Message 800395.  

two things... first, when they started up ap they had at least 3 splitters going. during the mess, they turned 2 off and then settled on running just 2 splitters, which is the case today. so throttling the ap wu creation may have helped to stablize things.

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well. personally, there are problems with ap and the credit granting that are being hotly discussed. so I don't want to waste any computer time until those issues settle down a bit. maybe in a month or two I will enable receiving ap's. there are plenty of other guinea pigs I suspect.

not everything is right yet with ap because I still have at least two ghost wu's assigned to me eventhough they don't exist on my clients. that should cure itself in september, or earlier if the staff decide to fix up the backend, I suppose.

We had exactly the same situation back in December when a whole string of "shorties" came through.

You may remember this thread?
ID: 800473 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 800485 - Posted: 21 Aug 2008, 18:57:59 UTC - in response to Message 800473.  

two things... first, when they started up ap they had at least 3 splitters going. during the mess, they turned 2 off and then settled on running just 2 splitters, which is the case today. so throttling the ap wu creation may have helped to stablize things.

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well. personally, there are problems with ap and the credit granting that are being hotly discussed. so I don't want to waste any computer time until those issues settle down a bit. maybe in a month or two I will enable receiving ap's. there are plenty of other guinea pigs I suspect.

not everything is right yet with ap because I still have at least two ghost wu's assigned to me eventhough they don't exist on my clients. that should cure itself in september, or earlier if the staff decide to fix up the backend, I suppose.

We had exactly the same situation back in December when a whole string of "shorties" came through.

You may remember this thread?


Perhaps they need a quantitative model of how the system should work so that warning flags can be raised and adjustments made before the s..t hits the fan. I guess it's easier not to be proactive, given the nature of the project.
ID: 800485 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 800508 - Posted: 21 Aug 2008, 20:25:16 UTC - in response to Message 800485.  

two things... first, when they started up ap they had at least 3 splitters going. during the mess, they turned 2 off and then settled on running just 2 splitters, which is the case today. so throttling the ap wu creation may have helped to stablize things.

they also created an option to not accept any ap wu's. I selected that option and suspect others may have as well. personally, there are problems with ap and the credit granting that are being hotly discussed. so I don't want to waste any computer time until those issues settle down a bit. maybe in a month or two I will enable receiving ap's. there are plenty of other guinea pigs I suspect.

not everything is right yet with ap because I still have at least two ghost wu's assigned to me eventhough they don't exist on my clients. that should cure itself in september, or earlier if the staff decide to fix up the backend, I suppose.

We had exactly the same situation back in December when a whole string of "shorties" came through.

You may remember this thread?


Perhaps they need a quantitative model of how the system should work so that warning flags can be raised and adjustments made before the s..t hits the fan. I guess it's easier not to be proactive, given the nature of the project.

There may be times where that could help.

The solution that has worked reasonably well in the past is to always split tapes randomly. If the work comes in while different studies are controlling the telescope, they'll have a diverse set of angle ranges.

What happens through when all you have is short work -- when everything comes from the same angle range (same telescope motion)?

Seems to me that the only choice is to put a cap on the maximum number of split work units per day (or per hour) and then deal with all the complaints about the project running out of work (and falling RAC).

... or, you can add another study with big run-times and no "shorties" to counterbalance Multibeam when all the Multibeam work is ugly.
ID: 800508 · Report as offensive
Profile Neil Blaikie
Volunteer tester
Avatar

Send message
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 800555 - Posted: 21 Aug 2008, 22:33:56 UTC
Last modified: 21 Aug 2008, 22:34:54 UTC

Seems like there might be problems again. I have been getting this message for the past few hours now.

8/21/2008 6:30:58 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:31:01 PM||Project communication failed: attempting access to reference site
8/21/2008 6:31:02 PM||Access to reference site succeeded - project servers may be temporarily down.
8/21/2008 6:31:03 PM|SETI@home|Scheduler request failed: Server returned nothing (no headers, no data)
8/21/2008 6:32:04 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:32:09 PM|SETI@home|Scheduler request succeeded: got 0 new tasks
8/21/2008 6:32:09 PM|SETI@home|Message from server: Server error: can't attach shared memory

Have tried exiting the software but it always come up with the same messages.
ID: 800555 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 800563 - Posted: 21 Aug 2008, 22:45:13 UTC - in response to Message 800555.  

Seems like there might be problems again. I have been getting this message for the past few hours now.

8/21/2008 6:30:58 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:31:01 PM||Project communication failed: attempting access to reference site
8/21/2008 6:31:02 PM||Access to reference site succeeded - project servers may be temporarily down.
8/21/2008 6:31:03 PM|SETI@home|Scheduler request failed: Server returned nothing (no headers, no data)
8/21/2008 6:32:04 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:32:09 PM|SETI@home|Scheduler request succeeded: got 0 new tasks
8/21/2008 6:32:09 PM|SETI@home|Message from server: Server error: can't attach shared memory

Have tried exiting the software but it always come up with the same messages.

See the text in bold above. It is a server error. Someone will kick it shortly, I'm sure.

ID: 800563 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 800565 - Posted: 21 Aug 2008, 22:47:17 UTC - in response to Message 800563.  

Seems like there might be problems again. I have been getting this message for the past few hours now.

8/21/2008 6:30:58 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:31:01 PM||Project communication failed: attempting access to reference site
8/21/2008 6:31:02 PM||Access to reference site succeeded - project servers may be temporarily down.
8/21/2008 6:31:03 PM|SETI@home|Scheduler request failed: Server returned nothing (no headers, no data)
8/21/2008 6:32:04 PM|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 10 completed tasks
8/21/2008 6:32:09 PM|SETI@home|Scheduler request succeeded: got 0 new tasks
8/21/2008 6:32:09 PM|SETI@home|Message from server: Server error: can't attach shared memory

Have tried exiting the software but it always come up with the same messages.

See the text in bold above. It is a server error. Someone will kick it shortly, I'm sure.

Have a look at the Server Status Page. Somebody is kicking it as we speak.
ID: 800565 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 800567 - Posted: 21 Aug 2008, 22:51:31 UTC - in response to Message 800565.  


8/21/2008 6:32:09 PM|SETI@home|Message from server: Server error: can't attach shared memory

Have tried exiting the software but it always come up with the same messages.

See the text in bold above. It is a server error. Someone will kick it shortly, I'm sure.

Have a look at the Server Status Page. Somebody is kicking it as we speak.

To me, a random server error isn't a call-to-action. I just assume that if it needs kicking, they'll kick it sooner or later.
ID: 800567 · Report as offensive
Invisible Man

Send message
Joined: 24 Jun 01
Posts: 22
Credit: 1,129,336
RAC: 0
United Kingdom
Message 800568 - Posted: 21 Aug 2008, 22:57:04 UTC - in response to Message 796900.  

FYI unless some other staff member picks it up, there won't be any Tech News for awhile. Matt is on vacation.


Out of 170 messages, it was Blurf who first gave us the above news on 12th August.
All will be revealed when Matt gets back. Until then, we will have to be patient!
ID: 800568 · Report as offensive
Profile Neil Blaikie
Volunteer tester
Avatar

Send message
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 800592 - Posted: 21 Aug 2008, 23:58:23 UTC
Last modified: 21 Aug 2008, 23:58:58 UTC

I did notice that someone was kicking the server just shortly after I posted the message.
Hope Matt is not on vacation in Florida and actually having some well earned rest and recuperation.
Not too worried about the servers being down again as I ample work to keep me going for ages. Will be fixed when it is fixed.

Thanks for pointing it out that someone was looking at it though.
8PM EST, Cold Bud in fridge, time to consume one or two and then play some Wii :-)
ID: 800592 · Report as offensive
JBWoolley

Send message
Joined: 8 May 07
Posts: 35
Credit: 6,214,366
RAC: 0
United States
Message 800981 - Posted: 22 Aug 2008, 21:40:18 UTC

First, I'm sorry for showing my newbie ignorance. Please be kind.

Have read about the issues Seti is having with their servers, and being new I don't know if or how these are effecting my crunching / credits.

Have several Windows quad-cores crunching Seti only. Downloads, crunching and uploads seem to be working as usual. And the "Total User Credits" seems to be going as usual. I just broke the 1,000,000 mark. :-)

But the stats show that ALL the "Host Averages" on all machines taking a serious dive.

Anyone know why? And if/when back to normal...?

Thanks, Jack
ID: 800981 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 801007 - Posted: 22 Aug 2008, 22:11:06 UTC - in response to Message 800981.  

First, I'm sorry for showing my newbie ignorance. Please be kind.

Have read about the issues Seti is having with their servers, and being new I don't know if or how these are effecting my crunching / credits.

Have several Windows quad-cores crunching Seti only. Downloads, crunching and uploads seem to be working as usual. And the "Total User Credits" seems to be going as usual. I just broke the 1,000,000 mark. :-)

But the stats show that ALL the "Host Averages" on all machines taking a serious dive.

Anyone know why? And if/when back to normal...?

Thanks, Jack


Welcome to the message boards.

A couple possibilities for Host Averages(RAC) dropping...

1) Recent drop of credit per WU being enacted by the project.
2) Introduction of Astropulse WUs that are currently underclaiming credit.

If you make your hosts visible, the forum experts might be able to offer some additional help or suggestions.

Lots of good reading over in the Number Crunching forum.

Kind Regards,
JDWhale
ID: 801007 · Report as offensive
JBWoolley

Send message
Joined: 8 May 07
Posts: 35
Credit: 6,214,366
RAC: 0
United States
Message 801029 - Posted: 22 Aug 2008, 22:54:07 UTC - in response to Message 801007.  

First, I'm sorry for showing my newbie ignorance. Please be kind.

Have read about the issues Seti is having with their servers, and being new I don't know if or how these are effecting my crunching / credits.

Have several Windows quad-cores crunching Seti only. Downloads, crunching and uploads seem to be working as usual. And the "Total User Credits" seems to be going as usual. I just broke the 1,000,000 mark. :-)

But the stats show that ALL the "Host Averages" on all machines taking a serious dive.

Anyone know why? And if/when back to normal...?

Thanks, Jack


Welcome to the message boards.

A couple possibilities for Host Averages(RAC) dropping...

1) Recent drop of credit per WU being enacted by the project.
2) Introduction of Astropulse WUs that are currently underclaiming credit.

If you make your hosts visible, the forum experts might be able to offer some additional help or suggestions.

Lots of good reading over in the Number Crunching forum.

Kind Regards,
JDWhale


OK... as long as folks are aware. I'll just hang in there.

Seems strange though. If the project wants to move toward Astropulse, why not promote the move with higher credits... not less credit. Hummm. Strange.

Oh well. thanks for the answer. Jack

ID: 801029 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 801081 - Posted: 23 Aug 2008, 1:34:50 UTC - in response to Message 801029.  

First, I'm sorry for showing my newbie ignorance. Please be kind.

Have read about the issues Seti is having with their servers, and being new I don't know if or how these are effecting my crunching / credits.

Have several Windows quad-cores crunching Seti only. Downloads, crunching and uploads seem to be working as usual. And the "Total User Credits" seems to be going as usual. I just broke the 1,000,000 mark. :-)

But the stats show that ALL the "Host Averages" on all machines taking a serious dive.

Anyone know why? And if/when back to normal...?

Thanks, Jack


Welcome to the message boards.

A couple possibilities for Host Averages(RAC) dropping...

1) Recent drop of credit per WU being enacted by the project.
2) Introduction of Astropulse WUs that are currently underclaiming credit.

If you make your hosts visible, the forum experts might be able to offer some additional help or suggestions.

Lots of good reading over in the Number Crunching forum.

Kind Regards,
JDWhale


OK... as long as folks are aware. I'll just hang in there.

Seems strange though. If the project wants to move toward Astropulse, why not promote the move with higher credits... not less credit. Hummm. Strange.

Oh well. thanks for the answer. Jack


Offering more credits would only cause problems with other BOINC projects that are not offering as much, and would encourage them to higher the amount of credits that they offer to attract any number of lost users. If every project did this, there would be a credit inflation problem which would devalue each contribution.
ID: 801081 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · Next

Message boards : Technical News : Blips and Bursts (Aug 07 2008)


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.