Blips and Bursts (Aug 07 2008)

Profile ML1
Volunteer moderator
Volunteer tester
Joined: 25 Nov 01
Posts: 21681
Credit: 7,508,002
RAC: 20
United Kingdom
Message 797419 - Posted: 13 Aug 2008, 21:06:39 UTC - in response to Message 797395.  

I'm sure you crunched many more SETI workunits per day, so it follows that your daily RAC would be higher, whereas AP workunits are hard to complete in a single day so the daily RAC will decline.

Hopefully, that is: the RAC will decline initially, until the big lumps of AP credit come through to once more restore the original RAC...
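
The dip-then-recover pattern described here can be illustrated with a toy model. This is only a sketch: it assumes RAC behaves like an exponentially weighted average with a one-week half-life, updated once per day, and the credit figures (100 per day versus one lump of 700 every seventh day) are invented for illustration. `update_rac` and `simulate` are hypothetical helpers, not BOINC code.

```python
HALF_LIFE_DAYS = 7.0  # assumption: the average halves after a week with no credit


def update_rac(rac: float, credit_today: float) -> float:
    """Advance the toy RAC by one day (hypothetical helper, not BOINC's code)."""
    decay = 0.5 ** (1.0 / HALF_LIFE_DAYS)
    return rac * decay + (1.0 - decay) * credit_today


def simulate(daily_credits, start_rac: float):
    """Return the toy RAC after each day in daily_credits."""
    rac = start_rac
    history = []
    for credit in daily_credits:
        rac = update_rac(rac, credit)
        history.append(rac)
    return history


# Same average credit per week, granted either steadily or in one lump.
steady = simulate([100.0] * 28, start_rac=100.0)
lumpy = simulate(([0.0] * 6 + [700.0]) * 4, start_rac=100.0)

if __name__ == "__main__":
    for day, (s, l) in enumerate(zip(steady, lumpy), start=1):
        print(f"day {day:2d}: steady {s:6.1f}   lumpy {l:6.1f}")
```

Under these assumptions the steady series stays flat, while the lumpy series sags between grants and jumps back each time a lump of credit arrives.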

Happy crunchin',
Martin

See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 797419
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 797464 - Posted: 13 Aug 2008, 21:59:15 UTC

There we go. Woke up a little while ago and got 62 new tasks. About half are still pending for download. I still had 15 or so left anyway, but now I don't have to sweat it.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving up)
ID: 797464
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 797752 - Posted: 14 Aug 2008, 11:38:38 UTC - in response to Message 797095.  

See that the servers are down again. No work. Unable to join other projects because my client is SETI_Enhanced capable ONLY. So why do I see that I'm supposed to be working on an AstroPulse WU in my Task list? Which I don't have! Somebody should be checking on whether a client is capable of crunching AP WU's and not download them to that client. Now, the other person with my supposed WU is going to have to wait for the timeout on me and then wait again for somebody else to crunch that WU.


The only way it should have sent that to you is if your app_info.xml file says that you can crunch astropulse. Of course if somehow that app_info.xml file got deleted, you aren't crunching optimized any more and astropulse will be downloaded to your machine.



My app_info.xml is present and IS NOT optimized and states it is specifically ONLY for the Seti_Enhanced application as that is the only application we have ever had for our operating system. BOINC did something wrong in my case. I was just trying to let the operators know they goofed somehow on the AP work unit distribution.
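
For anyone who wants to double-check what their own file declares, here is a minimal sketch. It assumes the usual anonymous-platform layout of `app_info.xml` (`<app><name>` and `<app_version><app_name>` elements); the sample contents, file name, and version number below are placeholders, the application names are assumptions, and `declared_apps` and `declares_app` are hypothetical helpers rather than anything BOINC ships.

```python
import xml.etree.ElementTree as ET

# Hypothetical app_info.xml contents; the file name and version are placeholders.
SAMPLE_APP_INFO = """\
<app_info>
  <app>
    <name>setiathome_enhanced</name>
  </app>
  <file_info>
    <name>example_seti_enhanced_binary</name>
    <executable/>
  </file_info>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>527</version_num>
    <file_ref>
      <file_name>example_seti_enhanced_binary</file_name>
      <main_program/>
    </file_ref>
  </app_version>
</app_info>
"""


def declared_apps(app_info_xml: str) -> set:
    """Return the application names an app_info.xml document declares."""
    root = ET.fromstring(app_info_xml)
    names = set()
    for path in ("app/name", "app_version/app_name"):
        for element in root.findall(path):
            if element.text:
                names.add(element.text.strip())
    return names


def declares_app(app_info_xml: str, app_name: str) -> bool:
    """True if the document declares the given application name."""
    return app_name in declared_apps(app_info_xml)


if __name__ == "__main__":
    print(sorted(declared_apps(SAMPLE_APP_INFO)))
    print(declares_app(SAMPLE_APP_INFO, "astropulse"))
```

If the second check comes back false for your real file, the file itself is not what asked for Astropulse work.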

Keith
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 797752
Profile Dave & Tammy Binning
Joined: 4 Sep 04
Posts: 10
Credit: 1,287,669
RAC: 0
United States
Message 797754 - Posted: 14 Aug 2008, 11:52:40 UTC - in response to Message 797391.  

I haven't been able to get new workunits for at least 4 days now. Anyone know when SETI@home might get fixed?



I had 7 WUs start to download about 2 hrs ago, 4 of which downloaded OK. However, the final 3 are still pending, and I get a report that keeps telling me the servers are down again. Ho hum!

Tom.


The downloads have really started coming in for me last night. It seems to be back to normal now.

ID: 797754
Profile Robert Gammon
Volunteer tester
Joined: 29 Aug 01
Posts: 21
Credit: 1,573,250
RAC: 0
United States
Message 797771 - Posted: 14 Aug 2008, 12:44:16 UTC - in response to Message 797754.  

I was allowed to get 10 WUs early Tuesday. However, uploads are still not allowed: they time out with "Access to refer site succeeded, project servers may be down". This has been going on for the last 24 hours. Access for me is twice a day for 45 minutes at a pop.
ID: 797771
Tom Richardson.
Joined: 19 Jan 08
Posts: 4
Credit: 3,338
RAC: 0
United Kingdom
Message 797785 - Posted: 14 Aug 2008, 13:27:36 UTC - in response to Message 797754.  

I haven't been able to get new workunits for at least 4 days now. Anyone know when SETI@home might get fixed?



I had 7 WUs start to download about 2 hrs ago, 4 of which downloaded OK. However, the final 3 are still pending, and I get a report that keeps telling me the servers are down again. Ho hum!

Tom.


The downloads have really started coming in for me last night. It seems to be back to normal now.


Yes, things are starting to pick up now, at least for me. More WUs were downloaded last night and again this morning. I wonder if it will continue that way; let's hope so.

Tom.
ID: 797785
Profile arr25b
Joined: 19 Nov 05
Posts: 16
Credit: 14,839,632
RAC: 0
United Kingdom
Message 797794 - Posted: 14 Aug 2008, 13:58:25 UTC - in response to Message 797785.  

I haven't been able to get new workunits for at least 4 days now. Anyone know when SETI@home might get fixed?



I had 7 WUs start to download about 2 hrs ago, 4 of which downloaded OK. However, the final 3 are still pending, and I get a report that keeps telling me the servers are down again. Ho hum!

Tom.


The downloads have really started coming in for me last night. It seems to be back to normal now.


Yes, things are starting to pick up now, at least for me. More WUs were downloaded last night and again this morning. I wonder if it will continue that way; let's hope so.

Tom.



I wish I had the same luck as you. I can't upload and can't get WUs; same old error: project servers may be down.
ID: 797794
seti@elrcastor.com
Volunteer tester
Joined: 30 Jan 00
Posts: 35
Credit: 4,879,559
RAC: 0
United States
Message 797798 - Posted: 14 Aug 2008, 14:17:29 UTC - in response to Message 797794.  
Last modified: 14 Aug 2008, 14:18:08 UTC


I wish I had the same luck as you. I can't upload and can't get WUs; same old error: project servers may be down.



Try exiting your BOINC client and starting it back up again.
ID: 797798
tokahai
Joined: 12 Mar 01
Posts: 4
Credit: 10,957,765
RAC: 1
Germany
Message 797814 - Posted: 14 Aug 2008, 15:22:20 UTC

I've got some WUs, too, but at the moment it's impossible to upload anything and there are also a lot of WU downloads still pending. Recovery in the s@h lab seems to proceed very slowly this time.
ID: 797814
PhonAcq
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 797831 - Posted: 14 Aug 2008, 15:41:07 UTC

Crickets is the key to understanding right now. Check the I/O loading first. If it is pegged like it is now, then U/L and D/L are going to be problematic.

The question for me is why has the total bandwidth been pegged for so long? At 90 Mbps for about 24h, that is about 10^12 bytes transferred out of the servers in the last day. A terabyte. Hard to believe, unless there is a problem somewhere. Will this persist through the weekend? Will be fun to see.
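
The back-of-the-envelope figure can be checked in a few lines. This sketch assumes the link really did sit at a constant 90 Mbps for the full 24 hours and that "Mbps" means 10^6 bits per second; `bytes_transferred` is a hypothetical helper name.

```python
def bytes_transferred(rate_mbps: float, hours: float) -> float:
    """Bytes moved by a link running at a constant rate for the given time.

    Assumes 1 Mbps = 1,000,000 bits per second and 8 bits per byte.
    """
    bits_per_second = rate_mbps * 1_000_000
    seconds = hours * 60 * 60
    return bits_per_second / 8 * seconds


if __name__ == "__main__":
    total = bytes_transferred(rate_mbps=90, hours=24)
    print(f"{total:.3e} bytes, or {total / 1e12:.2f} TB (decimal)")
```

That works out to 9.72 × 10^11 bytes, so "about a terabyte a day" holds up as an order-of-magnitude estimate.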
ID: 797831
1mp0£173
Volunteer tester
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 797849 - Posted: 14 Aug 2008, 16:24:51 UTC - in response to Message 797831.  

Crickets is the key to understanding right now. Check the I/O loading first. If it is pegged like it is now, then U/L and D/L are going to be problematic.

The question for me is why has the total bandwidth been pegged for so long? At 90 Mbps for about 24h, that is about 10^12 bytes transferred out of the servers in the last day. A terabyte. Hard to believe, unless there is a problem somewhere. Will this persist through the weekend? Will be fun to see.

You might want to track down a copy of "Computer Networks" by Andrew Tanenbaum and look at what happens when network loading goes from 90% to 100% or just a little beyond.

Basically, new connections take up so much bandwidth that existing connections have trouble finishing.

I don't know if this is true, but we do know that 90 megabits is about capacity.
ID: 797849
Richard Haselgrove Project Donor
Volunteer tester
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 797872 - Posted: 14 Aug 2008, 17:04:12 UTC - in response to Message 797752.  

My app_info.xml is present and IS NOT optimized and states it is specifically ONLY for the Seti_Enhanced application as that is the only application we have ever had for our operating system. BOINC did something wrong in my case. I was just trying to let the operators know they goofed somehow on the AP work unit distribution.

Keith

There was a bug in app_info.xml handling, but it was fixed on 6 August.

If your AP task was allocated on or before 6 August, then BOINC did something wrong and has had its wrist slapped. Forget it and move on.

If your AP task was allocated on or after 7 August, please re-report it as a new problem.
ID: 797872
Tommy
Joined: 26 Jul 00
Posts: 9
Credit: 530,369
RAC: 0
United States
Message 797901 - Posted: 14 Aug 2008, 18:30:40 UTC

No work units going up or down; looks like another weekend with no WUs to crunch. What I find amazing is that no staff member has posted any updates for over 7 days, and that one announced "we were almost fixed, all it would take was time". Could someone who is actually working the problem please give us an update? Matt, are you on site? I am sure this will offend some of you, but that is not my intention.

Tommy
ID: 797901
Richard Haselgrove Project Donor
Volunteer tester
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 797907 - Posted: 14 Aug 2008, 18:36:56 UTC - in response to Message 797901.  

No work units going up or down; looks like another weekend with no WUs to crunch. What I find amazing is that no staff member has posted any updates for over 7 days, and that one announced "we were almost fixed, all it would take was time". Could someone who is actually working the problem please give us an update? Matt, are you on site? I am sure this will offend some of you, but that is not my intention.

Tommy

Matt is on vacation.

Eric posted in message 797866, an hour and a half ago,

We're still maxed out on number of connections and getting some timeouts. The fact that we're maxed on outbound bandwidth isn't helping.

We're going to try to get an additional machine handling uploads in the next couple hours.
ID: 797907
Profile ML1
Volunteer moderator
Volunteer tester
Joined: 25 Nov 01
Posts: 21681
Credit: 7,508,002
RAC: 20
United Kingdom
Message 797911 - Posted: 14 Aug 2008, 18:40:31 UTC - in response to Message 797901.  

... Could we have some one who is actually working the problem, please give us an update. Matt are you on site? ...

I'm very sure it's being worked on or it is being left to allow various surges and bottlenecks to settle. There are usually various parts of the system maxed out after an outage while a million or so very hungry hosts all try to grab whatever they can.

Meanwhile, Matt is indeed not onsite.

That doesn't mean nothing is being done. It's likely that the others are too busy or too sensible to risk the wrath of the forums!


Stay cool, the system always bounces (boincs) back...

Happy crunchin',
Martin

See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 797911
Profile ML1
Volunteer moderator
Volunteer tester
Joined: 25 Nov 01
Posts: 21681
Credit: 7,508,002
RAC: 20
United Kingdom
Message 797916 - Posted: 14 Aug 2008, 18:46:51 UTC - in response to Message 797911.  

... Could we have some one who is actually working the problem, please give us an update. ...

I'm very sure it's being worked on ...

See Eric's comment from a short while ago.

Meanwhile, note that there are known issues for when their 100Mbit/s link becomes saturated. Regardless, the surge will eventually push through. Just expect a few connection errors for the time being. The link is in effect 'strangled' after all.

Happy crunchin',
Martin

See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 797916
Profile Andy Lee Robinson
Joined: 8 Dec 05
Posts: 630
Credit: 59,973,836
RAC: 0
Hungary
Message 798009 - Posted: 14 Aug 2008, 22:20:13 UTC - in response to Message 797916.  
Last modified: 14 Aug 2008, 22:31:14 UTC

Just expect a few connection errors for the time being. The link is in effect 'strangled' after all.


Yes, like trying to evacuate a stadium through a single set of doors.

Without more 'doors', firewalling off parts of the net is one way to allow a more ordered recovery and reduce logjams. The firewall could alternate between allowing new connections from odd or even IP addresses for an hour each.
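
As a rough sketch of that decision rule only (in practice it would live in the firewall or router, not in Python): the function below admits a new connection when the parity of the client address matches the parity of the current hour. `admit_new_connection` is a hypothetical name, and using the clock hour as the alternation schedule is an assumption.

```python
import ipaddress
from datetime import datetime, timezone


def admit_new_connection(client_ip: str, now: datetime) -> bool:
    """Admit odd addresses during odd hours and even addresses during even hours."""
    address_is_odd = int(ipaddress.ip_address(client_ip)) % 2 == 1
    hour_is_odd = now.hour % 2 == 1
    return address_is_odd == hour_is_odd


if __name__ == "__main__":
    one_pm = datetime(2008, 8, 14, 13, 0, tzinfo=timezone.utc)
    for ip in ("192.0.2.1", "192.0.2.2"):
        print(ip, admit_new_connection(ip, one_pm))
```

Existing connections would be left alone; only new ones are gated, so roughly half of the hosts get a turn in each hour.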

What is strange is that when downloads started working, uploads were still almost completely stalled for hours, and they are less than a tenth of the size of the downloads. Something is quite seriously unbalanced and needs a good look.

Also, it would be good to delay the credit run after outages until the surge dies down, or change the timing of the credit runs to finish just before the Tuesday outage to avoid the weekly credit spikes in the stats.
ID: 798009
Profile Dave & Tammy Binning
Joined: 4 Sep 04
Posts: 10
Credit: 1,287,669
RAC: 0
United States
Message 798043 - Posted: 14 Aug 2008, 23:30:52 UTC

My work units are back to normal as far as I can tell, but my uploads are taking some time to reach "ready to report" status. Once they get to that stage, they seem to report quicker than normal. I don't have a problem; I am just reporting on what seems to be happening with my two computers.
ID: 798043
DJStarfox
Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 798046 - Posted: 14 Aug 2008, 23:42:38 UTC - in response to Message 798009.  

What is strange is that when downloads started working, uploads were still almost completely stalled for hours, and they are less than a tenth of the size of the downloads. Something is quite seriously unbalanced and needs a good look.


They would have to partition their internet connection, meaning QoS controls would have to reserve some of the bandwidth for each direction. You have to be very careful about doing this, as it will limit bandwidth on each direction absolutely. Also, I'd have to ask Matt if their Cisco router could even do it as is.
ID: 798046
Profile Bill Walker
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 798143 - Posted: 15 Aug 2008, 2:05:14 UTC

Having "lurked" here for a few days, I have to admit that I have really learned nothing. Either the solutions/theories being presented are way over my head, or you guys don't know s..t.

Having been through several major SETI breakdowns in the last few years, I have learned patience, and to have other projects on the go. My ultimate fallback at the moment is climateprediction.net. I have two tasks under way with them, with completion dates of September 2010 and May 2009. When those dates pass without new tasks, I will sell my computers and return to knitting.

ID: 798143


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.