Project Back Online After Overnight Outage

Message boards : News : Project Back Online After Overnight Outage
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Josephine Raiani

Send message
Joined: 20 Jun 99
Posts: 2
Credit: 3,580,137
RAC: 2
United States
Message 1195823 - Posted: 15 Feb 2012, 23:33:33 UTC - in response to Message 1195298.  

I see that you guys had a problem and supposedly it is now fixed. Well I have noticed that my Boinc Manager has not been running any work units for the past couple of days. It keeps reading "ERROR" "no work available to process". There is one work unit that is 100% complete but it has not been sent back to SETI, it's just sitting there in my "projects". Also, there is nothing in "tasks" and nothing in "transfers". I don't know if this problem is on my end or yours. Help!!! Help!!!
ID: 1195823 · Report as offensive
Profile AReimpell
Avatar

Send message
Joined: 20 Oct 99
Posts: 12
Credit: 16,970,443
RAC: 0
Venezuela
Message 1195836 - Posted: 16 Feb 2012, 0:32:56 UTC - in response to Message 1195823.  
Last modified: 16 Feb 2012, 1:08:25 UTC

I have noticed that my Boinc Manager has not been running any work units for the past couple of days. It keeps reading "ERROR" "no work available to process". There is one work unit that is 100% complete but it has not been sent back to SETI, it's just sitting there in my "projects". Also, there is nothing in "tasks" and nothing in "transfers."


Josephine:
I think you're a little bit confused.
What you see in the "Projects" tab are not work units. This is a list of all your attached boinc projects, their share over your computer's resources, and their status.
Looking at your account, I see you're only attached to seti, so obviously you see a 100% share (which -I think- you're misinterpreting as a 100% completed work unit).
SETI has been having problems in the generation of new work for several days. That's why your pc -after finishing previously received work- has not received new work units to process and ran empty. So you see the "no work available to process" message in the seti's status.
Once we get back to normal (we're on that path) you'll see a list of your new work unints in the "Tasks" tab, with their respective status, progress bar, time elapsed, time to go, etc.

Hope it helps...

Regards,
AR
ID: 1195836 · Report as offensive
Profile LiliKrist
Volunteer tester
Avatar

Send message
Joined: 12 Aug 09
Posts: 333
Credit: 143,167
RAC: 0
Indonesia
Message 1195848 - Posted: 16 Feb 2012, 1:16:29 UTC
Last modified: 16 Feb 2012, 1:16:59 UTC

No WU to crunch since yesterday =)


N = R x fp x ne x fl x fi x fc x L
ID: 1195848 · Report as offensive
triplemmm

Send message
Joined: 21 Nov 00
Posts: 27
Credit: 8,764,139
RAC: 13
Australia
Message 1195871 - Posted: 16 Feb 2012, 2:16:39 UTC

is there an issue with seti or are we just in a catch phase and i have been trying to get WU since 12th Feb without luck
just need to know so i can turn off PC's if not in use
ID: 1195871 · Report as offensive
Profile cliff
Avatar

Send message
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1195880 - Posted: 16 Feb 2012, 3:05:39 UTC - in response to Message 1195777.  

Not so happy, seems the MB splitters are offline again:-( And no new work created..
Guess the servers waited until Matt went home and promptly went offline again.

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1195880 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1195888 - Posted: 16 Feb 2012, 3:58:22 UTC

Nope, just no tapes on the game...
ID: 1195888 · Report as offensive
Profile Ronal E. Zepeda Trujillo
Avatar

Send message
Joined: 14 Jul 05
Posts: 9
Credit: 3,167,018
RAC: 0
Chile
Message 1195963 - Posted: 16 Feb 2012, 12:11:41 UTC

works fine for me!
great job guys!
Only a boy with responsabilities of an old man
ID: 1195963 · Report as offensive
Profile cliff
Avatar

Send message
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1196107 - Posted: 16 Feb 2012, 20:27:14 UTC - in response to Message 1195963.  

Yup working for me as well, so far about 100 or so WU downloaded. Only problem is that the d/loads keep stuttering and stalling. Havta kick em in the rear end with network on/off routines.

Still its a lot better than a dry well:-)


Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1196107 · Report as offensive
Josephine Raiani

Send message
Joined: 20 Jun 99
Posts: 2
Credit: 3,580,137
RAC: 2
United States
Message 1196126 - Posted: 16 Feb 2012, 22:42:32 UTC - in response to Message 1195836.  

Thank you. Yes, you did clear things up for me. I will now be patient and wait for the work units to come in. I did think that there was a problem with the communications between Seti and my computer. But, again you did clear that up for me.
ID: 1196126 · Report as offensive
Profile archangel
Avatar

Send message
Joined: 25 Apr 01
Posts: 62
Credit: 1,842,428
RAC: 0
United States
Message 1196631 - Posted: 18 Feb 2012, 5:04:11 UTC

You know, when we have a server crash or an outage at Home Depot, if i took this long to get the system back up, or properly notify the users of staging developments, I'd get fired...

yay. unions and tenure, eh?

>.< buncha dope smoking hippies running a mainframe if ya's ask me.

<imaginary fadeout>
<3 days late>
meh, sorry guys, we were out back hitting this new gravity bong and then Dweezle heard this sound, and it turns out the alarm had been going off for like, three days, and we were like wow man, and he was like totally man, then we hit the gravity bong some more...

then, when we woke up, the new parts were here and we totally put them on the server, but then we forgot, and we looked for the parts for like 2 more days before we finally found them on the server, then we hit the gravity bong some more.

when we finally woke up, Dweezle said, "Oh wow man, we totally need to put those parts in, otherwise the server can't actually get the downloads out, it can only accept queries", and we were like oh, totally man, then we hit the gravity bong some more...

<imaginary sequence ends fade-in>

FFS man, call geek squad if you can't fix it!!

8)
ID: 1196631 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13854
Credit: 208,696,464
RAC: 304
Australia
Message 1196760 - Posted: 18 Feb 2012, 8:41:38 UTC - in response to Message 1196631.  
Last modified: 18 Feb 2012, 8:42:26 UTC

You know, when we have a server crash or an outage at Home Depot, if i took this long to get the system back up, or properly notify the users of staging developments, I'd get fired...

I'd hope so.
It's a business, that's why they pay all the money for that hardware & support.
This isn't, they rely on donations. If you want them to have 24/7 up time & notifications of what's going on, how about you providing all the hardware & money required to support it?
Grant
Darwin NT
ID: 1196760 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36774
Credit: 261,360,520
RAC: 489
Australia
Message 1196783 - Posted: 18 Feb 2012, 10:30:48 UTC - in response to Message 1196760.  

You know, when we have a server crash or an outage at Home Depot, if i took this long to get the system back up, or properly notify the users of staging developments, I'd get fired...

I'd hope so.
It's a business, that's why they pay all the money for that hardware & support.
This isn't, they rely on donations. If you want them to have 24/7 up time & notifications of what's going on, how about you providing all the hardware & money required to support it?

Plus the people working on the setup only work on it part-time, unless you want to pay for a full-time person to baby-sit it all.

Cheers.
ID: 1196783 · Report as offensive
Profile archangel
Avatar

Send message
Joined: 25 Apr 01
Posts: 62
Credit: 1,842,428
RAC: 0
United States
Message 1196821 - Posted: 18 Feb 2012, 14:03:42 UTC - in response to Message 1196760.  
Last modified: 18 Feb 2012, 14:21:59 UTC

"If you want them to have 24/7 up time & notifications of what's going on, how about you providing all the hardware & money required to support it?"

That would be a web enabled smartphone, right?

All they need to do is set up a log file, and monitor it with trace32.exe.

Set up ping over time to each server, an outbound and inbound bandwidth ping set for 20k packets, and have the scheduler log delivered WU's to the log as well.

Then, with a smartphone you could monitor the log, see any interruption to any server, any interruption to the outbound WU's, any bandwidth constraints or interruptions to the network, and from home, make a post online detailing whhic server is down,, and or what the problem is.

You could even sprint for a laptop if you wanted to spend some real money, and remote the servers or the monitoring PC.

Course, now we are talking nearly $600.

I'll spring for that though, if they are short.

8)
ID: 1196821 · Report as offensive
Profile Slavac
Volunteer tester
Avatar

Send message
Joined: 27 Apr 11
Posts: 1932
Credit: 17,952,639
RAC: 0
United States
Message 1196822 - Posted: 18 Feb 2012, 14:13:53 UTC - in response to Message 1196821.  

"If you want them to have 24/7 up time & notifications of what's going on, how about you providing all the hardware & money required to support it?"

That would be a web enabled smartphone, right?

K, i'll spring for that 8)


Not quite. Here's the dent our donors have made in the past few months:

http://gpuug.org/purchases


Executive Director GPU Users Group Inc. -
brad@gpuug.org
ID: 1196822 · Report as offensive
Profile archangel
Avatar

Send message
Joined: 25 Apr 01
Posts: 62
Credit: 1,842,428
RAC: 0
United States
Message 1196823 - Posted: 18 Feb 2012, 14:28:37 UTC - in response to Message 1196822.  
Last modified: 18 Feb 2012, 14:37:21 UTC

Not quite. Here's the dent our donors have made in the past few months:

http://gpuug.org/purchases



I don't see what any of that has to do with providing a timely update on outages.

Sure, a nice new server would be great and all, but i think the effort required to get notices out in and of itself would be minimal and doable even for someone sitting at home on a sofa, just using windows utilities.

After seeing Seti was back up and WU's were getting through, i took the time to go around to my PC's and revert them back over to Seti from E@home...

When you have 9 computers, that takes about half an hour. To set them all back takes another half hour...

A timely update could have saved me that aggravation, that's all I'm saying.

You could even set up an alarm on trace32 monitoring the log to send a high priority email alert to a distribution list, or, if you are *super* lazy, you could set it up to post an alert to the message board itself...

Course, if you had a repeating failure, like an outage, that could result in spam posts, so probably best to have it alert a distribution list.
ID: 1196823 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1196888 - Posted: 18 Feb 2012, 17:15:59 UTC

This is a science project. Right now, there are more volunteers than there are WUs, on a regular basis.

The project is short of resources, and has to use the available resources where it helps the science. Timely updates on outages may make us feel good, but it does NOTHING for the science. So, the resources (including our donations) go elsewhere.

ID: 1196888 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1196891 - Posted: 18 Feb 2012, 17:23:20 UTC - in response to Message 1196823.  
Last modified: 18 Feb 2012, 17:29:04 UTC



After seeing Seti was back up and WU's were getting through, i took the time to go around to my PC's and revert them back over to Seti from E@home...

When you have 9 computers, that takes about half an hour. To set them all back takes another half hour...


I have 9 computers.
It doesn't take much effort to have another project on standby......
My faster rigs all have Einstein attached with a 0 percent share. They crunch only Seti until all WUs have been completed, and then automatically download a few hours of Einstein to work on. No muss, no fuss. If Seti work is received during the Einstein run, they switch back to processing Seti when the Einstein completes. If not, they get another batch of Einstein.
Very simple.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1196891 · Report as offensive
Profile archangel
Avatar

Send message
Joined: 25 Apr 01
Posts: 62
Credit: 1,842,428
RAC: 0
United States
Message 1197012 - Posted: 18 Feb 2012, 21:32:05 UTC - in response to Message 1196891.  

Ah, well my computers are across 2 sites, i have to drive to work.

I'll look into the configuration you suggested and see if i can figure it out, when i added E@home on mine, it downloaded like 15 E@H tasks and 1 S@H task, so i had to remove the E@H, because i was up against space limitations...

Thought that was why i wasn't getting S@H WU's, because E@H was hogging all the drive space.

But of course, turns out it was just another outage... :)
ID: 1197012 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1197015 - Posted: 18 Feb 2012, 21:35:38 UTC - in response to Message 1197012.  

Ah, well my computers are across 2 sites, i have to drive to work.

I'll look into the configuration you suggested and see if i can figure it out, when i added E@home on mine, it downloaded like 15 E@H tasks and 1 S@H task, so i had to remove the E@H, because i was up against space limitations...

Thought that was why i wasn't getting S@H WU's, because E@H was hogging all the drive space.

But of course, turns out it was just another outage... :)

Yeah, if you are tight on HD storage, that's gonna throw another wrinkle into things. Boinc's workfetch can't work quite right if it has insufficient space to store the work. Einstein does take a bit of space.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1197015 · Report as offensive
buzzard7

Send message
Joined: 20 Jul 11
Posts: 1
Credit: 419,890
RAC: 0
United States
Message 1197039 - Posted: 18 Feb 2012, 22:11:29 UTC

I'm not getting any work out of the scheduler. Anybody else having trouble?

ID: 1197039 · Report as offensive
Previous · 1 · 2 · 3 · 4 · Next

Message boards : News : Project Back Online After Overnight Outage


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.