Panic Mode On (46) Server problems

Profile arkayn
Volunteer tester
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1091556 - Posted: 30 Mar 2011, 5:50:43 UTC

Seems an appropriate time to start the next version.

ID: 1091556 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1091726 - Posted: 30 Mar 2011, 21:06:54 UTC - in response to Message 1091723.  
Last modified: 30 Mar 2011, 21:10:37 UTC

And as of 30 Mar 2011, 21:00:09 UTC, they're still Not Running or Disabled according to the server page.
But I still got work: AP, MB, and MB-CUDA, plus Einstein and some others as well.

No new work, until this is fixed, IMHO.
ID: 1091726 · Report as offensive
B-Man
Volunteer tester

Joined: 11 Feb 01
Posts: 253
Credit: 147,366
RAC: 0
United States
Message 1091745 - Posted: 30 Mar 2011, 22:11:20 UTC

Oh no, panic!!!!!! I am down to 9 more hours with task switching. Whatever will I do????????

I guess I will just try to keep breathing. Other projects will take up the slack if they can't fix it in the next day or so.
ID: 1091745 · Report as offensive
Profile perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1091761 - Posted: 30 Mar 2011, 23:10:25 UTC - in response to Message 1091745.  

AAKKKKKKKKKKKKKKKKK!!! I got a herd of work but they are all shorties!!!! Man, it won't take any time at all to run through them.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1091761 · Report as offensive
SciManStev (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1091769 - Posted: 30 Mar 2011, 23:26:05 UTC

I can't believe it! I finally filled up. I am loaded with AP, MB, and CUDA WUs. It's been a long time since I had a full cache.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1091769 · Report as offensive
Profile Wiggo
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1091771 - Posted: 30 Mar 2011, 23:36:03 UTC - in response to Message 1091769.  

I must be one of the lucky ones, as my 3 PCs have been able to top up at the right times, so I don't have any complaints. :)

Cheers.
ID: 1091771 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1091779 - Posted: 31 Mar 2011, 0:20:08 UTC

Mine have stayed pretty much filled for the past 2-3 weeks.

I say pretty much because every now and then, one AP runs 50% longer for no apparent reason and throws the DCF waaaay out, and then BOINC thinks I have 12 days of cache when it is really only ~8. Takes about a dozen tasks to complete with a normal duration for the DCF estimates to come back down to a reasonable number..and then it happens again.

So pretty much, every time my machine does actually ask for work, it usually gets a task or two.
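The DCF swing described above can be sketched roughly in Python. This is a simplified model, not BOINC's actual client code: the function name `update_dcf`, the 0.1 smoothing value, and the 80,000-second raw estimate are all assumptions picked for illustration. The idea is only that the factor jumps up at once when a task overruns and drifts back down slowly when tasks finish on time.

```python
def update_dcf(dcf, actual_secs, raw_estimate_secs, smoothing=0.1):
    """Return an updated duration correction factor (simplified, assumed rule)."""
    ratio = actual_secs / raw_estimate_secs
    if ratio > dcf:
        # An overrunning task raises the factor immediately.
        return ratio
    # Otherwise move only a small step toward the observed ratio.
    return (1.0 - smoothing) * dcf + smoothing * ratio


def simulate(normal_tasks_after_overrun=12, real_cache_days=8.0):
    """One task runs 50% long, then a run of tasks finishes on estimate."""
    raw_estimate = 80_000.0  # hypothetical uncorrected AP estimate, in seconds
    dcf = 1.0

    # The slow task: 50% longer than the raw estimate.
    dcf = update_dcf(dcf, 1.5 * raw_estimate, raw_estimate)
    inflated_days = real_cache_days * dcf

    # Tasks with a normal duration pull the factor back down gradually.
    for _ in range(normal_tasks_after_overrun):
        dcf = update_dcf(dcf, raw_estimate, raw_estimate)
    return inflated_days, dcf


if __name__ == "__main__":
    inflated, final_dcf = simulate()
    print(f"cache estimate right after the slow task: {inflated:.1f} days")
    print(f"DCF after 12 normal tasks: {final_dcf:.2f}")
```

Under these assumed numbers, an 8-day cache reads as 12 days right after the slow task, and the factor is still about 1.14 a dozen normal tasks later.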
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1091779 · Report as offensive
Mike (Special Project $75 donor)
Volunteer tester
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1091859 - Posted: 31 Mar 2011, 6:45:03 UTC - in response to Message 1091779.  
Last modified: 31 Mar 2011, 6:45:53 UTC

Mine have stayed pretty much filled for the past 2-3 weeks.

I say pretty much because every now and then, one AP runs 50% longer for no apparent reason and throws the DCF waaaay out, and then BOINC thinks I have 12 days of cache when it is really only ~8. Takes about a dozen tasks to complete with a normal duration for the DCF estimates to come back down to a reasonable number..and then it happens again.

So pretty much, every time my machine does actually ask for work, it usually gets a task or two.


It has a reason.
AP run times depend largely on blanking.
One of your results is 93% blanked, so it took 30,000 seconds longer.
That's pretty normal.


With each crime and every kindness we birth our future.
ID: 1091859 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1091907 - Posted: 31 Mar 2011, 10:47:11 UTC

That used to be true (technically, it still is, but the difference is usually less than 5,000 seconds). Since the pre-blanked WUs started appearing, the observed effect has been completely random. I have a spreadsheet of every AP I've ever done while using the optimized apps, and there's no correlation between a task running significantly longer and any aspect of the WU itself (amount of blanking or number of pulses found).

For example..

A while back, I had a task that had zero blanking, yet took 122,000 seconds. 14 WUs later, another zero blanked task took 88,000 seconds.

Conversely, I have one that had 91.06% blanked and took 96,000 seconds, whereas that recent one has 92.72% and took 121,000 seconds. It's random.
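A quick way to check a "no correlation" claim like this is to compute a Pearson coefficient over (blanking %, run time) pairs. The sketch below uses only the four tasks quoted in this post, which is far too few to prove anything; the full spreadsheet is not shown here, and the helper name `pearson_r` is just an illustration.

```python
from math import sqrt

# (percent blanked, run time in seconds), the four examples quoted above
samples = [
    (0.0, 122_000),
    (0.0, 88_000),
    (91.06, 96_000),
    (92.72, 121_000),
]


def pearson_r(pairs):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    print(f"r = {pearson_r(samples):.2f}")
```

For these four points r comes out at roughly 0.12, which fits the "it's random" observation, though a real check would need every row of the spreadsheet.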

I've talked to Josef about it and there's a theory that it is one of the optimizations that started in r292 and can suffer tremendously from heavy L1/L2 cache usage by other processes. Just a theory. The other theory is that it's probably a case of what has been known for a while about MB being on multiple cores and having bottleneck issues without some other kind of task running as well.
ID: 1091907 · Report as offensive
SciManStev (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1092110 - Posted: 1 Apr 2011, 2:17:32 UTC

It seems the scheduler is disabled, and the cricket graph output has dropped.

Steve
ID: 1092110 · Report as offensive
bill

Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1092116 - Posted: 1 Apr 2011, 2:47:16 UTC - in response to Message 1092110.  

It seems the scheduler is disabled, and the cricket graph output has dropped.

Steve


Is it Waily! Waily! time?
ID: 1092116 · Report as offensive
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1092120 - Posted: 1 Apr 2011, 2:51:01 UTC - in response to Message 1092116.  

It seems the scheduler is disabled, and the cricket graph output has dropped.

Steve


Is it Waily! Waily! time?

Nah. Time to relax, go have a beer (or your beverage of choice), and wait until morning in Berkeley.
If they don't get it fixed by knock-off time on Friday, then you may want to be concerned.
Donald
Infernal Optimist / Submariner, retired
ID: 1092120 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1092168 - Posted: 1 Apr 2011, 6:28:13 UTC

Methinks a disk probably punted itself from an array and caused everything to lock up, and the scheduler was remotely disabled. That's my guess.
ID: 1092168 · Report as offensive
Profile Wiggo
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1092209 - Posted: 1 Apr 2011, 11:09:00 UTC - in response to Message 1092168.  

Well I'll be ok for a while yet before I start panicking. ;)

Cheers.
ID: 1092209 · Report as offensive
Profile HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1092216 - Posted: 1 Apr 2011, 13:43:58 UTC - in response to Message 1092215.  

Something's up - logging in here is like stirring treacle ...

Frozen treacle with raisins I'd say.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the BP6/VP6 User Group: http://tinyurl.com/8y46zvu
ID: 1092216 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 1092218 - Posted: 1 Apr 2011, 16:19:08 UTC

Any ideas as to what went wrong?
ID: 1092218 · Report as offensive
Cruncher-American (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)

Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1092219 - Posted: 1 Apr 2011, 16:20:41 UTC - in response to Message 1092218.  

Any ideas as to what went wrong?


Something broke, I think.

ID: 1092219 · Report as offensive
SciManStev (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1092220 - Posted: 1 Apr 2011, 16:24:34 UTC

It seems the only strange thing is that RAC is like a basketball. I can live with that, as long as the search for ET continues.

Steve
ID: 1092220 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 1092226 - Posted: 1 Apr 2011, 16:57:33 UTC

The scheduling server is off-line...

Hello, from Albany, CA!...
ID: 1092226 · Report as offensive
Profile HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1092238 - Posted: 1 Apr 2011, 17:40:35 UTC - in response to Message 1092226.  

The scheduling server is off-line...

Perhaps it was the one mucking things up last night.

Seems the cricket went flat at 00:00 UTC. Then around 12:00 UTC all the web stuff went wonky for a while.
ID: 1092238 · Report as offensive
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.