Panic Mode On (56) Server problems?

Message boards : Number crunching : Panic Mode On (56) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · Next

AuthorMessage
__W__
Avatar

Send message
Joined: 28 Mar 09
Posts: 116
Credit: 5,943,642
RAC: 0
Germany
Message 1157631 - Posted: 1 Oct 2011, 0:00:33 UTC - in response to Message 1157625.  
Last modified: 1 Oct 2011, 0:09:45 UTC

Perhaps the problem is not with SETI at the moment. A traceroute from the US east coast dies at paix.he.net (198.32.176.20). It could be a Hurricane Electric problem again. Who knows.

I don't know, but i saw similar things about 5 houres ago, when i checked the connection to seti servers from some lookingglass servers around the world - the traceroute packets on the last hopes often cycle around on two or three IPs like a merry-go-round before the reached Berkely - if they reach :-( .
My guess - a router/connection problem.
AND I DON'T WANT TO HEAR SOMETHING ABOUT YELLOW ... ;-)



__W__

Edit: Was something broken by the Earthquake "Magnitude 3.1 - SAN FRANCISCO BAY AREA, CALIFORNIA"
2011 September 29 23:47:54 UTC ? Source: http://earthquake.usgs.gov/earthquakes/recenteqsus/Quakes/nc71655651.html
_______________________________________________________________________________
ID: 1157631 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1157698 - Posted: 1 Oct 2011, 2:48:43 UTC

We're baaaaaaaaaaaaaaaaack.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1157698 · Report as offensive
Profile Dimly Lit Lightbulb 😀
Volunteer tester
Avatar

Send message
Joined: 30 Aug 08
Posts: 15401
Credit: 7,423,413
RAC: 1
United Kingdom
Message 1157703 - Posted: 1 Oct 2011, 3:02:39 UTC

Woooooooooohoooooooooo! I'm getting tasks. The problem being that for a shorty it's at 13 minutes. The crunches time will be about 45 mins. Has DA put another fix in the works.
ID: 1157703 · Report as offensive
Profile Lint trap

Send message
Joined: 30 May 03
Posts: 871
Credit: 28,092,319
RAC: 0
United States
Message 1157704 - Posted: 1 Oct 2011, 3:05:17 UTC - in response to Message 1157702.  

Yeah, 93 reported and only 1 downloaded...



That's Great, Vic!!

It gives me hope that someday soon I might be able to u/l and d/l cleanly again. I was able to Update a few mins ago, and that's a Big Improvement.

Lt

ID: 1157704 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1157708 - Posted: 1 Oct 2011, 3:09:10 UTC

Since I still have the flops in my app_info, my work is still fairly close to run time.

ID: 1157708 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1157721 - Posted: 1 Oct 2011, 3:59:54 UTC

Well, that didn't last long.
All of a sudden the rigs can't connect again.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1157721 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1157723 - Posted: 1 Oct 2011, 4:03:41 UTC - in response to Message 1157698.  

We're baaaaaaaaaaaaaaaaack.

Occasionally.
For a while there i could upload- now it's a case of being able to upoload for a few minuts, then not being able to for 15min+
Won't be able to get any new work till all the previous work has been returned. Could take the rest of the day, unless things improve or die completely again.
Grant
Darwin NT
ID: 1157723 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19550
Credit: 40,757,560
RAC: 67
United Kingdom
Message 1157724 - Posted: 1 Oct 2011, 4:04:42 UTC - in response to Message 1157721.  
Last modified: 1 Oct 2011, 4:06:07 UTC

Well, that didn't last long.
All of a sudden the rigs can't connect again.

I was just going to ask all you American cousins to call it a night and allow us Europeans a chance to do some uploading. I've never seen uploading so slow before, 3m:54s to upload one task.

Got 11 uploaded, over a 100 left to do before I can even think of re-filling. If no go will be out of work by lunchtime.

edit] and it does seem like another nosedive on the cricket graphs.
ID: 1157724 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1157725 - Posted: 1 Oct 2011, 4:09:56 UTC - in response to Message 1157724.  

Well, that didn't last long.
All of a sudden the rigs can't connect again.

I was just going to ask all you American cousins to call it a night and allow us Europeans a chance to do some uploading. I've never seen uploading so slow before, 3m:54s to upload one task.

Got 11 uploaded, over a 100 left to do before I can even think of re-filling. If no go will be out of work by lunchtime.

edit] and it does seem like another nosedive on the cricket graphs.

Yup....it's dead, Jim.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1157725 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1157727 - Posted: 1 Oct 2011, 4:14:38 UTC

Oh well, It was nice while it lasted :-(
ID: 1157727 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 37595
Credit: 261,360,520
RAC: 489
Australia
Message 1157728 - Posted: 1 Oct 2011, 4:14:54 UTC - in response to Message 1157725.  

Yep, the Cricket Graph took another nose dive again. :(

Cheers.
ID: 1157728 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19550
Credit: 40,757,560
RAC: 67
United Kingdom
Message 1157743 - Posted: 1 Oct 2011, 5:27:40 UTC

Comms now back up, well I have uploaded, waiting to see if I can report/request. First attempt failed.

As I type reported ok, but no tasks available.
ID: 1157743 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19550
Credit: 40,757,560
RAC: 67
United Kingdom
Message 1157769 - Posted: 1 Oct 2011, 6:04:37 UTC - in response to Message 1157752.  

I'm full as Seti said the following:

1314 SETI@home 9/30/2011 10:37:12 PM This computer has reached a limit on tasks in progress

Congratulations, now plse, suspend comms, and let me get some, my cpu will start going cold in under 1 hour.
ID: 1157769 · Report as offensive
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 1157794 - Posted: 1 Oct 2011, 7:42:02 UTC

I am receiving some new tasks (although not enough to fill my cache). Problem is the estimated computation time for these new tasks. I'm running Lunatics optimized CUDA on my GT240 and the 'shorties' take slightly less than 5 minutes to complete. The new tasks I am receiving have an estimated computation time of only 58 seconds. This probably means they will error out after twice that time.
I know there have been server side changes and I was able to solve similar problems two weeks ago by removing the FLOPS entry from my app_info file.
Now something seems to have changed once again.
Is there anything I can do to prevent the tasks from being aborted because they take too long to process ? I would hate to waste the small number of tasks I do receive ...

Tom

ID: 1157794 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1157799 - Posted: 1 Oct 2011, 8:31:02 UTC - in response to Message 1157794.  

I am receiving some new tasks (although not enough to fill my cache). Problem is the estimated computation time for these new tasks. I'm running Lunatics optimized CUDA on my GT240 and the 'shorties' take slightly less than 5 minutes to complete. The new tasks I am receiving have an estimated computation time of only 58 seconds. This probably means they will error out after twice that time.
I know there have been server side changes and I was able to solve similar problems two weeks ago by removing the FLOPS entry from my app_info file.
Now something seems to have changed once again.
Is there anything I can do to prevent the tasks from being aborted because they take too long to process ? I would hate to waste the small number of tasks I do receive ...

Tom

Don't worry. Tasks only error out if they take ten times as long as expected.

The current estimates are, deliberately, five times too short, to avoid reaching that error limit. Just let them run normally. As soon as the first "58 second" task has completed - which will take the normal five minutes - BOINC will realise what's going on and adjust the estimates for the remaining new tasks. You will, temporarily, have a larger cache than you expected, but again - and again deliberately - the quota limit is in place to prevent things getting out of hand.
ID: 1157799 · Report as offensive
Profile S@NL Etienne Dokkum
Volunteer tester
Avatar

Send message
Joined: 11 Jun 99
Posts: 212
Credit: 43,822,095
RAC: 0
Netherlands
Message 1157808 - Posted: 1 Oct 2011, 8:42:31 UTC

well, at least I reported everything now, but as no AP's seem to get validated there are more problems on the way I suppose...

The machine I left running last night didn't report anything on it's own, but had to be "helped" this morning.

Still enough work though as I was so lucky to get almost 30 AP's thursday...
ID: 1157808 · Report as offensive
Profile S@NL Etienne Dokkum
Volunteer tester
Avatar

Send message
Joined: 11 Jun 99
Posts: 212
Credit: 43,822,095
RAC: 0
Netherlands
Message 1157810 - Posted: 1 Oct 2011, 8:46:26 UTC - in response to Message 1157794.  

I am receiving some new tasks (although not enough to fill my cache). Problem is the estimated computation time for these new tasks. I'm running Lunatics optimized CUDA on my GT240 and the 'shorties' take slightly less than 5 minutes to complete. The new tasks I am receiving have an estimated computation time of only 58 seconds. This probably means they will error out after twice that time.
I know there have been server side changes and I was able to solve similar problems two weeks ago by removing the FLOPS entry from my app_info file.
Now something seems to have changed once again.
Is there anything I can do to prevent the tasks from being aborted because they take too long to process ? I would hate to waste the small number of tasks I do receive ...

Tom


sort of similar problem here as I have 1 AP left from when the estimates were made too high. Now this WU has a est. time of 108 hours when it just takes 12 to 13 hours to process on my CPU. At least all the other WU's seem to be "back to normal" now...
ID: 1157810 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1157816 - Posted: 1 Oct 2011, 9:05:09 UTC - in response to Message 1157794.  
Last modified: 1 Oct 2011, 9:08:57 UTC

the 'shorties' take slightly less than 5 minutes to complete. The new tasks I am receiving have an estimated computation time of only 58 seconds.

For me the estimates are all over the place. The DCF is moving around between 0.7 & 1.5. As each GPU tasks complete, their ridiculously long completion times slowly drop, making the almost correct CPU times drop as well. They get down to about half of the actual completion time is when one finally completes & the estimates get bumped up; pushing the GPU task completion times to new heights of ridiculousness.
Hopefully Seti can stay up for the next few days & things will start to settle down.



Although things are still looking bit broken- doesn't look as though AP work is going out. And a lot of the requests for work result in none. Sometimes i get 1 or 2 WUs, occasionally i'll get 20+. But mostly it's "Project has no tasks available".
Grant
Darwin NT
ID: 1157816 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 1157819 - Posted: 1 Oct 2011, 9:52:11 UTC


Now i'm not getting any response from the Scheduler.
Grant
Darwin NT
ID: 1157819 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1157826 - Posted: 1 Oct 2011, 10:18:35 UTC

Everything's still working at the moment....
The kitties are in there scrapping for anything the servers can send.
Not getting a lot, but enough to keep things a bit warmer in the crunching den.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1157826 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · Next

Message boards : Number crunching : Panic Mode On (56) Server problems?


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.