Panic Mode On (101) Server Problems?

Message boards : Number crunching : Panic Mode On (101) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 27 · Next

AuthorMessage
Profile William
Volunteer tester
Avatar

Send message
Joined: 14 Feb 13
Posts: 2037
Credit: 17,689,662
RAC: 0
Message 1746269 - Posted: 2 Dec 2015, 11:09:47 UTC - in response to Message 1746246.  

edit: why do I keep running into buglets? try hitting quote or reply when you are _not_ logged in -> https://setiathome.berkeley.edu//extra_arg_post.html 404 not found. mhm. extra / there?

This is now fixed, with thanks to David looking into that. The HTTP->HTTPS links could be a side-effect of me using Firefox 42.0 to test. Perhaps that it overzealously tries to put all links to HTTPS.

Edit: no, the HTTP->HTTPS thing is what everyone will now see, as log in has been made HTTPS for everyone. However, when coming to the forums using HTTP, it'll return to HTTP after log in. New code.

Thanks :) was probably fallout from the http https conversion/debate I guess.
A person who won't read has no advantage over one who can't read. (Mark Twain)
ID: 1746269 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1746358 - Posted: 2 Dec 2015, 19:23:38 UTC - in response to Message 1746262.  

SSP is starting to look a bit scary when it comes to tapes in queue ...


Oddly, a couple of the AP splitters keep popping online, but nothing AP-wise is being split.

P.

This may be because the data has already been split from that particular tape that we are really running for MB work
ID: 1746358 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1746522 - Posted: 3 Dec 2015, 13:39:01 UTC

Is there a reason I'm getting no CPU work ?

my log file says this


4/12/2015 12:00:35 AM | SETI@home | Requesting new tasks for CPU
4/12/2015 12:00:39 AM | SETI@home | Scheduler request completed: got 0 new tasks
4/12/2015 12:00:39 AM | SETI@home | No tasks sent
4/12/2015 12:00:39 AM | SETI@home | No tasks are available for SETI@home v7
4/12/2015 12:00:39 AM | SETI@home | Tasks for NVIDIA GPU are available, but your preferences are set to not accept them
4/12/2015 12:00:39 AM | SETI@home | Tasks for AMD/ATI GPU are available, but your preferences are set to not accept them
4/12/2015 12:00:39 AM | SETI@home | Tasks for Intel GPU are available, but your preferences are set to not accept them


I have even set the preferences to only send me MB CPU work as I have no work for the CPU at all neither AP's or MB's
ID: 1746522 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1746540 - Posted: 3 Dec 2015, 15:09:57 UTC

I think this thread is drifting a little way away from server problems, no?
ID: 1746540 · Report as offensive
Profile Gordon Lowe
Avatar

Send message
Joined: 5 Nov 00
Posts: 12094
Credit: 6,317,865
RAC: 0
United States
Message 1746553 - Posted: 3 Dec 2015, 15:56:49 UTC

There's nothing wrong with off-topic commentary in the course of the discussion, but the last few posts have been getting a little cranky.
The mind is a weird and mysterious place
ID: 1746553 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1746563 - Posted: 3 Dec 2015, 16:44:56 UTC - in response to Message 1746553.  

There's nothing wrong with off-topic commentary in the course of the discussion, but the last few posts have been getting a little cranky.


Yes they have and I would like them to stop.

ID: 1746563 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1747000 - Posted: 5 Dec 2015, 6:32:51 UTC - in response to Message 1746581.  

Is there a problem with the MB splitters or just a slow down?


2 days later & there's a problem. 3 of them are not running so output has dropped by almost half. Get work while you can.
Grant
Darwin NT
ID: 1747000 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22202
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1747006 - Posted: 5 Dec 2015, 8:15:04 UTC

...what is more worrying is that the current batch of tapes being split contain errors - is this a tape problem or is it a splitter problem?
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1747006 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1747017 - Posted: 5 Dec 2015, 10:07:03 UTC - in response to Message 1747006.  

...what is more worrying is that the current batch of tapes being split contain errors - is this a tape problem or is it a splitter problem?

Definitely a mess. Looks like about 15 min worth of cache being maintained.
ID: 1747017 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1747035 - Posted: 5 Dec 2015, 12:31:30 UTC

Now all channels have errors. I've never seen that before.
ID: 1747035 · Report as offensive
ChrisD
Volunteer tester

Send message
Joined: 25 Sep 99
Posts: 158
Credit: 2,496,342
RAC: 0
Denmark
Message 1747037 - Posted: 5 Dec 2015, 12:33:55 UTC
Last modified: 5 Dec 2015, 12:34:23 UTC

24fe11aa.5320.646.8.12.175_0 and 138_1 both errored out after approx. 18 min.

Is this the WU's mentioned earlier.

I have 3 more in that series, do You think I should abort these?

ChrisD
ID: 1747037 · Report as offensive
Profile Dr Grey

Send message
Joined: 27 May 99
Posts: 154
Credit: 104,147,344
RAC: 21
United Kingdom
Message 1747048 - Posted: 5 Dec 2015, 13:25:11 UTC

Starting to run low here. Perhaps they should try turning it off and on again?
ID: 1747048 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1747049 - Posted: 5 Dec 2015, 13:26:15 UTC

Definitely something is not happy in splitterland.
Messages sent to Eric, Matt, and Jeff.
Hopefully somebody can have a poke at it.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1747049 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1747052 - Posted: 5 Dec 2015, 13:48:21 UTC

It's time to hit that 'special spot' on the offending server with Maxwel's Silver Hammer.....if it's not dead, it's close.....

"Sour Grapes make a bitter Whine." <(0)>
ID: 1747052 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1747053 - Posted: 5 Dec 2015, 13:58:39 UTC

At least the channels are erroring out, and not sending out crap like it did before.

From the output, it looks to me like only resends are being sent out.
ID: 1747053 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1747055 - Posted: 5 Dec 2015, 14:01:44 UTC - in response to Message 1747037.  

24fe11aa.5320.646.8.12.175_0 and 138_1 both errored out after approx. 18 min.

Is this the WU's mentioned earlier.

I have 3 more in that series, do You think I should abort these?

ChrisD

"Reason: Integer Divide by Zero (0xc0000094)"

That certainly sounds like something dodgy in the data, and outside our control. But I wouldn't abort anything yet, until we have a better picture of what's wrong and how widespread it is.
ID: 1747055 · Report as offensive
ChrisD
Volunteer tester

Send message
Joined: 25 Sep 99
Posts: 158
Credit: 2,496,342
RAC: 0
Denmark
Message 1747058 - Posted: 5 Dec 2015, 14:16:33 UTC - in response to Message 1747055.  
Last modified: 5 Dec 2015, 14:23:08 UTC

"Reason: Integer Divide by Zero (0xc0000094)"

That certainly sounds like something dodgy in the data, and outside our control. But I wouldn't abort anything yet, until we have a better picture of what's wrong and how widespread it is.


OK with that, but my cache holds just 29 tasks each taking 4 mins to complete.

BOINC refuses to fetch more work as long as any task is suspended, so in 2 Hours, I will have to decide what to do.

But, If You think I can help by processing the remaining 3 tasks, I will do that.

ChrisD

EDIT: Just finished another of these, one that I did not discover and suspended.
So I'll abort the 3.
ID: 1747058 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1747059 - Posted: 5 Dec 2015, 14:37:37 UTC - in response to Message 1747058.  
Last modified: 5 Dec 2015, 15:09:54 UTC

29 tasks at 4 minutes each? That looks like you'll have run out before sunrise in California, whatever you do. If you have a backup project, switch to it now: otherwise, I'd let them run, purely to find out whether they fail or not: that's about the only help we can provide to the staff at this point.

Did anyone happen to notice which was the first tape to show errors during splitting, on the SSP? The first one I actually checked when I got up this morning was 24mr11ae, but that was after Rob had already posted. I'll maybe promote the one(s) I have from that tape, and see if they're affected too.

Edit - I have 24fe11aa.5320.8417.8.12.221 from a little further down the same tape as yours, running now. We'll see how it gets on.
ID: 1747059 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1747066 - Posted: 5 Dec 2015, 15:54:31 UTC - in response to Message 1747059.  
Last modified: 5 Dec 2015, 16:21:11 UTC

Well, my mine canary finished OK (late overflow): 24fe11aa.5320.8417.8.12.221_1 (and now validated against Linux/ATI)

But that was with the NVidia app. Your ATI build may react differently to whatever the problem is.
ID: 1747066 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1747074 - Posted: 5 Dec 2015, 16:24:32 UTC

Hmmmm....
Something's afoot. All of a sudden there are a bunch more channels showing on the SSP for MB splitting.
Hopefully that means on of da boyz in da lab has their fingers in the works.

Meow.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1747074 · Report as offensive
Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 27 · Next

Message boards : Number crunching : Panic Mode On (101) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.