Panic Mode On (79) Server Problems?


Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1311990 - Posted: 7 Dec 2012, 0:19:41 UTC


uploads stuck ...
ID: 1311990
Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1311993 - Posted: 7 Dec 2012, 0:28:05 UTC - in response to Message 1311990.  

uploads stuck ...


edit: only stuck via the proxy I was using ... uurrggghhhh

ID: 1311993
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Joined: 7 Mar 03
Posts: 22149
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1312050 - Posted: 7 Dec 2012, 6:28:24 UTC

You've just hit one of the snags of using a proxy: they have a habit of stopping working if you hit them too hard, too often, with one target address.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1312050
Profile S@NL Etienne Dokkum
Volunteer tester
Avatar

Joined: 11 Jun 99
Posts: 212
Credit: 43,822,095
RAC: 0
Netherlands
Message 1312058 - Posted: 7 Dec 2012, 7:09:49 UTC

What has happened ???!!!

This morning I looked at my tasks and SETI tells me I aborted all 200 tasks last night at 18:47 UTC, although they are still running, uploading and reporting. No credit for any tasks since then, and they don't show as pending or validated even after I reported them.

Has anyone else had this problem yet?
ID: 1312058
Profile zoom3+1=4
Volunteer tester
Avatar

Joined: 30 Nov 03
Posts: 65690
Credit: 55,293,173
RAC: 49
United States
Message 1312059 - Posted: 7 Dec 2012, 7:12:56 UTC - in response to Message 1312058.  

Not here. Earlier I had a Win7 BSOD, totally weird, but that was a driver problem...
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1312059
Profile Khangollo
Avatar

Joined: 1 Aug 00
Posts: 245
Credit: 36,410,524
RAC: 0
Slovenia
Message 1312081 - Posted: 7 Dec 2012, 7:50:59 UTC - in response to Message 1312058.  
Last modified: 7 Dec 2012, 7:56:31 UTC

What has happened ???!!!

This morning I looked at my tasks and SETI tells me I aborted all 200 tasks last night at 18:47 UTC, although they are still running, uploading and reporting. No credit for any tasks since then, and they don't show as pending or validated even after I reported them.

Has anyone else had this problem yet?

Yes, this is normal SETI@home behaviour now: auto-abandoning all tasks at random.
Apparently, I had a full cache of 22 Astropulse tasks abandoned yesterday (again!). This is the third time this has happened to me. It usually happens when the scheduler connection times out a lot.
Fun.

You should manually abort all those tasks now.
ID: 1312081
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1312082 - Posted: 7 Dec 2012, 7:52:39 UTC - in response to Message 1312081.  


Add to that, neither the MB nor the AP assimilator appears to be working. They both show green on the server status page, but the number waiting to be assimilated continues to climb for both.
Grant
Darwin NT
ID: 1312082
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1312084 - Posted: 7 Dec 2012, 7:58:41 UTC - in response to Message 1312081.  

What has happened ???!!!

This morning I looked at my tasks and SETI tells me I aborted all 200 tasks last night at 18:47 UTC, although they are still running, uploading and reporting. No credit for any tasks since then, and they don't show as pending or validated even after I reported them.

Has anyone else had this problem yet?

Yes, this is normal SETI@home behaviour now: auto-abandoning all tasks at random.
Apparently, I had a full cache of 22 Astropulse tasks abandoned yesterday (again!). This is the third time this has happened to me. It usually happens when the scheduler connection times out a lot.
Fun.

You should manually abort all those tasks now.

And to add to that, since the last outage I've had a whole bunch of tasks time out within minutes of getting them (one took a couple of hours, but most were within 10 minutes).
Grant
Darwin NT
ID: 1312084
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1312299 - Posted: 7 Dec 2012, 21:08:00 UTC - in response to Message 1312082.  

Add to that, neither the MB nor the AP assimilator appears to be working. They both show green on the server status page, but the number waiting to be assimilated continues to climb for both.

MB & AP assimilator backlogs continue to climb.
Grant
Darwin NT
ID: 1312299
Tom*

Joined: 12 Aug 11
Posts: 127
Credit: 20,769,223
RAC: 9
United States
Message 1312375 - Posted: 8 Dec 2012, 3:19:55 UTC
Last modified: 8 Dec 2012, 3:22:46 UTC

Since the grass needs to be cut, everything seems to be running optimally.

I know the really heavy consumers of internet traffic are busy elsewhere, but limiting internet downloads seems to help the scheduler and the download speeds. Now if we can switch over to APs with no shorties or MBs, that would be an excellent test of increasing the bandwidth from 100 Mbit/s to 200 Mbit/s.

Posing a question: can 100 Mbit/s comfortably handle just APs with no MBs, as 100 Mbit/s seems to be able to handle just MBs currently?

It's a question that begs, IMHO, to be answered.

Thanks

Bill aka Tom
ID: 1312375
Profile HAL9000
Volunteer tester
Avatar

Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1312379 - Posted: 8 Dec 2012, 3:31:06 UTC - in response to Message 1312375.  
Last modified: 8 Dec 2012, 3:33:19 UTC

Since the grass needs to be cut, everything seems to be running optimally.

I know the really heavy consumers of internet traffic are busy elsewhere, but limiting internet downloads seems to help the scheduler and the download speeds. Now if we can switch over to APs with no shorties or MBs, that would be an excellent test of increasing the bandwidth from 100 Mbit/s to 200 Mbit/s.

Posing a question: can 100 Mbit/s comfortably handle just APs with no MBs, as 100 Mbit/s seems to be able to handle just MBs currently?

It's a question that begs, IMHO, to be answered.

Thanks

Bill aka Tom

An AP task is about 20 times as much data to transfer as an MB task, but only requires about 10 times as long to process, at least on CPU. I imagine GPU processing would make the ratio even worse.
However, if that was the only work going out, it would be hard to say. Not all the machines currently running can complete an AP task in time, and some choose not to do them at all.
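A rough sketch of what those two ratios imply, using only the figures quoted above (about 20 times the data, about 10 times the CPU time). The function name and the choice to normalise against an MB task are illustrative assumptions, not project numbers.

```python
def relative_bandwidth_demand(data_ratio: float, time_ratio: float) -> float:
    """Download demand per unit of crunching time, relative to MB = 1.0.

    data_ratio: how many times more data one task needs than an MB task.
    time_ratio: how many times longer one task takes than an MB task.
    """
    if time_ratio <= 0:
        raise ValueError("time_ratio must be positive")
    return data_ratio / time_ratio


# Figures from the post: AP is ~20x the data and ~10x the CPU time of MB.
ap_vs_mb = relative_bandwidth_demand(data_ratio=20, time_ratio=10)
print(f"AP-only crunching pulls about {ap_vs_mb:.1f}x the data per CPU-hour of MB-only")
```

If GPUs shorten AP run times more than they shorten MB run times, `time_ratio` drops and the multiplier grows, which would be the "even worse" case guessed at above.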
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the BP6/VP6 User Group: http://tinyurl.com/8y46zvu
ID: 1312379
Profile ivan
Volunteer tester
Avatar

Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1312535 - Posted: 8 Dec 2012, 12:38:47 UTC - in response to Message 1312375.  

Posing a question: can 100 Mbit/s comfortably handle just APs with no MBs, as 100 Mbit/s seems to be able to handle just MBs currently?

It's a question that begs, IMHO, to be answered.

Well, as I see it, an AP WU is 8 MB and our download bandwidth peaks at 94 Mb/s; being generous, that's 10 MB/s, so we max out at 1.25 AP WUs/sec! Any more pressure than that and we get into the realm of lost ACKs and ghost workunits.
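The arithmetic above can be checked with a few lines. This is only a sketch built on the figures quoted in the post (8 MB per AP workunit, a 94 Mb/s peak); the constant and function names are made up for illustration. Note that 94 Mb/s is 11.75 MB/s before any protocol overhead, so the round 10 MB/s figure gives the more conservative of the two answers.

```python
AP_WU_SIZE_MBYTE = 8.0    # megabytes per AP workunit, as quoted in the post
PEAK_LINK_MBIT_S = 94.0   # peak download rate in megabits per second, as quoted


def workunits_per_second(throughput_mbyte_s: float, wu_size_mbyte: float) -> float:
    """How many workunits of a given size fit through the link each second."""
    return throughput_mbyte_s / wu_size_mbyte


raw_mbyte_s = PEAK_LINK_MBIT_S / 8   # 8 bits per byte, ignoring overhead
rounded_mbyte_s = 10.0               # the round figure used in the post

print(f"Rounded: {workunits_per_second(rounded_mbyte_s, AP_WU_SIZE_MBYTE):.2f} AP WUs/sec")
print(f"Raw:     {workunits_per_second(raw_mbyte_s, AP_WU_SIZE_MBYTE):.2f} AP WUs/sec")
```

Either way the ceiling sits somewhere between roughly 1.25 and 1.5 AP workunits per second, and that is with nothing else sharing the link.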
ID: 1312535
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1312565 - Posted: 8 Dec 2012, 13:41:31 UTC
Last modified: 8 Dec 2012, 13:42:51 UTC

It's a simple question of numbers...

If, with MB only, the graphs show about 75% average usage of the available bandwidth, and AP uses approximately the same bandwidth... need I say anything else?

What they really need is to split the work across two different lines (100 Mbps each) or... finally put the 1 Gbps link to work... The current configuration, with MB and AP running at the same time, reached its full capacity a long time ago.

In business (OK, SETI is not a business, but the rule applies in the same manner), every IT person knows you should never use more than 3/4 of your total bandwidth on average, or... you will start to lose clients. Besides keeping your existing clients, you need the capacity to take on new ones.

Any other solution, if it works, will only work for a short time; too many new, powerful GPUs are expected to come and crunch in the next months/years...
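To put numbers on that, here is a minimal sketch that assumes the figures in this post hold: MB alone averages about 75% of a 100 Mbps link, AP would need about the same again, and average usage should stay at or below 3/4 of capacity. The constant and function names are hypothetical.

```python
LINK_MBPS = 100.0             # current link size, per the thread
MB_AVG_UTILISATION = 0.75     # MB-only average usage, per the post
AP_AVG_UTILISATION = 0.75     # "approximately the same", per the post
MAX_SAFE_UTILISATION = 0.75   # the 3/4 rule of thumb from the post


def required_capacity_mbps(load_mbps: float, max_utilisation: float) -> float:
    """Smallest link that carries load_mbps without exceeding max_utilisation."""
    return load_mbps / max_utilisation


# Average load if MB and AP both run, expressed in Mbps.
combined_load = LINK_MBPS * (MB_AVG_UTILISATION + AP_AVG_UTILISATION)
needed = required_capacity_mbps(combined_load, MAX_SAFE_UTILISATION)

print(f"Combined average load: {combined_load:.0f} Mbps")
print(f"Capacity needed under the 3/4 rule: {needed:.0f} Mbps")
print(f"Utilisation on a 1 Gbps link: {combined_load / 1000:.0%}")
```

Under these assumptions the combined load is 150 Mbps, so two 100 Mbps lines would sit exactly at the 3/4 threshold with no room for growth, while a 1 Gbps link would run at about 15%.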
ID: 1312565
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1313177 - Posted: 9 Dec 2012, 18:08:59 UTC - in response to Message 1312565.  


Assimilator backlogs continue to grow.
Grant
Darwin NT
ID: 1313177
Profile zoom3+1=4
Volunteer tester
Avatar

Joined: 30 Nov 03
Posts: 65690
Credit: 55,293,173
RAC: 49
United States
Message 1313242 - Posted: 9 Dec 2012, 19:56:45 UTC - in response to Message 1313177.  


Assimilator backlogs continue to grow.

That's probably cause the Borg Drones the project got are experiencing traffic, maybe. ;) Merry XMAS Grant.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1313242
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1314324 - Posted: 12 Dec 2012, 18:16:05 UTC

Something's afoot...
Server status now shows database is not accessible.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1314324
Mark Stevenson Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Joined: 8 Sep 11
Posts: 1736
Credit: 174,899,165
RAC: 91
United Kingdom
Message 1314348 - Posted: 12 Dec 2012, 18:39:22 UTC

The SSP comes up OK here, but I have been having trouble reporting tasks since last night. They got through eventually, and I got some more tasks (mostly ghosts). Hope there isn't a problem developing, but if there is, we'll get through like always.
Life is what you make of it :-)

When i'm good i'm very good , but when i'm bad i'm shi#eloads better ;-) In't I " buttercups " p.m.s.l at authoritie !!;-)
ID: 1314348
Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1314425 - Posted: 12 Dec 2012, 21:45:56 UTC


Has anyone else noticed that their Valid tasks are increasing?
ID: 1314425
bill

Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1314431 - Posted: 12 Dec 2012, 21:54:28 UTC - in response to Message 1314425.  

and the problem with that is?
ID: 1314431
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14645
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1314437 - Posted: 12 Dec 2012, 22:06:43 UTC - in response to Message 1314431.  

and the problem with that is?

That it appears that the db_purge.x86_64 daemon has joined its comrades in the sah_assimilator union - and gone on strike.
ID: 1314437
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.