Panic Mode On (77) Server Problems?

Message boards : Number crunching : Panic Mode On (77) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1300956 - Posted: 1 Nov 2012, 10:54:56 UTC - in response to Message 1300952.  


Still getting nothing but Scheduler timeouts.
Grant
Darwin NT
ID: 1300956 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1300984 - Posted: 1 Nov 2012, 13:44:13 UTC - in response to Message 1300956.  
Last modified: 1 Nov 2012, 13:44:23 UTC


Still getting nothing but Scheduler timeouts.

I've set NNT and limited <max_tasks_reported> to five, and i am managing to report tasks, but not on every attempt.

Claggy
ID: 1300984 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1300987 - Posted: 1 Nov 2012, 14:20:53 UTC

I think they need to put a throttle on the number of APs that can go out at one time.

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1300987 · Report as offensive
Profile James Sotherden
Avatar

Send message
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1301010 - Posted: 1 Nov 2012, 16:23:17 UTC

The pipes sure are clogged up. Im downloading at a whopping 0.82 kbs.
Wish I could report work.
[/quote]

Old James
ID: 1301010 · Report as offensive
GaryG
Avatar

Send message
Joined: 17 Mar 12
Posts: 8
Credit: 2,593,273
RAC: 0
United States
Message 1301013 - Posted: 1 Nov 2012, 16:49:41 UTC

Since the MB's crunch for long periods without any AP's available, perhaps the AP's should be released more slowly to relieve the congestion.
ID: 1301013 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13715
Credit: 208,696,464
RAC: 304
Australia
Message 1301035 - Posted: 1 Nov 2012, 18:08:23 UTC - in response to Message 1301013.  
Last modified: 1 Nov 2012, 18:08:54 UTC

Overnight one of my systems was able to report some work. Once. Every other attempt has resulted in a Scheduler timeout. My other system has had nothing but Scheduler timeouts.
Something is seriously wrong.
Grant
Darwin NT
ID: 1301035 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22149
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1301042 - Posted: 1 Nov 2012, 18:18:05 UTC

Very strange....
Although recent attempts to report work have resulted in the following set of messages:
01/11/2012 18:12:03 Project communication failed: attempting access to reference site
01/11/2012 18:12:03 SETI@home Scheduler request failed: Timeout was reached
01/11/2012 18:12:05 Internet access OK - project servers may be temporarily down.

Tasks are being reported as I'm getting tasks returned, validated and credited, I'm not getting the "returned receipt" message to clear them from my hosts...
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1301042 · Report as offensive
Profile Khangollo
Avatar

Send message
Joined: 1 Aug 00
Posts: 245
Credit: 36,410,524
RAC: 0
Slovenia
Message 1301054 - Posted: 1 Nov 2012, 18:46:46 UTC
Last modified: 1 Nov 2012, 18:54:39 UTC

One of my hosts is getting nothing but scheduler timeouts and http errors. Uploads work fine.
The only difference between this and other host is it's on a different network using different ISP.
Scheduler requests DO get through to server, but no response is received. This host has now probably collected more "lost tasks" than it could ever process in time.

Edit: All my tasks suddenly got marked as "Abandoned" on server (all of them!!) but I didn't touch the boinc in any way and it still has its cache and crunches!
http://setiathome.berkeley.edu/results.php?hostid=5323998
Brilliant.
ID: 1301054 · Report as offensive
Keith White
Avatar

Send message
Joined: 29 May 99
Posts: 392
Credit: 13,035,233
RAC: 22
United States
Message 1301059 - Posted: 1 Nov 2012, 19:05:52 UTC - in response to Message 1301054.  

My last successful scheduler request was at 5:24 am EDT, also got resent 8 lost tasks when that happened. Before that it was 6:58 pm EDT on the 31st with failures in between and since. Currently have 22 Ghosts out there, since the 5:24 am update.
"Life is just nature's way of keeping meat fresh." - The Doctor
ID: 1301059 · Report as offensive
alan
Avatar

Send message
Joined: 18 Feb 00
Posts: 131
Credit: 401,606
RAC: 0
United Kingdom
Message 1301064 - Posted: 1 Nov 2012, 19:13:32 UTC

No problem with reporting here. Of course I'm not reporting more than one or two at a time, being a very small cruncher. Two AP's downloaded, but I can't get any more even though the server status shows there are 7000 available. I think they've been put on hold at the servers.

The thing is, larger tasks ought to be better users of the bandwidth, as you have less time establishing contact with the client and so on. For some reason SETI@Home doesn't gain that advantage.
ID: 1301064 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22149
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1301074 - Posted: 1 Nov 2012, 20:19:41 UTC

Certainly something amiss in the server closet - the query rate has climbed to about 3000/s instead of the usual few hundred....
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1301074 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1301077 - Posted: 1 Nov 2012, 20:23:47 UTC - in response to Message 1301074.  

Certainly something amiss in the server closet - the query rate has climbed to about 3000/s instead of the usual few hundred....

Probably just the daily stat dump being generated.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1301077 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1301092 - Posted: 1 Nov 2012, 21:46:23 UTC

Very difficult to report here, and the more tasks to report the more difficult it is. When I did manage to report, the only new WUs I got were shorties. Combination of APs and shorties makes for a perfect storm and everything grinds to a halt.
ID: 1301092 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1301102 - Posted: 1 Nov 2012, 23:14:45 UTC
Last modified: 1 Nov 2012, 23:24:20 UTC

Starting to get a bunch of APs allocated to me. Download speeds increased by a factor of 10 for me sometime today to mid-20KB/sec range. Getting a lot of "resent lost task" to go with all of the "timeout was reached" messages.



Unrelated, but involves my crunchers.. Both of my UPSes (APC 1300, Tripp-Lite 1400) on my desk simultaneously died this morning at 0730utc. I woke up because all of the fan hum stopped (two rigs and a rack-mount 24-port gigabit switch), and there was a 60dB 2.5kHz tone coming from one of them. That was fun to deal with. Now I need 172Ah worth of 12v batteries to get all five of my units (7800VA in total) back up and running. Looks like it's going to cost about US$550. Or.. I can get an APC 3000VA 3U rackmount unit with new batteries on eBay for $550 + shipping. Decisions, decisions. I think I might go for getting the APC 2200 up and running to replace the two that were on the desk.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1301102 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1301112 - Posted: 2 Nov 2012, 0:53:12 UTC

Maybe the last 20 hours or something no well scheduler contact. Always:
Scheduler request failed: Timeout was reached


*new tasks* was enabled.

I set *no new tasks* - and then 178 uploaded tasks were accepted from the scheduler server in a bunch (successful report).


* Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1301112 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1301118 - Posted: 2 Nov 2012, 1:34:28 UTC

By installing a 7.xx client, setting a max report of 10 tasks in my cc_config file and setting No New Tasks. I find I can reliably report.

A report of 10 tasks was the most I could go, 15 or 20 timed out.

With nearly a thousand task to report, it's gonna be a Looooong day :-)

T.A.
ID: 1301118 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1301119 - Posted: 2 Nov 2012, 1:41:17 UTC - in response to Message 1301118.  

By installing a 7.xx client, setting a max report of 10 tasks in my cc_config file and setting No New Tasks. I find I can reliably report.

A report of 10 tasks was the most I could go, 15 or 20 timed out.

With nearly a thousand task to report, it's gonna be a Looooong day :-)

T.A.

Do you generate 10 new ones to report within that 303-second scheduler interval? If so, it's a negative-feedback loop and you won't catch up.

My scheduler contacts are hit-and-miss, and I'm not even reporting any completed tasks. Some of them get a reply in 10 seconds, others reach the time-out period.

I am, however, trying to hoard APs. Om nom nom nom..
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1301119 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1301122 - Posted: 2 Nov 2012, 1:49:58 UTC - in response to Message 1301119.  
Last modified: 2 Nov 2012, 1:51:39 UTC

Luckily the task s I'm crunching atm are taking around 10 minutes to crunch so the reporting is slowly pulling ahead.

I'm also helping with a bit of "button abuse". If you have set NNT you don't have to wait the 5 minutes for another "hit".

T.A.
ID: 1301122 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1301125 - Posted: 2 Nov 2012, 2:08:05 UTC - in response to Message 1301122.  

If you have set NNT you don't have to wait the 5 minutes for another "hit".

Interesting to know. I thought it was a universal 303 seconds between accepted contacts. Not that I ever have a huge pile of tasks to report since I'm CPU-only and AP.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1301125 · Report as offensive
Profile cov_route
Avatar

Send message
Joined: 13 Sep 12
Posts: 342
Credit: 10,270,618
RAC: 0
Canada
Message 1301128 - Posted: 2 Nov 2012, 2:15:28 UTC

I don't know what happened during the day today, but I come home and the task list on my account page insists I have 2270 tasks in progress.

There are actually 431 in my cache. Hmm.
ID: 1301128 · Report as offensive
Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · Next

Message boards : Number crunching : Panic Mode On (77) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.