Panic Mode On (54) Server problems?

Message boards : Number crunching : Panic Mode On (54) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 10 · Next

AuthorMessage
Profile Mad Fritz
Avatar

Send message
Joined: 20 Jul 01
Posts: 87
Credit: 11,334,904
RAC: 0
Switzerland
Message 1154261 - Posted: 20 Sep 2011, 22:58:04 UTC - in response to Message 1154250.  


[dream]...
... it can do an AP in ten seconds and fifty of them at a time [/dream]

Now that thing will shure stiff the servers . . .



The normal daily jobs now already do that ;-)
ID: 1154261 · Report as offensive
Profile eaglescouter

Send message
Joined: 28 Dec 02
Posts: 162
Credit: 42,012,553
RAC: 0
United States
Message 1154262 - Posted: 20 Sep 2011, 23:00:40 UTC - in response to Message 1154260.  

Providing Berkeley BOINC with more high power CPU's is pointless if they cannot keep their network connection working properly....

Likewise I cannot justify replacing my aging farm if Berkeley cannot keep the work flowing.

No I do not do other projects and do not need a backup project so take that arguement elsewhere.

My expectations are low:

Keep a reasonable amount of work flowing on a regular basis.
Post updates when there is a breakdown.
Let the world know when we find ET.
Post my earned credits regularly and reliably.


It's not too many computers, it's a lack of circuit breakers for this room. But we can fix it :)
ID: 1154262 · Report as offensive
Profile Mad Fritz
Avatar

Send message
Joined: 20 Jul 01
Posts: 87
Credit: 11,334,904
RAC: 0
Switzerland
Message 1154272 - Posted: 20 Sep 2011, 23:39:54 UTC

The biggest problem I see here is that the "information flow" to the distributors of this project is a little bit flaw.
We recognise the unscheduled outages, moaning about them, the problem is solved sooner or later but what problem was...

ATM WCG has taken over 2 of my machines , Einstein has won one (0% resource sharing with S@H).

Makes me sad but not crying for suicide :-)


ID: 1154272 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1154273 - Posted: 20 Sep 2011, 23:39:59 UTC

So my cache is lasting just slightly longer than I thought. No idle cores yet, but that will start in about 4 hours. No more tasks "ready to start". 3.4M seconds for work requests end up with "Message from server: Project has no tasks available". Wondering what it takes to get an AP that thinks it will take months to complete.

Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1154273 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1154283 - Posted: 21 Sep 2011, 0:39:14 UTC - in response to Message 1154260.  

snip/snip/chainsaw...
Power7 includes 32 Mbytes in embedded DRAM in L3 cache alone. The chip also sports 590 Gbytes/second total chip bandwidth including two four-channel memory controllers per die.


IBM's Power7 packs eight cores and 32 Mbytes eDRAM on a die.
Click on image to enlarge.


32 mega bytes - - - is that that it, huh, how are we supposed to crunch a workunit in that little space :¬)

Or does it mean click to enlarge the L3 as required, I can work with that :¬)
ID: 1154283 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1154286 - Posted: 21 Sep 2011, 0:48:58 UTC

The cricket graph seem very weak, and I can't report anything. I wonder if this is a changeset to correct last weeks situation. I'm sure that at some point things will improve, but for now my end seems a bit dicey.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1154286 · Report as offensive
Blake Bonkofsky
Volunteer tester
Avatar

Send message
Joined: 29 Dec 99
Posts: 617
Credit: 46,383,149
RAC: 0
United States
Message 1154287 - Posted: 21 Sep 2011, 0:51:21 UTC - in response to Message 1154286.  

Yup, I can't do anything either. Uploads are fine, but can't reach the scheduler.
ID: 1154287 · Report as offensive
Profile soft^spirit
Avatar

Send message
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1154289 - Posted: 21 Sep 2011, 0:57:54 UTC - in response to Message 1154286.  

The cricket graph seem very weak, and I can't report anything. I wonder if this is a changeset to correct last weeks situation. I'm sure that at some point things will improve, but for now my end seems a bit dicey.

Steve


Do you mean like this?
9/20/2011 5:26:10 PM SETI@home Sending scheduler request: To fetch work.
9/20/2011 5:26:10 PM SETI@home Requesting new tasks for GPU
9/20/2011 5:26:32 PM Project communication failed: attempting access to reference site
9/20/2011 5:26:32 PM SETI@home Scheduler request failed: Couldn't connect to server
9/20/2011 5:26:33 PM Internet access OK - project servers may be temporarily down.
9/20/2011 5:41:37 PM SETI@home Sending scheduler request: To fetch work.
9/20/2011 5:41:37 PM SETI@home Requesting new tasks for GPU
9/20/2011 5:42:21 PM SETI@home Scheduler request failed: HTTP internal server error
9/20/2011 5:54:03 PM SETI@home update requested by user
9/20/2011 5:54:06 PM SETI@home Sending scheduler request: Requested by user.
9/20/2011 5:54:06 PM SETI@home Requesting new tasks for CPU and GPU
9/20/2011 5:54:28 PM Project communication failed: attempting access to reference site
9/20/2011 5:54:28 PM SETI@home Scheduler request failed: Couldn't connect to server
9/20/2011 5:54:29 PM Internet access OK - project servers may be temporarily down.

Janice
ID: 1154289 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1154290 - Posted: 21 Sep 2011, 0:58:14 UTC

My machine can't connect to the scheduler server too.


Scheduler request failed: Couldn't connect to server

and
Scheduler request failed: Timeout was reached



- Best regards! - Sutaru Tsureku, team seti.international founder. - Optimize your PC for higher RAC. - SETI@home needs your help. -
ID: 1154290 · Report as offensive
EdwardPF
Volunteer tester

Send message
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 1154307 - Posted: 21 Sep 2011, 1:39:45 UTC - in response to Message 1154290.  
Last modified: 21 Sep 2011, 2:12:52 UTC

Who just let the dogs out?? I just got a gulp of 49 MB's!!!!

<edit> with time est of 3-11 hrs ( for a 2min 45sec run) </edit>
<edit2> all the MB went to the Nvidia GPU </edit2>
<edit3> still getting:
This computer has reached a limit on tasks in progress

</edit3>

Ed F
ID: 1154307 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1154310 - Posted: 21 Sep 2011, 1:46:31 UTC - in response to Message 1154307.  

Who just let the dogs out?? I just got a gulp of 49 MB's!!!!

Ed F


You have <flops> entries in your app_info.xml file?

How are the estimated times of the new WUs after this weekly maintenance?


- Best regards! - Sutaru Tsureku, team seti.international founder. - Optimize your PC for higher RAC. - SETI@home needs your help. -
ID: 1154310 · Report as offensive
EdwardPF
Volunteer tester

Send message
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 1154311 - Posted: 21 Sep 2011, 1:47:18 UTC - in response to Message 1154310.  

no <flops>

Ed F
ID: 1154311 · Report as offensive
Profile Jim_S
Avatar

Send message
Joined: 23 Feb 00
Posts: 4705
Credit: 64,560,357
RAC: 31
United States
Message 1154317 - Posted: 21 Sep 2011, 2:15:41 UTC

Where do you put the <flops> entry in the App_info?

I Desire Peace and Justice, Jim Scott (Mod-Ret.)
ID: 1154317 · Report as offensive
Profile Sunny129
Avatar

Send message
Joined: 7 Nov 00
Posts: 190
Credit: 3,163,755
RAC: 0
United States
Message 1154318 - Posted: 21 Sep 2011, 2:17:41 UTC

wow, AP tasks "ready to send" went from ~32,000 to ~14,000 in less than an hour! which hosts are receiving tasks?..b/c i haven't gotten an AP task in days, and i know several others haven't either...
ID: 1154318 · Report as offensive
EdwardPF
Volunteer tester

Send message
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 1154320 - Posted: 21 Sep 2011, 2:23:36 UTC - in response to Message 1154318.  

I just turned AP off 'cause that's all I've been running for a several days ... I'm doing my last 8 (20 hrs each on all 8 CPU's) now.

I have NO IDEA the "whys and wherefores" of getting so many this past "uptime"

Ed F
ID: 1154320 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1154322 - Posted: 21 Sep 2011, 2:28:45 UTC

I got 4 AP assigned to each of two machines but downloading them is a real problem.
Boinc....Boinc....Boinc....Boinc....
ID: 1154322 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1154323 - Posted: 21 Sep 2011, 2:32:40 UTC - in response to Message 1154318.  

wow, AP tasks "ready to send" went from ~32,000 to ~14,000 in less than an hour! which hosts are receiving tasks?..b/c i haven't gotten an AP task in days, and i know several others haven't either...



I've sorta got two of them. They're stuck in download.



PROUD MEMBER OF Team Starfire World BOINC
ID: 1154323 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 1154326 - Posted: 21 Sep 2011, 2:51:33 UTC - in response to Message 1154289.  

Yep that's what I'm seeing too, No work units to download.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 1154326 · Report as offensive
Profile Floyd
Avatar

Send message
Joined: 19 May 11
Posts: 524
Credit: 1,870,625
RAC: 0
United States
Message 1154327 - Posted: 21 Sep 2011, 2:54:28 UTC
Last modified: 21 Sep 2011, 2:58:36 UTC

9/20/2011 9:48:52 PM | SETI@home | Requesting new tasks for CPU and NVIDIA GPU
9/20/2011 9:51:24 PM | SETI@home | Scheduler request failed: Server returned nothing (no headers, no data)9/20/2011 9:51:39 PM | | Project communication failed: attempting access to reference site
9/20/2011 9:51:42 PM | | Internet access OK - project servers may be temporarily down.

According to server status , it should be working just fine... ?
SETI@home server status information is also available in XML.

[As of 21 Sep 2011 | 2:50:05 UTC]

Part of server Vader is showing working and another part is showing down....

download server 2

vader

Running
--------------

sah_assimilator2

vader

Not Running
ID: 1154327 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1154328 - Posted: 21 Sep 2011, 2:57:39 UTC
Last modified: 21 Sep 2011, 3:12:05 UTC

You guys!!!

20-Sep-2011 22:36:05 [SETI@home] Sending scheduler request: To fetch work. Requesting 3303712 seconds of work, reporting 0 completed tasks
20-Sep-2011 22:37:10 [SETI@home] Scheduler request succeeded: got 9 new tasks


APs! ETA of 89:03:35 instead of ~25:30:00. Downloading is a problem though. Of the 9, 3 finished when I wasn't paying attention, the other six either go into instant-fail or time-out after several minutes.

edit:
2011-09-20 23:01:16|SETI@home|Finished download of ap_27jn11ac_B5_P0_00085_20110919_09041.wu
2011-09-20 23:04:13|SETI@home|Finished download of ap_27jn11ac_B5_P0_00089_20110919_09041.wu
2011-09-20 23:05:25|SETI@home|Finished download of ap_27jn11ac_B5_P0_00034_20110919_09041.wu
2011-09-20 23:06:03|SETI@home|Finished download of ap_27jn11ac_B5_P0_00098_20110919_09041.wu
2011-09-20 23:07:48|SETI@home|Finished download of ap_27jn11ac_B5_P0_00044_20110919_09041.wu
2011-09-20 23:10:01|SETI@home|Finished download of ap_27jn11ac_B5_P0_00049_20110919_09041.wu


Got them done. Now to get to crunching them and watch them drop minutes per second on the ETA and get this DCF fixed.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1154328 · Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 10 · Next

Message boards : Number crunching : Panic Mode On (54) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.