Panic Mode On (115) Server Problems?

Message boards : Number crunching : Panic Mode On (115) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 31 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13159
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1984287 - Posted: 9 Mar 2019, 15:58:46 UTC - in response to Message 1984259.  

So it was me repeatedly trying to get the stuck Einstein uploads to finish by hitting retry and having them all in "Active" is what prevented the others from uploading or downloading and going into backoff.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1984287 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14532
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1984294 - Posted: 9 Mar 2019, 16:28:28 UTC - in response to Message 1984287.  

Sounds plausible.
ID: 1984294 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1638
Credit: 12,921,799
RAC: 89
New Zealand
Message 1984647 - Posted: 12 Mar 2019, 2:36:01 UTC
Last modified: 12 Mar 2019, 2:37:03 UTC

Lots of data files sitting complete at (128) it would be nice if they didn't load any more data after the outage so we can burn through some of the smaller files. Just my thoughts in an ideal world.

It would kinda be nice if we could have a feature that told us how many files we have processed in the last 24 hours. I am aware that you can do it by basic maths but you would need to check it each day at the same time. Also hard to get an accurate number if they have added more data. I am aware that this probably will not happen as it would more strain on the database.
ID: 1984647 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1984683 - Posted: 12 Mar 2019, 5:58:36 UTC - in response to Message 1984647.  

If I remember right, Eric said they do ~1.2TB of data from Greenbank per day.
So 1024 * 1.2 / 52.39GB = 23.5 files.
My best guess there ....
ID: 1984683 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1638
Credit: 12,921,799
RAC: 89
New Zealand
Message 1984700 - Posted: 12 Mar 2019, 8:07:01 UTC - in response to Message 1984683.  

If I remember right, Eric said they do ~1.2TB of data from Greenbank per day.
So 1024 * 1.2 / 52.39GB = 23.5 files.
My best guess there ....

Thank you for the information very interesting. Now if only we could see what files have been processed before they are removed. I am aware as I said in my previous post that there are approximately 9 files sitting at (128) I guess in a way this is showing me they are complete. :) I have a feeling these maybe stuck.
ID: 1984700 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 1984702 - Posted: 12 Mar 2019, 8:23:29 UTC - in response to Message 1984700.  

Thank you for the information very interesting. Now if only we could see what files have been processed before they are removed. I am aware as I said in my previous post that there are approximately 9 files sitting at (128) I guess in a way this is showing me they are complete. :) I have a feeling these maybe stuck.
No, it means the SSP is borked, as it has been intermittently for a while. If you look, it will also say that there are 28 GBT splitters running (channels in progress), when only 14 are provisioned. If you see that, pretty much everything else is possibly inaccurate as well. This seems to include failing to drop off the page files that have been completely split.
No way to know for sure, but my suspicion is that when they redid the throttle process a while back they failed to account for updating some portion of the SSP, as this seems to happen after the throttle kicks in.
ID: 1984702 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1984780 - Posted: 12 Mar 2019, 15:18:57 UTC

Well, its 10:17 CST(USA) and the website is still up. Usually it is down by now.

Maybe the putative ram upgrade is delaying things.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1984780 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14532
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1984782 - Posted: 12 Mar 2019, 15:21:31 UTC - in response to Message 1984780.  

Or perhaps Jeff is on vacation and Eric is throwing the switches.
ID: 1984782 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1984787 - Posted: 12 Mar 2019, 19:19:31 UTC

that was quick :)
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1984787 · Report as offensive
Profile -= Vyper =-
Volunteer tester
Avatar

Send message
Joined: 5 Sep 99
Posts: 1652
Credit: 1,065,191,981
RAC: 2,537
Sweden
Message 1984848 - Posted: 12 Mar 2019, 23:34:39 UTC
Last modified: 12 Mar 2019, 23:35:33 UTC

What have they done in the labs?! The servers are responsive and Quick after an outage, result Creation rate just flies. It was only a slight roadbump instead of a massive tornado wrecking it for hours?!

Well done all staff for speeding it up. Whatever you did, you did it flawlessly! *hats off* https://setistats.haveland.com/sah_v8_results.html

_________________________________________________________________________
Addicted to SETI crunching!
Founder of GPU Users Group
ID: 1984848 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13159
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1984858 - Posted: 13 Mar 2019, 0:12:42 UTC - in response to Message 1984848.  

+1

I agree. Don't know if the delayed upgrade of the last 32GB into carolyn happened or not . . . . it seemed very responsive and current even without the full memory package since last Tuesday. The website and servers seem to be running very well and the website is responsive and quick for a change. A pleasure to use.

Good job, lab elves!
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1984858 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1984860 - Posted: 13 Mar 2019, 0:13:23 UTC - in response to Message 1984858.  

+ 2
ID: 1984860 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 1984905 - Posted: 13 Mar 2019, 6:58:57 UTC

Just goes to show how badly memory problems can slow things down.
ID: 1984905 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1984939 - Posted: 13 Mar 2019, 14:54:38 UTC - in response to Message 1984860.  

+ 2


+3 or even +42 :)
A proud member of the OFA (Old Farts Association).
ID: 1984939 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13400
Credit: 208,696,464
RAC: 304
Australia
Message 1985256 - Posted: 15 Mar 2019, 9:19:08 UTC
Last modified: 15 Mar 2019, 9:19:26 UTC

Been getting the (very) occasional sticky download.
The timer starts, but nothing happens for 30sec to several minutes, then it finally gets going & eventually downloads.
Grant
Darwin NT
ID: 1985256 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 1985274 - Posted: 15 Mar 2019, 10:34:23 UTC - in response to Message 1985256.  

Been getting the (very) occasional sticky download.
The timer starts, but nothing happens for 30sec to several minutes, then it finally gets going & eventually downloads.

FWIW, haven't experienced that here this evening/morning.
ID: 1985274 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1985284 - Posted: 15 Mar 2019, 11:40:26 UTC - in response to Message 1985256.  

Been getting the (very) occasional sticky download.
The timer starts, but nothing happens for 30sec to several minutes, then it finally gets going & eventually downloads.


. . FWIW me too ...

Stephen

<shrug>
ID: 1985284 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13159
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1985342 - Posted: 15 Mar 2019, 17:43:40 UTC - in response to Message 1985284.  

I had a stuck upload on a host that would never start over two days. Exiting BOINC and restarting never changed its condition. I know I could have edited client_state but it wasn't worth the effort and possible dumping of all work so I just aborted the upload.

I too see some very slow downloads. Takes a while for them to start and finish. I also have some go into delayed retries because my connect time is reduced to 90 seconds and they never get started in time.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1985342 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1985611 - Posted: 17 Mar 2019, 14:54:01 UTC

Not to turn on the panic bottom but

Did anyone tried to see it's validated WU for example? In my case the page runs, runs & runs but never opens.
ID: 1985611 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7373
Credit: 44,181,323
RAC: 238
United States
Message 1985618 - Posted: 17 Mar 2019, 15:14:21 UTC - in response to Message 1985611.  

Not to turn on the panic bottom but

Did anyone tried to see it's validated WU for example? In my case the page runs, runs & runs but never opens.

Hi Juan,

I just checked and it works fine for me. :)

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr XO - L L & P _\\//
USS Vre'kasht NCC-33187
Winders 10 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 1985618 · Report as offensive
Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · 24 . . . 31 · Next

Message boards : Number crunching : Panic Mode On (115) Server Problems?


 
©2022 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.