The Server Issues / Outages Thread - Panic Mode On! (119)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 55 · 56 · 57 · 58 · 59 · 60 · 61 . . . 107 · Next

AuthorMessage
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 2042466 - Posted: 2 Apr 2020, 5:46:37 UTC

Warning: number_format() expects parameter 1 to be double, string given in /disks/carolyn/b/home/boincadm/projects/sah/html/seti_boinc_html/sah_status.php on line 604 Warning: number_format() expects parameter 1 to be double, string given in /disks/carolyn/b/home/boincadm/projects/sah/html/seti_boinc_html/sah_status.php on line 608
ID: 2042466 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 2042469 - Posted: 2 Apr 2020, 6:11:16 UTC

Now getting Scheduler request failures.
Grant
Darwin NT
ID: 2042469 · Report as offensive     Reply Quote
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1856
Credit: 268,616,081
RAC: 1,349
United States
Message 2042470 - Posted: 2 Apr 2020, 6:12:17 UTC - in response to Message 2042469.  
Last modified: 2 Apr 2020, 6:16:33 UTC

Now getting Scheduler request failures.

+1
...
ID: 2042470 · Report as offensive     Reply Quote
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 2042478 - Posted: 2 Apr 2020, 7:36:47 UTC

SSP now shows:

Workunits waiting for assimilation 0 0 54 15m
Workunits waiting for db purging 0 7,380 22,669,039 15m
Results waiting for db purging 0 1 27,791,219 15m

From what I remember there were millions of units waiting for assimilation last night.
What happened ? Were they all processed over night or is the data on SSP incorrect ?

Tom
ID: 2042478 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 2042480 - Posted: 2 Apr 2020, 7:49:07 UTC

Scheduler's awake again, an there's work ready to go.
Unfortunately the Scheduler isn't actually sending anything out at present.
Grant
Darwin NT
ID: 2042480 · Report as offensive     Reply Quote
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 2042481 - Posted: 2 Apr 2020, 7:50:46 UTC - in response to Message 2042478.  

Tom I'm not exactly sure what happened. From what I remember they got shifted back to waiting for administration. Hard to know I guess it's just a waiting game
ID: 2042481 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2042482 - Posted: 2 Apr 2020, 7:50:51 UTC - in response to Message 2042478.  
Last modified: 2 Apr 2020, 7:56:43 UTC

SSP now shows:
Workunits waiting for assimilation 0 0 54 15m
Workunits waiting for db purging 0 7,380 22,669,039 15m
Results waiting for db purging 0 1 27,791,219 15m
From what I remember there were millions of units waiting for assimilation last night.
What happened ? Were they all processed over night or is the data on SSP incorrect ?

Tom


. . If you look now there are 22 million WUs waiting for purging and 27 million Tasks waiting to be purged. It seems that furball has been moved ...

. . Sorry that was what you were pointing out, there were too many numbers jumbled together ... :(

. . The missing WUs waiting for assimilation may be because they are part of the rise to 8.5 million tasks in the field.

Stephen

. . YAY! ... (?)
ID: 2042482 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 2042483 - Posted: 2 Apr 2020, 7:51:02 UTC - in response to Message 2042478.  

SSP now shows:

Workunits waiting for assimilation 0 0 54 15m
Workunits waiting for db purging 0 7,380 22,669,039 15m
Results waiting for db purging 0 1 27,791,219 15m

From what I remember there were millions of units waiting for assimilation last night.
What happened ? Were they all processed over night or is the data on SSP incorrect ?
Most likely another glitch.
After the last system outage the numbers were all messed up for an hour or two before returning to their previous excessive values.
Grant
Darwin NT
ID: 2042483 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 2042484 - Posted: 2 Apr 2020, 7:53:01 UTC - in response to Message 2042482.  

. . If you look now there are 22 million WUs waiting for purging and 27 million Tasks waiting to be purged. It seems that furball has been moved ...
And all the Ready-to-send numbers went along with them.
Grant
Darwin NT
ID: 2042484 · Report as offensive     Reply Quote
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 2042485 - Posted: 2 Apr 2020, 7:55:16 UTC - in response to Message 2042483.  

SSP now shows:

Workunits waiting for assimilation 0 0 54 15m
Workunits waiting for db purging 0 7,380 22,669,039 15m
Results waiting for db purging 0 1 27,791,219 15m

From what I remember there were millions of units waiting for assimilation last night.
What happened ? Were they all processed over night or is the data on SSP incorrect ?
Most likely another glitch.
After the last system outage the numbers were all messed up for an hour or two before returning to their previous excessive values.


Good guess: SSP now shows

Workunits waiting for assimilation 0 0 8,172,827 9m

Back to where we were before ...

Tom
ID: 2042485 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2042486 - Posted: 2 Apr 2020, 8:21:22 UTC

And on the next refresh, it says that there are 4,882,659 Results ready to send. I think we have one (or possibly several) very confused servers. Hopefully they'll stop arguing amongst themselves as the day progresses.
ID: 2042486 · Report as offensive     Reply Quote
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 2042488 - Posted: 2 Apr 2020, 8:32:40 UTC - in response to Message 2042486.  

it says that there are 4,882,659 Results ready to send.
You forgot to wash your eyes out with coffee again in these coronadays? RRtS is 0, has been since the last tape went through. RoitF is 4,880,027
ID: 2042488 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2042489 - Posted: 2 Apr 2020, 8:34:37 UTC - in response to Message 2042488.  

it says that there are 4,882,659 Results ready to send.
You forgot to wash your eyes out with coffee again in these coronadays? RRtS is 0, has been since the last tape went through. RoitF is 4,880,027
Well, it was a copy'n'paste from the 08:10 show. Went back to 0 at 08:20. Like I said, confused.
ID: 2042489 · Report as offensive     Reply Quote
Profile Kissagogo27 Special Project $75 donor
Avatar

Send message
Joined: 6 Nov 99
Posts: 716
Credit: 8,032,827
RAC: 62
France
Message 2042495 - Posted: 2 Apr 2020, 9:19:32 UTC

MB ready to send fall to 0 and creation rate to 2/sec
ID: 2042495 · Report as offensive     Reply Quote
AllgoodGuy

Send message
Joined: 29 May 01
Posts: 293
Credit: 16,348,499
RAC: 266
United States
Message 2042503 - Posted: 2 Apr 2020, 11:35:31 UTC - in response to Message 2042489.  

it says that there are 4,882,659 Results ready to send.
You forgot to wash your eyes out with coffee again in these coronadays? RRtS is 0, has been since the last tape went through. RoitF is 4,880,027
Well, it was a copy'n'paste from the 08:10 show. Went back to 0 at 08:20. Like I said, confused.

This is what happened the other day when they switched the pages to the replica DB. In fact, it looks like exactly the same data being displayed on my data when that happened. High readings on the astropulse valid tasks. Probably a remnent of that data.
ID: 2042503 · Report as offensive     Reply Quote
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 2042527 - Posted: 2 Apr 2020, 14:02:12 UTC

I would say I must have some ghosts. The website says I have 640 waiting to be processed.

I only have one system (my Windows 10 box) that is still processing.
And it is processing GPU tasks only.
The tasks are taking up to 40+ minutes per task so it is taking a long time to get through the cache.

Tom M
A proud member of the OFA (Old Farts Association).
ID: 2042527 · Report as offensive     Reply Quote
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 2042530 - Posted: 2 Apr 2020, 14:07:04 UTC

At least it was a good time to change BOINC over onto a new webserver, which promptly went offline.
Good thing I read that David asked to report any problems. I think the server being AWOL since about 10am LT counts like such. :P
ID: 2042530 · Report as offensive     Reply Quote
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 2042531 - Posted: 2 Apr 2020, 14:07:32 UTC - in response to Message 2042527.  

I would say I must have some ghosts. The website says I have 640 waiting to be processed.

I only have one system (my Windows 10 box) that is still processing.
And it is processing GPU tasks only.
The tasks are taking up to 40+ minutes per task so it is taking a long time to get through the cache.

Tom M


the website is 3-4 days out of date due to the replica delay. it's been this way for a long time. anything you see on the website reflects what your system was doing several days ago.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 2042531 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2042539 - Posted: 2 Apr 2020, 14:59:30 UTC - in response to Message 2042488.  

it says that there are 4,882,659 Results ready to send.
You forgot to wash your eyes out with coffee again in these coronadays? RRtS is 0, has been since the last tape went through. RoitF is 4,880,027


. . The numbers have been flicking back and forth like a fan dancers fans ... very confusing ...

Stephen

:(
ID: 2042539 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2042547 - Posted: 2 Apr 2020, 15:14:41 UTC - in response to Message 2042530.  

At least it was a good time to change BOINC over onto a new webserver, which promptly went offline.
Good thing I read that David asked to report any problems. I think the server being AWOL since about 10am LT counts like such. :P
Isn't that David's usual modus operandi?

* no pre-warning
* do the business last thing at night
* post about what a cracking success it's been
* go to bed
* let the Europeans watch it fall over

?
ID: 2042547 · Report as offensive     Reply Quote
Previous · 1 . . . 55 · 56 · 57 · 58 · 59 · 60 · 61 . . . 107 · Next

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.