Panic Mode On (116) Server Problems?

Message boards : Number crunching : Panic Mode On (116) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 47 · Next

AuthorMessage
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1994246 - Posted: 18 May 2019, 23:14:36 UTC

No panic. I wish they would allow partly started files to finish before starting a bunch of new work. E.g. allowing the 33, 34 and 35 files to finish before they started the bunch of 25 files. Does anyone else feel the same? On the plus side I guess it gives the server a bit of a rest because not it is many results are being returned per hour currently somewhere around 106,000.

I will be pleased when blc34_2bit_guppi_58389_22167_FRB121102_DIAG_0013 180.04 GB gets processed because I want to see how long it takes to process such a big file. I have absolutely no idea when that will be as it has been on the page for quite some time. Clearly other work gets put in front of it as we have seen with the tapes that were added just before the weekend
ID: 1994246 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14654
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1994566 - Posted: 21 May 2019, 10:39:03 UTC

Groan. The great silent tinkerer has messed up the server status page, yet again. Sometime in the last three hours,

Warning: number_format() expects parameter 1 to be double, string given in /disks/carolyn/b/home/boincadm/projects/sah/html/seti_boinc_html/sah_status.php 
has appeared for lines 604, 606 and 608.

And the column widths have gone walkabout.
ID: 1994566 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34971
Credit: 261,360,520
RAC: 489
Australia
Message 1994567 - Posted: 21 May 2019, 10:58:25 UTC - in response to Message 1994566.  

Groan. The great silent tinkerer has messed up the server status page, yet again. Sometime in the last three hours,

Warning: number_format() expects parameter 1 to be double, string given in /disks/carolyn/b/home/boincadm/projects/sah/html/seti_boinc_html/sah_status.php 
has appeared for lines 604, 606 and 608.

And the column widths have gone walkabout.
Well that's been fixed now so I guess that we'll just wait and see what pops out next.

Cheers.
ID: 1994567 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14654
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1994569 - Posted: 21 May 2019, 12:14:53 UTC - in response to Message 1994567.  

So I see. You'd sort-of assume that these errors would appear as a (thankfully temporary) side-effect of some deliberate maintenance or upgrading, but I can't see any changes on the (fixed) page. So what was all the tinkering about?
ID: 1994569 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1994590 - Posted: 21 May 2019, 13:50:19 UTC - in response to Message 1994168.  

Cool, thanks. I got it working, albeit a bit wonky at the moment. I went with SSE4.1 for MB and SSE3 for AP.

And did a manual install of 6.10.58, but Manager is angry about running. I don't need a GUI on the linux machine though.. i just use Manager from my daily driver to remote connect.

4770K is many orders of magnitude faster than the Sempron 3500+ was. :)
For a Gen4 CPU I think AVX would be the best.
I looked it Up and that 4770K stomps on your AMD 6100 :)
I think those CPUs were also Overclock monsters :D But not the RAM.
The LGA 1150 boards are pretty cheap if you need a better layout.

But first things first, finish setup and let it run untouched for a week to see if it reliable.
It would actually make a pretty nice Daily Driver.
ID: 1994590 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1994609 - Posted: 21 May 2019, 15:19:59 UTC

No Panic. The system has running so well. Hopefully our outage today will be nice and short.

These blc25 WUs take a bit longer to run, so I think it will take us a while to process all the files they have put into the queue.
ID: 1994609 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1994620 - Posted: 21 May 2019, 18:12:49 UTC - in response to Message 1994590.  

Cool, thanks. I got it working, albeit a bit wonky at the moment. I went with SSE4.1 for MB and SSE3 for AP.

And did a manual install of 6.10.58, but Manager is angry about running. I don't need a GUI on the linux machine though.. i just use Manager from my daily driver to remote connect.

4770K is many orders of magnitude faster than the Sempron 3500+ was. :)
For a Gen4 CPU I think AVX would be the best.
I looked it Up and that 4770K stomps on your AMD 6100 :)
I think those CPUs were also Overclock monsters :D But not the RAM.
The LGA 1150 boards are pretty cheap if you need a better layout.

But first things first, finish setup and let it run untouched for a week to see if it reliable.
It would actually make a pretty nice Daily Driver.

Yeah it's not bad for pulling it out of a trash pile.

The Sempron 3500 that it replaced was doing these MBs in about 5.5 hours and was single-core. 4770K is doing them in about 1 hour, and I've got it set for 4 at a time. So that's what.. 20-22x faster? If you go and look at the oldest tasks on that machine, you can see what the run times for the Sempron were.

Before those disappear, this is what they are:
19,350.31
20,327.48
18,999.27
18,965.42
20,198.80
20,674.27

Compared to:
4,032.10
3,932.09
3,925.96
3,830.79
3,861.32

...and 4 at a time.

Waiting for some APs to show up to see how those do. Sempron was doing those in about 45 hours. I have to imagine this will chew through it much faster.

As for daily driver, it's still a very capable CPU for today. I have a friend that has one in his gaming machine and it is still doing just fine.. it just uses more power and makes more heat than a newer gen, but performance-wise.. still very capable. Not as good as Ryzen though. :)
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1994620 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1994628 - Posted: 21 May 2019, 20:05:59 UTC

Has the maintenance come and gone? Local time 10:30 am CST is normally right in the middle of the outage. But I am still typing here....

Tom
A proud member of the OFA (Old Farts Association).
ID: 1994628 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1994629 - Posted: 21 May 2019, 20:07:15 UTC - in response to Message 1994628.  

Has the maintenance come and gone? Local time 10:30 am CST is normally right in the middle of the outage. But I am still typing here....

Tom


Lucky me. Right after I posted that, the website went down :)
Its 3:07 local/CST so I would say its "up" for what it is worth.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1994629 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1994635 - Posted: 21 May 2019, 22:04:58 UTC - in response to Message 1994629.  

Has the maintenance come and gone? Local time 10:30 am CST is normally right in the middle of the outage. But I am still typing here....

Tom


Lucky me. Right after I posted that, the website went down :)
Its 3:07 local/CST so I would say its "up" for what it is worth.

Tom


. . Well according to my logs it lasted less than 2 hours (again).

. . I still cannot get used to these really short outages. I keep waiting for the "gotcha"!

Stephen

? ?
ID: 1994635 · Report as offensive
Lionel

Send message
Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1994637 - Posted: 21 May 2019, 22:06:48 UTC - in response to Message 1994635.  

i think they're great :)
ID: 1994637 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1994655 - Posted: 21 May 2019, 23:30:02 UTC - in response to Message 1994635.  

Has the maintenance come and gone? Local time 10:30 am CST is normally right in the middle of the outage. But I am still typing here....

Tom


Lucky me. Right after I posted that, the website went down :)
Its 3:07 local/CST so I would say its "up" for what it is worth.

Tom


. . Well according to my logs it lasted less than 2 hours (again).

. . I still cannot get used to these really short outages. I keep waiting for the "gotcha"!

Stephen

? ?


G O T C H A !!!! ;)
A proud member of the OFA (Old Farts Association).
ID: 1994655 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1994739 - Posted: 22 May 2019, 14:01:10 UTC
Last modified: 22 May 2019, 14:40:33 UTC

. . Well now I think the 'gotcha' has arrived. Downloads hanging again and task cache getting low :(

. . Same on all rigs and the main unit is totally out of work. The problem must have started hours ago. Manually kicking the servers is getting some d/ls to complete but most are still hanging.

. . Finally cleared the d/l backlog on one machine but still unable to report completed work as 'cannot connect to server'.

. . Also the message base is slow, taking forever to send this message ...

Stephen

:(
ID: 1994739 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1994795 - Posted: 22 May 2019, 22:01:28 UTC

. . It seems I am the only one getting this problem as there have been no other comments in 7 hours :(

. . Maybe it is time for reboots all round.

Stephen

? ?
ID: 1994795 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34971
Credit: 261,360,520
RAC: 489
Australia
Message 1994796 - Posted: 22 May 2019, 22:04:37 UTC

Might be Stephen as no problems have been logged here.

Cheers.
ID: 1994796 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13755
Credit: 208,696,464
RAC: 304
Australia
Message 1994803 - Posted: 22 May 2019, 22:40:43 UTC
Last modified: 22 May 2019, 22:42:34 UTC

Just had a look in my log & no signs of issues since before midnight last night CST, although someone has posted in another thread about not being able to contact the servers (although their issue is with just one machine of 3).

Maybe give your modem a reboot as well?
Grant
Darwin NT
ID: 1994803 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1994805 - Posted: 22 May 2019, 23:01:44 UTC - in response to Message 1994739.  

I get a few backed off downloads on a couple of machines usually in the first hour after the project returns. No problem getting them to download as soon as I notice them in the Transfers tab of BoincTasks.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1994805 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1994809 - Posted: 23 May 2019, 0:16:15 UTC - in response to Message 1994803.  
Last modified: 23 May 2019, 0:17:49 UTC

Just had a look in my log & no signs of issues since before midnight last night CST, although someone has posted in another thread about not being able to contact the servers (although their issue is with just one machine of 3).

Maybe give your modem a reboot as well?


. . Well I haven't gotten around to doing the reboots yet but the problem seems to have cleared by itself. I had tried changing the 'hosts' file but no combination made any difference, so maybe it was an ISP issue. The up/downloads seem to be fine now and the forum is responding AOK. I will monitor during the day and if there is any hint of further problems I will reboot everything ...

Stephen

:(
ID: 1994809 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1994819 - Posted: 23 May 2019, 1:40:35 UTC - in response to Message 1994795.  

. . It seems I am the only one getting this problem as there have been no other comments in 7 hours :(

. . Maybe it is time for reboots all round.

Stephen

? ?


Just came home to some kind of network connectivity issue with Seti. I had a modest bunch of uploads with 5 hour back offs. Hit with a re-send and they all went through. Then I realized I was having some issues with WCG so I unplugged/plugged in my "dongle" and all is working again.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1994819 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1995034 - Posted: 24 May 2019, 17:17:18 UTC

No Panic.

The results received in last hour has been climbing. It has been under 100K with this latest batch of blc25s , but has now climbed to 135k. I think the system can handle about 200K/hour , so no panic, just curiosity.
ID: 1995034 · Report as offensive
Previous · 1 . . . 17 · 18 · 19 · 20 · 21 · 22 · 23 . . . 47 · Next

Message boards : Number crunching : Panic Mode On (116) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.