Panic Mode On (115) Server Problems?

Message boards : Number crunching : Panic Mode On (115) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 23 · 24 · 25 · 26 · 27 · 28 · 29 . . . 30 · Next

AuthorMessage
Profile Kissagogo27 Special Project $75 donor
Avatar

Send message
Joined: 6 Nov 99
Posts: 717
Credit: 8,032,827
RAC: 62
France
Message 1987110 - Posted: 25 Mar 2019, 20:11:10 UTC - in response to Message 1986953.  

Try looking at who #28 is listed as belonging to, https://setiathome.berkeley.edu/top_hosts.php?sort_by=expavg_credit&offset=20
ID: 8682108
#28 Magne 171,989.52 8,889,326 7.8.3
GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] (8 processors)
NVIDIA GeForce GTX 1060 6GB (4095MB) driver: 418.43 OpenCL: 1.2
Linux Ubuntu Ubuntu 18.04.2 LTS [4.15.0-46-generic]

Now look at #8, https://setiathome.berkeley.edu/top_hosts.php
ID: 8690734
#8 Magne 363,269.95 18,080,802 7.4.44
GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] (8 processors)
[5] NVIDIA GeForce GTX 1060 6GB (4095MB) driver: 418.43
Linux 4.15.0-46-generic

Now note Magne only has One Linux Machine, https://setiathome.berkeley.edu/hosts_user.php?userid=118177
You should also note the RAC on his User page, Recent average credit 199,322.93
Most people would conclude it is the SAME machine.

You can also look at Juan's page and note the Credit; https://setiathome.berkeley.edu/show_user.php?userid=8606388
Total credit: 450,616,983
But when you look at his host, https://setiathome.berkeley.edu/hosts_user.php?userid=8606388
ID: 8662921: Total credit: 1,375,790,492
So, how does a Total of 450,616k translate to a Host of 1,375,790k?
Strange things around here...


possibliy using more than one session of boinc and then make fusion between them ?
ID: 1987110 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51527
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1987111 - Posted: 25 Mar 2019, 20:11:11 UTC

Eric is working on the problems.

Meow!
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1987111 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1646
Credit: 12,921,799
RAC: 89
New Zealand
Message 1987115 - Posted: 25 Mar 2019, 20:56:17 UTC - in response to Message 1987111.  

Eric is working on the problems.

Meow!

Thanks for the installation Mark
ID: 1987115 · Report as offensive
rcthardcore

Send message
Joined: 23 Nov 08
Posts: 48
Credit: 1,306,006
RAC: 0
United States
Message 1987116 - Posted: 25 Mar 2019, 21:01:31 UTC

Uploads are definitely very iffy today.
ID: 1987116 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1987119 - Posted: 25 Mar 2019, 21:22:19 UTC - in response to Message 1987024.  

. . Aaahh! Here we go again, 24 hours (or so) to maintenance outage and the uploads are playing up .... :(

Just like a switch was flipped ...


. . Yep, one minute everything is fine, then no uploads without kicking them over and over ... :(

Stephen

:(
ID: 1987119 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1987120 - Posted: 25 Mar 2019, 21:22:31 UTC - in response to Message 1987110.  
Last modified: 25 Mar 2019, 21:36:56 UTC

Look at the User Stats. A User can't have a Host with better stats than the User. Here's Magne's stats, https://boincstats.com/en/stats/-1/user/detail/204994/lastDays
The machine that was at #28 had OpenCL installed, the machine that appeared at #8 Doesn't have OpenCL installed. Not having OpenCL is common on a New Build as the installer doesn't install OpenCL by default.
We Know what happened to Juan. He broke his One computer's networking and had to reinstall the OS. When it came back online with the New system the SETI Server gave it a New ID with an Extra One BILLION Credits and Over an Extra One MILLION RAC. Juan wasn't doing anything out of the ordinary, certainly not 'fusioning'. It appears the Exact same thing happened with Magne, it blinked out with an RAC of around 172k and blinked back in with an RAC of 363k which is impossible for that machine, and his User Stats.

Try looking at who #28 is listed as belonging to, https://setiathome.berkeley.edu/top_hosts.php?sort_by=expavg_credit&offset=20
ID: 8682108
#28 Magne 171,989.52 8,889,326 7.8.3
GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] (8 processors)
NVIDIA GeForce GTX 1060 6GB (4095MB) driver: 418.43 OpenCL: 1.2
Linux Ubuntu Ubuntu 18.04.2 LTS [4.15.0-46-generic]

Now look at #8, https://setiathome.berkeley.edu/top_hosts.php
ID: 8690734
#8 Magne 363,269.95 18,080,802 7.4.44
GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] (8 processors)
[5] NVIDIA GeForce GTX 1060 6GB (4095MB) driver: 418.43
Linux 4.15.0-46-generic

Now note Magne only has One Linux Machine, https://setiathome.berkeley.edu/hosts_user.php?userid=118177
You should also note the RAC on his User page, Recent average credit 199,322.93
Most people would conclude it is the SAME machine.

You can also look at Juan's page and note the Credit; https://setiathome.berkeley.edu/show_user.php?userid=8606388
Total credit: 450,616,983
But when you look at his host, https://setiathome.berkeley.edu/hosts_user.php?userid=8606388
ID: 8662921: Total credit: 1,375,790,492
So, how does a Total of 450,616k translate to a Host of 1,375,790k?
Strange things around here...


possibliy using more than one session of boinc and then make fusion between them ?
ID: 1987120 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1987121 - Posted: 25 Mar 2019, 21:27:53 UTC - in response to Message 1987085.  

I'm also getting a lot of my uploads stuck at 100% while others timeout and just go into retry loops.


. . They are the most frustrating, after kicking them and kicking them you see one get to 100%, then ... nada, it just sits there taunting you ... aaarrggghh!

Stephen

:(
ID: 1987121 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22759
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1987123 - Posted: 25 Mar 2019, 21:37:28 UTC

We've seen this sort of things a few times when the database gets its bits in a twist, and that has been followed by a disk handing its notice in....
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1987123 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1987142 - Posted: 25 Mar 2019, 23:46:56 UTC

welp, the 1mil system is out of work, with about 2800 uploads just waiting to go through.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1987142 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1987145 - Posted: 25 Mar 2019, 23:57:40 UTC - in response to Message 1987142.  

Are they all reported?
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1987145 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1859
Credit: 268,616,081
RAC: 1,349
United States
Message 1987147 - Posted: 26 Mar 2019, 0:08:32 UTC - in response to Message 1987145.  

Are they all reported?

Reporting happens after upload is complete, right? At least here :)
ID: 1987147 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1987150 - Posted: 26 Mar 2019, 0:17:31 UTC - in response to Message 1987145.  
Last modified: 26 Mar 2019, 0:18:16 UTC

Nope. That’s the problem. They won’t upload (well they do eventually. Just a lot slower than normal). So I can’t report them. And with so many uploads pending, I haven’t been able to download new tasks all day. So I’ve run dry. Only the slower systems have been able to keep a low enough cache of pending uploads to still get downloads.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1987150 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1987152 - Posted: 26 Mar 2019, 0:20:29 UTC - in response to Message 1987150.  

I've been babysitting the farm all day. As long as a couple get uploaded, I can then do an update to report 100 tasks. Then I can get more work.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1987152 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1859
Credit: 268,616,081
RAC: 1,349
United States
Message 1987155 - Posted: 26 Mar 2019, 0:34:29 UTC
Last modified: 26 Mar 2019, 0:40:24 UTC

What I was trying to suggest is that Reporting is the act of telling the scheduling server that a task has been completed and uploaded to the upload server. True?
ID: 1987155 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1987162 - Posted: 26 Mar 2019, 0:56:46 UTC - in response to Message 1987152.  

I've been babysitting the farm all day. As long as a couple get uploaded, I can then do an update to report 100 tasks. Then I can get more work.

I was babysitting it all day too. But I can only report them at the rate that they upload. Which is slower than the rate at which they are computed. So the pending cache just grew and grew and never let up
ID: 1987162 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11451
Credit: 29,581,041
RAC: 66
United States
Message 1987163 - Posted: 26 Mar 2019, 1:43:36 UTC - in response to Message 1987162.  

Yep uploads remain borked.
ID: 1987163 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 1987169 - Posted: 26 Mar 2019, 3:09:22 UTC - in response to Message 1987152.  
Last modified: 26 Mar 2019, 3:56:09 UTC

I've been babysitting the farm all day. As long as a couple get uploaded, I can then do an update to report 100 tasks. Then I can get more work.


Thank you for the hint. I tried a manual update and suddenly it started uploading......
--edit----------
Rats, it looks like it stalled again...
--edit---
Tried the 8 uploads trick. And that got something moving again. I even got some Seti downloads. Then it started timing out again so I dropped it back to 2. And it is still clumping along uploading.
--edit---
Apparently leaving it at 8 uploads/downloads keeps enough connections "active" that 1 or 2 get through every once in a while.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1987169 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1987186 - Posted: 26 Mar 2019, 5:57:42 UTC

This is getting tiring ...
I up'd my transfer count to 16 which seems to have helped by having more connections open.
ID: 1987186 · Report as offensive
Cherokee150

Send message
Joined: 11 Nov 99
Posts: 192
Credit: 58,513,758
RAC: 74
United States
Message 1987188 - Posted: 26 Mar 2019, 6:10:12 UTC - in response to Message 1987169.  

Why stop at 8? I put mine at 14 simultaneously long ago, and it has helped me through many of these bottlenecks. It's not 100% successful, but enough get through that I can usually get enough downloads to keep reasonably busy. Also, if your machines process units super fast, and you are completely unable to get new work because of too many downloads, you might try setting "No new tasks" and nurse all the uploads through. When they are all uploaded, you can turn new work back on and, hopefully, fill your caches completely, or nearly completely, all at once, before too many new units complete and build up the backlog, resulting in the old, frustrating message, "Not requesting tasks: too many uploads in progress" bites your machines from behind, so to speak.

Just a thought, eh?
ID: 1987188 · Report as offensive
Cherokee150

Send message
Joined: 11 Nov 99
Posts: 192
Credit: 58,513,758
RAC: 74
United States
Message 1987189 - Posted: 26 Mar 2019, 6:34:08 UTC - in response to Message 1987186.  

Yes, Brent, you've got the right idea! (I was typing my response to Tom when you posted, so I didn't see it. ) I think I will boost mine from 14 1o 16, also. I made a bold move to 14 years ago, long before GPU processing took SETI to its "Next Generation". I guess it's time to add a couple more to mine, too. :-)
ID: 1987189 · Report as offensive
Previous · 1 . . . 23 · 24 · 25 · 26 · 27 · 28 · 29 . . . 30 · Next

Message boards : Number crunching : Panic Mode On (115) Server Problems?


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.