The Server Issues / Outages Thread - Panic Mode On! (119)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 91 · 92 · 93 · 94 · 95 · 96 · 97 . . . 107 · Next

AuthorMessage
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2046891 - Posted: 25 Apr 2020, 7:05:36 UTC

If you force the client to poll exactly at every cooldown period, you'll never get any work because you always hit the scheduler a little bit too soon. The cooldown is measured from the completion of one request to the start of the next one (as the server sees them), so you have to allow some time for the server to process the request.
ID: 2046891 · Report as offensive     Reply Quote
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 2046895 - Posted: 25 Apr 2020, 10:19:49 UTC - in response to Message 2046815.  

Perhaps even too many and that leads to the current situation...
The problem is that Boinc prefers handing any new tasks to those hosts that already have work.

Hosts that have work do scheduler requests as often as the cooldown allows. Empty ones end up having several hour backoffs between requests. So the empty hosts that could return the results almost immediately never get them.

So why not set NNT and allow the rest of us the chance to get a few?
CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 2046895 · Report as offensive     Reply Quote
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 2046904 - Posted: 25 Apr 2020, 12:00:43 UTC - in response to Message 2046895.  
Last modified: 25 Apr 2020, 12:01:54 UTC

So why not set NNT and allow the rest of us the chance to get a few?



I suppose everyone want to "crunch the last WU"

Does not really bother me, I leave my 2 Windows machines on and get what I get. As I use them both during the day and am a believer in not powering them on and off they will be on anyway.
ID: 2046904 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046906 - Posted: 25 Apr 2020, 12:31:24 UTC - in response to Message 2046904.  
Last modified: 25 Apr 2020, 12:47:09 UTC

So why not set NNT and allow the rest of us the chance to get a few?



I suppose everyone want to "crunch the last WU"


Well 2 out of my last 5 Tasks were USEFUL, because they helped to complete Workunits that still needed Validation

What is the point of crunching work on a super fast GPU that has already been VALIDATED days ago by 2 or 3 slower hosts ?

I would estimate that around 90% of Mr 1337's tasks In Progress are already validated !

My final Pending https://setiathome.berkeley.edu/workunit.php?wuid=3953270075 was one of 7 that had him as the original wingman. The other 6 are already validated days ago, but the 1337 guy still hasn't got around to them yet !
ID: 2046906 · Report as offensive     Reply Quote
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 2046907 - Posted: 25 Apr 2020, 12:47:36 UTC - in response to Message 2046906.  

So why not set NNT and allow the rest of us the chance to get a few?



I suppose everyone want to "crunch the last WU"


Well 2 out of my last 5 Tasks were USEFUL, because they helped to complete Workunits that still needed Validation

What is the point of crunching work on a super fast GPU that has already been VALIDATED days ago by 2 or 3 slower hosts ?

I would estimate that around 90% of Mr 1337's tasks are already validated !

My final Pending https://setiathome.berkeley.edu/workunit.php?wuid=3953270075 was one of 7 that had him as the original wingman. The other 6 are already validated days ago, but the 1337 guy still hasn't got around to them yet !

Hi Keith,

I looked at the hosts of several of the top 10 heavy hitters and very few of them had very few tasks on their hosts. None of them had what Mr. 1337 has. Some people just think they are better and more deserving than all others and have no compassion at all. What can you say? Par for the course.

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 2046907 · Report as offensive     Reply Quote
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2046908 - Posted: 25 Apr 2020, 12:49:11 UTC - in response to Message 2046906.  

What is the point of crunching work on a super fast GPU that has already been VALIDATED days ago by 2 or 3 slower hosts ?
The host returns them when it returns them. Do you really expect me to click through all the tasks on the web site one by one to figure out what task have a filled quorum and what tasks do not?
ID: 2046908 · Report as offensive     Reply Quote
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2046910 - Posted: 25 Apr 2020, 12:59:39 UTC - in response to Message 2046907.  

I looked at the hosts of several of the top 10 heavy hitters and very few of them had very few tasks on their hosts. None of them had what Mr. 1337 has. Some people just think they are better and more deserving than all others and have no compassion at all. What can you say? Par for the course.
Lol I guess was then good for your blood pressure that you did not check their hosts couple of weeks ago ;)
ID: 2046910 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046911 - Posted: 25 Apr 2020, 13:04:07 UTC - in response to Message 2046908.  

What is the point of crunching work on a super fast GPU that has already been VALIDATED days ago by 2 or 3 slower hosts ?
The host returns them when it returns them. Do you really expect me to click through all the tasks on the web site one by one to figure out what task have a filled quorum and what tasks do not?


If this project was able to cancel Unnecessary, Unstarted Tasks then you wouldn't need to.
I can see that it is impossible to do that manually.

I was trying to be considerate of people's electrical bills, when I suggested that a few days ago.
Then the NEEDED results would be returned more quickly.
ID: 2046911 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2046916 - Posted: 25 Apr 2020, 13:57:51 UTC - in response to Message 2046911.  
Last modified: 25 Apr 2020, 14:54:53 UTC

Then the NEEDED results would be returned more quickly.

If that is what they rely need is simple. Send the task for the high reliable hosts (they have the list of them) and all work will be returned ASAP. But instead of that they decided to send several resends of the same WU for a random number of hosts.
If you look just the top 20 reliable hosts has the capacity of crunching around 100 K WU per day.
And Before the thread could be ignited. I'm not telling anything elitists, just saying if they want the WU returned fast they need to use this fasts & reliable hosts. Only 1 of the ian hosts could crunch >10 K per day.
I was talking just about crunching speed and a way to garante the last few WU to be returned fast so they could close the science DB for good, nothing else, no rants please.
Anyway the total number of not rely validated WU (those who are rely waiting for another wingmen) at this time must be very low maybe less than 300K.
ID: 2046916 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2046918 - Posted: 25 Apr 2020, 14:03:28 UTC - in response to Message 2046910.  
Last modified: 25 Apr 2020, 14:53:07 UTC

Lol I guess was then good for your blood pressure that you did not check their hosts couple of weeks ago ;)

I agree, sure not good for the blood pressure do that, only my host has > 130K WU 3 weeks ago (all served & returned BTW) and there where others who has a lot more. Not forgetting the anonymous who still has >40K Wu today! That sure could be a cause for a heart attack! LOL
ID: 2046918 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046965 - Posted: 25 Apr 2020, 19:58:34 UTC - in response to Message 2046730.  

At this rate soon they will pass the Results out in the field too. LOL


I think that will probably be at least 24 hours at the current rate.

Average turnaround times are now more than 7 days for both V8 and AstroPulse, they were below 7 days for the last few days.


"Results waiting for db purging" is now ahead of both "Results out in the field" and "Results returned and awaiting validation".

There are now probably between 2 and 3 In Progress tasks for each Workunit, with between 1 and 2 Tasks awaiting validation for each Workunit.
ID: 2046965 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14653
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2046966 - Posted: 25 Apr 2020, 20:06:49 UTC

I've got several like this one - looks like it could be the middle of May before that one's cooked.
ID: 2046966 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046970 - Posted: 25 Apr 2020, 20:22:32 UTC - in response to Message 2046966.  

I've got several like this one - looks like it could be the middle of May before that one's cooked.


Both the_0 and_3 hosts have contacted the servers today, although it looks like the 0 had a batch of ghosts.
I wouldn't hold out much hope for the _1
ID: 2046970 · Report as offensive     Reply Quote
Profile ML1
Volunteer moderator
Volunteer tester

Send message
Joined: 25 Nov 01
Posts: 20323
Credit: 7,508,002
RAC: 20
United Kingdom
Message 2046976 - Posted: 25 Apr 2020, 20:37:00 UTC - in response to Message 2046916.  
Last modified: 25 Apr 2020, 20:38:11 UTC

Then the NEEDED results would be returned more quickly.

If that is what they rely need is simple. Send the task for the high reliable hosts (they have the list of them) and all work will be returned ASAP. But instead of that they decided to send several resends of the same WU for a random number of hosts.
If you look just the top 20 reliable hosts has the capacity of crunching around 100 K WU per day...

Good comment.

Note that an important part of the experiment that is 's@h and boinc' is to test the feasibility of using "untrusted" computers in a distributed computer system. Part of that experiment requires distributing the tasks randomly between all the volunteer clients.

Ok, so that random distribution is skewed somewhat by the number of WUs that the fast systems have gobbled through...

Another aspect is that we have been processing WUs for over two decades. The research work will continue for some years yet. Hence there will be no need for any rush to finish the last few WUs.


All very good science.

And some great participant fun along the way!

Enjoy!!

Keep searchin',
Martin
See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 2046976 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2046983 - Posted: 25 Apr 2020, 20:51:31 UTC - in response to Message 2046976.  

Hence there will be no need for any rush to finish the last few WUs.

If was in the past i say you are right, but that is not what as posted by Eric on the FB.
Unless i understand something wrong they ask to return the remaining WU ASAP to close the Science DB.
That is one of the reasons why they are generating extra wingmen for all the reaming WU's.
ID: 2046983 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046995 - Posted: 25 Apr 2020, 21:22:29 UTC
Last modified: 25 Apr 2020, 21:29:23 UTC

ID: 2046995 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2047009 - Posted: 25 Apr 2020, 23:09:48 UTC - in response to Message 2046995.  

https://www.facebook.com/groups/317689381579505/permalink/3463142420367503/

Eric's reply is 5th or 6th down.


. . That link only shows the original post by Mr Cook. No comments are viewable including Eric's :(

Stephen

:(
ID: 2047009 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2047010 - Posted: 25 Apr 2020, 23:10:59 UTC - in response to Message 2047009.  
Last modified: 25 Apr 2020, 23:26:38 UTC

https://www.facebook.com/groups/317689381579505/permalink/3463142420367503/

Eric's reply is 5th or 6th down.


. . That link only shows the original post by Mr Cook. No comments are viewable including Eric's :(

Stephen

:(

This is his post:

Eric Korpela We're resending a lot these days to try to get the outstanding ones validated and into the science database so we have a "final complete set".

You need to click where you see 9 Commentaries to see the rest.

<edit> @Stephen
Finally we have the same number of WU In progress.... zero! (0). LOL
ID: 2047010 · Report as offensive     Reply Quote
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 2047012 - Posted: 25 Apr 2020, 23:14:30 UTC - in response to Message 2047010.  

It might not load comments if you’re not a Facebook user. Just a guess.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 2047012 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2047013 - Posted: 25 Apr 2020, 23:15:53 UTC - in response to Message 2047012.  
Last modified: 25 Apr 2020, 23:17:13 UTC

It might not load comments if you’re not a Facebook user. Just a guess.

Strange. Why i could see here? I'm not a FB user. Not even have one FB account AFAIK. LOL
ID: 2047013 · Report as offensive     Reply Quote
Previous · 1 . . . 91 · 92 · 93 · 94 · 95 · 96 · 97 . . . 107 · Next

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.