The Server Issues / Outages Thread - Panic Mode On! (119)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 90 · 91 · 92 · 93 · 94 · 95 · 96 . . . 107 · Next

AuthorMessage
Profile tullio
Volunteer tester

Send message
Joined: 9 Apr 04
Posts: 8797
Credit: 2,930,782
RAC: 1
Italy
Message 2046697 - Posted: 24 Apr 2020, 6:37:56 UTC

Science United reports for every project the CPU usage and the GPU usage, and the relative weight. Mostly CPU usage is greater tha GPU usage. There is only a project, GPUGRID where the reverse is true.
Tullio
ID: 2046697 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14653
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2046702 - Posted: 24 Apr 2020, 7:02:42 UTC - in response to Message 2046697.  

Science United will be reporting the BOINC figures - it has no independent source of data. The BOINC estimates of how much CPU support will be needed for a GPU application are notoriously unreliable: they take no account of the programming language used (which makes a huge difference), and they are not adjusted in the light of observed reality.
ID: 2046702 · Report as offensive     Reply Quote
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 2046712 - Posted: 24 Apr 2020, 12:21:31 UTC - in response to Message 2046630.  

found a Science United Account with 2.8K hosts and a lot of tasks in progress ...
Science United is a collection of thousands of different people that use that platform to crunch BOINC projects instead of manually attaching to projects. no one person is managing all of the computers on that account.
https://scienceunited.org/
It's a dumbed down way to run boinc for the social media generation. One step towards the society depicted in the movie 'Idiocracy'.


. . I recommend a Sci-Fi short story called 'The Marching Morons' by C.M.Kornbluth (also nicely subtle in the title 8-})

Stephen

:)


+1 I have always liked that story :)

Tom M
A proud member of the OFA (Old Farts Association).
ID: 2046712 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046717 - Posted: 24 Apr 2020, 12:45:20 UTC

"Results waiting for db purging" may exceed " Results returned and awaiting validation" in the next few hours !
ID: 2046717 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2046719 - Posted: 24 Apr 2020, 12:49:53 UTC - in response to Message 2046717.  

At this rate soon they will pass the Results out in the field too. LOL
ID: 2046719 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046730 - Posted: 24 Apr 2020, 13:19:36 UTC - in response to Message 2046719.  

At this rate soon they will pass the Results out in the field too. LOL


I think that will probably be at least 24 hours at the current rate.

Average turnaround times are now more than 7 days for both V8 and AstroPulse, they were below 7 days for the last few days.
ID: 2046730 · Report as offensive     Reply Quote
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 2046749 - Posted: 24 Apr 2020, 14:45:00 UTC

Will they send out one last wave of forced resends? I still have Validation inconclusive WUs that need a third wingman
ID: 2046749 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2046755 - Posted: 24 Apr 2020, 14:59:24 UTC - in response to Message 2046730.  

At this rate soon they will pass the Results out in the field too. LOL


I think that will probably be at least 24 hours at the current rate.

Average turnaround times are now more than 7 days for both V8 and AstroPulse, they were below 7 days for the last few days.


. . That was only because the resends were mostly being processed pretty much immediately.

Stephen

. . .
ID: 2046755 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2046758 - Posted: 24 Apr 2020, 15:33:14 UTC - in response to Message 2046749.  

Will they send out one last wave of forced resends? I still have Validation inconclusive WUs that need a third wingman


I think at this stage, it might be more effective to Cancel unstarted Tasks for Workunits that are Already Validated.
There are probably now more than 100,000 Tasks still In Progress that are not needed.

This might help more quickly than just sending out more
The ones In the field will probably be returned eventually, but they might be waiting for their Hosts to process unnecessary duplicates first !
ID: 2046758 · Report as offensive     Reply Quote
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 2046760 - Posted: 24 Apr 2020, 15:48:12 UTC - in response to Message 2046758.  
Last modified: 24 Apr 2020, 15:49:29 UTC

Will they send out one last wave of forced resends? I still have Validation inconclusive WUs that need a third wingman


I think at this stage, it might be more effective to Cancel unstarted Tasks for Workunits that are Already Validated.
There are probably now more than 100,000 Tasks still In Progress that are not needed.

This might help more quickly than just sending out more
The ones In the field will probably be returned eventually, but they might be waiting for their Hosts to process unnecessary duplicates first !


That is an interesting idea. I manually went through my list of WUs and prioritized the ones that didn't already have a matching valid, but it was effort on my part. Most people don't do anything but let boinc run. I'm now just crunching WUs that already have matching valids.

edit: secondary projects with shorter deadlines might also be delaying seti WUs
ID: 2046760 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2046799 - Posted: 24 Apr 2020, 19:59:30 UTC - in response to Message 2046760.  

That is an interesting idea. I manually went through my list of WUs and prioritized the ones that didn't already have a matching valid, but it was effort on my part. Most people don't do anything but let boinc run. I'm now just crunching WUs that already have matching valids.
edit: secondary projects with shorter deadlines might also be delaying seti WUs

. . That is why I have my secondary project set to resource 0%. It mean there is very little waiting to run S@H work if any turns up. But since there is none that doesn't make much difference ...

Stephen

. .
ID: 2046799 · Report as offensive     Reply Quote
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 2046807 - Posted: 24 Apr 2020, 20:57:01 UTC - in response to Message 2046712.  
Last modified: 24 Apr 2020, 20:57:29 UTC

. . I recommend a Sci-Fi short story called 'The Marching Morons' by C.M.Kornbluth (also nicely subtle in the title 8-})

Stephen

:)


+1 I have always liked that story :)

Tom M

+1
Nice to see it's online now for free reading.
ID: 2046807 · Report as offensive     Reply Quote
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 2046808 - Posted: 24 Apr 2020, 20:59:14 UTC - in response to Message 2046760.  
Last modified: 24 Apr 2020, 21:10:27 UTC

I'm now just crunching WUs that already have matching valids.

I'd simply abort them, on point wasting time on them.

I'd actually expect aborting such tasks by the server as part of "nice" shutdown, wasting other people's electricity and eventually slowing down science done by other BOINC projects by generating data for dev/nul isn't what I imagine as nice way to finish the project.

But somehow I've expected it, so I didn't even try to cache many WUs, just finnished what I had and moved to Einstein, there are more than enough machines who can crunch ready what's left. Perhaps even too many and that leads to the current situation...
ID: 2046808 · Report as offensive     Reply Quote
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2046815 - Posted: 24 Apr 2020, 21:42:13 UTC - in response to Message 2046808.  

Perhaps even too many and that leads to the current situation...
The problem is that Boinc prefers handing any new tasks to those hosts that already have work.

Hosts that have work do scheduler requests as often as the cooldown allows. Empty ones end up having several hour backoffs between requests. So the empty hosts that could return the results almost immediately never get them.
ID: 2046815 · Report as offensive     Reply Quote
Wild6-NJ
Volunteer tester

Send message
Joined: 4 Aug 99
Posts: 43
Credit: 100,336,791
RAC: 140
Message 2046844 - Posted: 24 Apr 2020, 23:27:49 UTC - in response to Message 2046553.  

It's a dumbed down way to run boinc for the social media generation. One step towards the society depicted in the movie 'Idiocracy'.


As time passes, Idiocracy is looking more like a documentary. [:o(
ID: 2046844 · Report as offensive     Reply Quote
Profile tullio
Volunteer tester

Send message
Joined: 9 Apr 04
Posts: 8797
Credit: 2,930,782
RAC: 1
Italy
Message 2046847 - Posted: 25 Apr 2020, 0:00:26 UTC

David Anderson has updated the list of the 100 top users. Most of them run Collatz, so the list does note reflect the real position of users, like me, who do not Collatz- But I am only a dumb user, those running Collatz are geniuses.
Tullio
ID: 2046847 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2046856 - Posted: 25 Apr 2020, 1:00:58 UTC - in response to Message 2046815.  

The problem is that Boinc prefers handing any new tasks to those hosts that already have work.
Hosts that have work do scheduler requests as often as the cooldown allows. Empty ones end up having several hour backoffs between requests. So the empty hosts that could return the results almost immediately never get them.

. . It isn't just that, I am running a batch file to hit the server at the minimum backoff period every time so NO prolonged backoffs, but I get ZERO resends. It is very much a lottery, a matter of hitting the server at just the right moment within windows that are/were very short. Some get lucky (Wiggo) and some don't (me). BTW, I bought that scratchie .. as expected nothing :( Your turn when you 'get into town' :)

Stephen

:)
ID: 2046856 · Report as offensive     Reply Quote
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34872
Credit: 261,360,520
RAC: 489
Australia
Message 2046864 - Posted: 25 Apr 2020, 1:48:34 UTC

. . It isn't just that, I am running a batch file to hit the server at the minimum backoff period every time so NO prolonged backoffs, but I get ZERO resends.
Now that may have been your problem Stephen as I was just letting it do its own thing. ;-)

Cheers.
ID: 2046864 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 2046876 - Posted: 25 Apr 2020, 3:12:13 UTC - in response to Message 2046625.  
Last modified: 25 Apr 2020, 3:13:14 UTC

April 22 0:40 UTC
I've still got 259 Pendings and 342 Inconclusives
April 22 23:20
My Pendings are down to 145 and Inconclusives 237.
April 23 22:21
65 Pendings, 145 Inconclusives.
25 Pendings, 101 Inconclusives.
Grant
Darwin NT
ID: 2046876 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2046884 - Posted: 25 Apr 2020, 6:06:29 UTC - in response to Message 2046864.  

. . It isn't just that, I am running a batch file to hit the server at the minimum backoff period every time so NO prolonged backoffs, but I get ZERO resends.
Now that may have been your problem Stephen as I was just letting it do its own thing. ;-)

Cheers.


. . But then as Ville says (or was it Siran), it simply stops polling for work when you have none to return. I was letting it do that but it was only polling a few times a day, it was ridiculous.

. . Either way ... no work :(

Stephen

:(
ID: 2046884 · Report as offensive     Reply Quote
Previous · 1 . . . 90 · 91 · 92 · 93 · 94 · 95 · 96 . . . 107 · Next

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.