The Server Issues / Outages Thread - Panic Mode On! (119)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 49 · 50 · 51 · 52 · 53 · 54 · 55 . . . 107 · Next

AuthorMessage
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2041540 - Posted: 30 Mar 2020, 7:47:32 UTC
Last modified: 30 Mar 2020, 7:47:43 UTC

My slower machine has recovered full cache but the faster one is just barely not depleting its cache.

It's receiving new tasks at a steady pace but all tasks split from 31mr11ai are shorties or noise bombs, so it is also returning results at way higher than normal rate.
ID: 2041540 · Report as offensive     Reply Quote
Cherokee150

Send message
Joined: 11 Nov 99
Posts: 192
Credit: 58,513,758
RAC: 74
United States
Message 2041545 - Posted: 30 Mar 2020, 8:07:23 UTC

Shortly after 07:00 GMT (midnight, Berkeley time) my computer 6156281 started getting the following on requests for work:

3/30/2020 2:36:18 | SETI@home | Sending scheduler request: To fetch work.
3/30/2020 2:36:18 | SETI@home | Requesting new tasks for CPU and NVIDIA GPU
3/30/2020 2:36:19 | SETI@home | Scheduler request completed: got 0 new tasks
3/30/2020 2:36:19 | SETI@home | No tasks sent
3/30/2020 2:36:19 | SETI@home | No tasks are available for AstroPulse v7
3/30/2020 2:36:19 | SETI@home | No tasks are available for SETI@home v8
3/30/2020 2:36:19 | SETI@home | This computer has finished a daily quota of 3 tasks
3/30/2020 2:36:19 | SETI@home | This computer has reached a limit on tasks in progress


Normally my limit is very high, and I can usually cache 150 each of CPU and GPU units.
Does anyone know what is causing this, and how I can fix it?
Thanks!
ID: 2041545 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2041549 - Posted: 30 Mar 2020, 8:29:53 UTC - in response to Message 2041545.  

It probably means that a number (a large number) of your tasks have failed recently. Normally, we'd say 'look at your account on this website', and the task lists for that computer. But that's going to be difficult at the moment. Have a look at that machine, see what it's processing at the moment, and have a search through the Event Log.
ID: 2041549 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2041550 - Posted: 30 Mar 2020, 8:31:39 UTC - in response to Message 2041545.  

Shortly after 07:00 GMT (midnight, Berkeley time) my computer 6156281 started getting the following on requests for work:
3/30/2020 2:36:19 | SETI@home | This computer has finished a daily quota of 3 tasks
3/30/2020 2:36:19 | SETI@home | This computer has reached a limit on tasks in progress
Normally my limit is very high, and I can usually cache 150 each of CPU and GPU units.
Does anyone know what is causing this, and how I can fix it?
Thanks!


. . There has been a glitch and the schedulers have defaulted your rig. Just keep processing the work it gives you and returning valid results and it will gradually restore your previous limits. But it will probably take a few hours.

Stephen
ID: 2041550 · Report as offensive     Reply Quote
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 2041551 - Posted: 30 Mar 2020, 8:33:11 UTC - in response to Message 2041545.  

Shortly after 07:00 GMT (midnight, Berkeley time) my computer 6156281 started getting the following on requests for work:

3/30/2020 2:36:18 | SETI@home | Sending scheduler request: To fetch work.
3/30/2020 2:36:18 | SETI@home | Requesting new tasks for CPU and NVIDIA GPU
3/30/2020 2:36:19 | SETI@home | Scheduler request completed: got 0 new tasks
3/30/2020 2:36:19 | SETI@home | No tasks sent
3/30/2020 2:36:19 | SETI@home | No tasks are available for AstroPulse v7
3/30/2020 2:36:19 | SETI@home | No tasks are available for SETI@home v8
3/30/2020 2:36:19 | SETI@home | This computer has finished a daily quota of 3 tasks
3/30/2020 2:36:19 | SETI@home | This computer has reached a limit on tasks in progress


Normally my limit is very high, and I can usually cache 150 each of CPU and GPU units.
Does anyone know what is causing this, and how I can fix it?
Thanks!
It's a bit hard to say for sure with the replica being 4.6 days behind, but it very likely started to dump errored work.

A restart will likely clear it unless you just recently upgraded your GPU drivers and if you did then roll them back to a working driver.

Cheers.
ID: 2041551 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13731
Credit: 208,696,464
RAC: 304
Australia
Message 2041552 - Posted: 30 Mar 2020, 8:36:31 UTC - in response to Message 2041545.  
Last modified: 30 Mar 2020, 8:56:58 UTC

Does anyone know what is causing this, and how I can fix it?
Thanks!
Not really, as the Replica is several days behind so we can't see the results you are producing.
But looking at the application details for that system,
Max tasks per day	3
Number of tasks today	173
Consecutive valid tasks	0

For some reason, it's been producing errors, and that's why it's now limited to the amount of work it can get. As work Validates, you'll be able to get more, but as further errors occur, that will be cut again.
And with the Replica so far behind, and noting showing presently, there's no way to see that has been going on.

You could check to make sure the CPU fan is running, the heatsink isn't full of dust, reboot the system
Then check in the BOINC Manager any CPU WUs that do download & run & keep an eye on their progress.


EDIT- It looks like you've been rescheduling? If so, you've probably been doing it so much it's resulted in CPU WU time estimates being all screwed up not having enough time to be processed & erroring out.

Edit- yep every CPU WU i checked on in what the Replica is showing has been rescheduled to the GPU.
You broke it.
Grant
Darwin NT
ID: 2041552 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 29 May 99
Posts: 4070
Credit: 85,281,665
RAC: 126
Finland
Message 2041562 - Posted: 30 Mar 2020, 9:37:32 UTC

Has the allowed number of tasks per resource been reduced back to the 100 / CPU and GPU? I have two hosts with 1 CPU + 2 GPUs and both have now 300 tasks each.
ID: 2041562 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2041564 - Posted: 30 Mar 2020, 10:00:44 UTC - in response to Message 2041562.  

Has the allowed number of tasks per resource been reduced back to the 100 / CPU and GPU? I have two hosts with 1 CPU + 2 GPUs and both have now 300 tasks each.
Don't think so. One of my machines with 2 x GPU and CPU work disallowed (the CPUs do other things) has 268 tasks at the moment. Hasn't got any new work recently, but doesn't get the message about 'reached a limit of tasks in progress'.
ID: 2041564 · Report as offensive     Reply Quote
AllgoodGuy

Send message
Joined: 29 May 01
Posts: 293
Credit: 16,348,499
RAC: 266
United States
Message 2041579 - Posted: 30 Mar 2020, 11:30:24 UTC - in response to Message 2041564.  
Last modified: 30 Mar 2020, 11:38:01 UTC

Has the allowed number of tasks per resource been reduced back to the 100 / CPU and GPU? I have two hosts with 1 CPU + 2 GPUs and both have now 300 tasks each.
Don't think so. One of my machines with 2 x GPU and CPU work disallowed (the CPUs do other things) has 268 tasks at the moment. Hasn't got any new work recently, but doesn't get the message about 'reached a limit of tasks in progress'.

I'm not seeing anything like that across my three machines either. numbers just don't add up at all.
ID: 2041579 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2041597 - Posted: 30 Mar 2020, 13:00:13 UTC - in response to Message 2041579.  

I've now got two machines at precisely the 300 limit for two GPUs.
ID: 2041597 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2041599 - Posted: 30 Mar 2020, 13:15:00 UTC - in response to Message 2041562.  

Has the allowed number of tasks per resource been reduced back to the 100 / CPU and GPU? I have two hosts with 1 CPU + 2 GPUs and both have now 300 tasks each.


. . Same as Richard, I am still getting the 150 per device. When I can get them. I am still getting frequent no tasks messages so the caches are somewhat less than full, but over the 100 mark.

Stephen

:(
ID: 2041599 · Report as offensive     Reply Quote
AllgoodGuy

Send message
Joined: 29 May 01
Posts: 293
Credit: 16,348,499
RAC: 266
United States
Message 2041609 - Posted: 30 Mar 2020, 13:56:40 UTC - in response to Message 2041599.  

Has the allowed number of tasks per resource been reduced back to the 100 / CPU and GPU? I have two hosts with 1 CPU + 2 GPUs and both have now 300 tasks each.


. . Same as Richard, I am still getting the 150 per device. When I can get them. I am still getting frequent no tasks messages so the caches are somewhat less than full, but over the 100 mark.

Stephen

:(

I have my CPU machines eat 170 and 166. My other at 239 with two eGPUs.
ID: 2041609 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 29 May 99
Posts: 4070
Credit: 85,281,665
RAC: 126
Finland
Message 2041610 - Posted: 30 Mar 2020, 14:05:22 UTC - in response to Message 2041564.  

OK, must have been just a coincidence. Now both my machines are under the 300.
ID: 2041610 · Report as offensive     Reply Quote
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2041624 - Posted: 30 Mar 2020, 15:24:27 UTC

Assimilators have made some progress. Not enough to make any meaningful difference but the queue is now below 7 million wus.
ID: 2041624 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2041626 - Posted: 30 Mar 2020, 15:27:43 UTC - in response to Message 2041624.  
Last modified: 30 Mar 2020, 15:29:59 UTC

Assimilators have made some progress. Not enough to make any meaningful difference but the queue is now below 7 million wus.

Few hosts will continue to work after March 31 but at the end even them will deplete their lasts WUs.
And then S@h will end in a cold & darkness eternity as expected.
ID: 2041626 · Report as offensive     Reply Quote
AllgoodGuy

Send message
Joined: 29 May 01
Posts: 293
Credit: 16,348,499
RAC: 266
United States
Message 2041627 - Posted: 30 Mar 2020, 15:33:00 UTC - in response to Message 2041626.  

Assimilators have made some progress. Not enough to make any meaningful difference but the queue is now below 7 million wus.

Few hosts will continue to work after March 31 but at the end even them will deplete their lasts WUs.
And then S@h will end in a cold & darkness eternity as expected.

I'll take SOME progress over none.
30-Mar-2020 08:01:39 [SETI@home] Scheduler request completed: got 0 new tasks
30-Mar-2020 08:06:48 [SETI@home] Scheduler request completed: got 1 new tasks
30-Mar-2020 08:11:59 [SETI@home] Scheduler request completed: got 0 new tasks
30-Mar-2020 08:25:35 [SETI@home] Scheduler request completed: got 47 new tasks
30-Mar-2020 08:30:40 [SETI@home] Scheduler request completed: got 18 new tasks
ID: 2041627 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2041628 - Posted: 30 Mar 2020, 15:35:10 UTC - in response to Message 2041627.  
Last modified: 30 Mar 2020, 15:36:04 UTC

Assimilators have made some progress. Not enough to make any meaningful difference but the queue is now below 7 million wus.

Few hosts will continue to work after March 31 but at the end even them will deplete their lasts WUs.
And then S@h will end in a cold & darkness eternity as expected.

I'll take SOME progress over none.
30-Mar-2020 08:01:39 [SETI@home] Scheduler request completed: got 0 new tasks
30-Mar-2020 08:06:48 [SETI@home] Scheduler request completed: got 1 new tasks
30-Mar-2020 08:11:59 [SETI@home] Scheduler request completed: got 0 new tasks
30-Mar-2020 08:25:35 [SETI@home] Scheduler request completed: got 47 new tasks
30-Mar-2020 08:30:40 [SETI@home] Scheduler request completed: got 18 new tasks

You are lucky, seems like my host is on a dark cloud of black matter heavily charged with black energy (did that exist?), nothing new here as usual.
ID: 2041628 · Report as offensive     Reply Quote
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2041634 - Posted: 30 Mar 2020, 15:51:00 UTC

My slower computer is getting nice amount of stuff every now and then, The faster cruncher gets almost nothing. I'm starting to wonder if I should use the slower computer to 'farm' tasks and then transfer them over to the faster computer...
ID: 2041634 · Report as offensive     Reply Quote
AllgoodGuy

Send message
Joined: 29 May 01
Posts: 293
Credit: 16,348,499
RAC: 266
United States
Message 2041643 - Posted: 30 Mar 2020, 16:06:46 UTC - in response to Message 2041634.  
Last modified: 30 Mar 2020, 16:07:27 UTC

My slower computer is getting nice amount of stuff every now and then, The faster cruncher gets almost nothing. I'm starting to wonder if I should use the slower computer to 'farm' tasks and then transfer them over to the faster computer...

I used to do that with classic SETI, but I've been told it doesn't work with this one.
ID: 2041643 · Report as offensive     Reply Quote
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2041720 - Posted: 30 Mar 2020, 20:21:11 UTC
Last modified: 30 Mar 2020, 20:40:02 UTC

Seems like something else breaks look this WU just received: https://setiathome.berkeley.edu/workunit.php?wuid=3829209762

Initial replication of 8?

There are a lot of them. What is to do? Suggestions?
ID: 2041720 · Report as offensive     Reply Quote
Previous · 1 . . . 49 · 50 · 51 · 52 · 53 · 54 · 55 . . . 107 · Next

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (119)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.