The Server Issues / Outages Thread - Panic Mode On! (118)

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 44 · 45 · 46 · 47 · 48 · 49 · 50 . . . 94 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028848 - Posted: 23 Jan 2020, 22:19:28 UTC - in response to Message 2028843.  

Got 16 on one host that went to immediate download backoff. No luck in budging them through.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028848 · Report as offensive
andivb

Send message
Joined: 9 Aug 09
Posts: 7
Credit: 14,510,909
RAC: 2
United States
Message 2028855 - Posted: 23 Jan 2020, 22:54:24 UTC

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...
ID: 2028855 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028856 - Posted: 23 Jan 2020, 22:57:43 UTC - in response to Message 2028855.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

100 other hosts beat you to the RTS buffer first.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028856 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2028857 - Posted: 23 Jan 2020, 23:05:53 UTC - in response to Message 2028856.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

100 other hosts beat you to the RTS buffer first.


. . :)

. . There are clearly still issues to be worked out ... :(

. . Unless those 100 hosts got 8,000 WUs each :)

Stephen

? ?
ID: 2028857 · Report as offensive
Ville Saari
Avatar

Send message
Joined: 30 Nov 00
Posts: 1158
Credit: 49,177,052
RAC: 82,530
Finland
Message 2028858 - Posted: 23 Jan 2020, 23:07:31 UTC

I got 'no tasks available' again and again. Then I though maybe those 100 other hosts are hitting the server at the same five minute frequency as my host but so that my host always hits it just after them. So for the next update I disabled network activity before the timer reached zero. And then when it had zeroed, I waited another 10 seconds before enabling network activity. The client requested stuff right after I enabled it and immediately got a bunch of tasks!

But I guess this was just a coincidence because all the requests after that have resulted 'no tasks available' despite this phase shift.
ID: 2028858 · Report as offensive
andivb

Send message
Joined: 9 Aug 09
Posts: 7
Credit: 14,510,909
RAC: 2
United States
Message 2028864 - Posted: 23 Jan 2020, 23:58:18 UTC - in response to Message 2028855.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

For the record: the 770k was "0m" ago. Something is broken.
Btw, is there some kind of bug tracker somewhere where the issues are tracked and addressed?
ID: 2028864 · Report as offensive
Tagged

Send message
Joined: 6 Oct 02
Posts: 2
Credit: 105,573,486
RAC: 241
United States
Message 2028867 - Posted: 24 Jan 2020, 0:06:36 UTC - in response to Message 2028864.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

For the record: the 770k was "0m" ago. Something is broken.
Btw, is there some kind of bug tracker somewhere where the issues are tracked and addressed?


Nothing is broken .. there's just (tens of) thousands of machines hitting the server wanting units. RTS is now 330k. Truthfully, even if you did get WUs assigned, you probably couldn't download them due to those machines being overwhelmed also. This is very normal following long outages .. it will be back to normal in a day or so.
ID: 2028867 · Report as offensive
Profile HAL
Avatar

Send message
Joined: 18 May 99
Posts: 535
Credit: 8,246,955
RAC: 3
United States
Message 2028872 - Posted: 24 Jan 2020, 0:26:15 UTC - in response to Message 2028867.  

All systems here up and running - maybe 10% left "Downloading" :-)
I'm putting myself to the fullest possible use, which is all, I think, that any conscious entity can ever hope to do.
ID: 2028872 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 2028874 - Posted: 24 Jan 2020, 0:35:11 UTC
Last modified: 24 Jan 2020, 1:25:45 UTC

I had a lot of downloads pending when I got back from my "Dinner date" with a guy who is willing to buy me steak :)

So I hit the retry and there for a while I was getting "http: failed transient error" issues.

But as I write, the queue is now downloading......

Ah, Sweat Seti!

Tom

ps. I now have both gpu and cpu tasks running.... the cache is not even close to full but at least I got "some".
A proud member of the OFA (Old Farts Association).
ID: 2028874 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028877 - Posted: 24 Jan 2020, 0:46:27 UTC - in response to Message 2028857.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

100 other hosts beat you to the RTS buffer first.


. . :)

. . There are clearly still issues to be worked out ... :(

. . Unless those 100 hosts got 8,000 WUs each :)

Stephen

? ?

The actual buffer that you download from only holds about 250 tasks at any time. Constantly replenishes from the RTS buffer that shows on the SSP. If 5 hosts hit it at the same time and get 50 tasks, it empties very fast before refilling. If your request comes in after it was emptied, you get the "no tasks are available" message.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028877 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028879 - Posted: 24 Jan 2020, 0:49:03 UTC

Lots of stalled downloads on my hosts. The project will eventually clear the congestion and hosts will get normal returns and requests fulfilled.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028879 · Report as offensive
Niteryder
Volunteer tester

Send message
Joined: 1 Mar 99
Posts: 64
Credit: 22,663,988
RAC: 18
United States
Message 2028880 - Posted: 24 Jan 2020, 0:51:30 UTC - in response to Message 2028879.  

+1
No downloads are going through.
ID: 2028880 · Report as offensive
Profile Freewill Project Donor
Avatar

Send message
Joined: 19 May 99
Posts: 766
Credit: 354,398,348
RAC: 11,693
United States
Message 2028887 - Posted: 24 Jan 2020, 1:24:17 UTC - in response to Message 2028877.  

Getting "No tasks available" with 770k ready to send on the server. I gave up trying to understand the logic behind this...

100 other hosts beat you to the RTS buffer first.


. . :)

. . There are clearly still issues to be worked out ... :(

. . Unless those 100 hosts got 8,000 WUs each :)

Stephen

? ?

The actual buffer that you download from only holds about 250 tasks at any time. Constantly replenishes from the RTS buffer that shows on the SSP. If 5 hosts hit it at the same time and get 50 tasks, it empties very fast before refilling. If your request comes in after it was emptied, you get the "no tasks are available" message.

I don't understand why they don't make the 250 task buffer larger. Seems to be a bottleneck.
Can't get enough jobs to keep my top machine's GPUs busy. Why does it keep giving that one CPU tasks?
ID: 2028887 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65773
Credit: 55,293,173
RAC: 49
United States
Message 2028891 - Posted: 24 Jan 2020, 1:38:32 UTC

I got 34 sogs. All hail the sogs, My 970's ought to love them.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 2028891 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 2028892 - Posted: 24 Jan 2020, 1:40:02 UTC - in response to Message 2028887.  


I don't understand why they don't make the 250 task buffer larger. Seems to be a bottleneck.
Can't get enough jobs to keep my top machine's GPUs busy. Why does it keep giving that one CPU tasks?


That buffer might be in Ram. Until they have SSD's installed, it might be "too slow" to use a regular HD.

Tom
A proud member of the OFA (Old Farts Association).
ID: 2028892 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028894 - Posted: 24 Jan 2020, 1:43:40 UTC - in response to Message 2028887.  

I don't understand why they don't make the 250 task buffer larger. Seems to be a bottleneck.
Can't get enough jobs to keep my top machine's GPUs busy. Why does it keep giving that one CPU tasks?

Don't know, probably something to do with the I/O to the buffer and how many concurrent http connections the download server can handle. Don't know enough about server hardware to make a better guess.

Depends on the host. Most of my hosts will ALWAYS fill the gpu cache before ever asking to fill the cpu cache. On one host though, it gets cpu work first and then fills the gpu cache. Something to do with the APR of the applications and what the scheduler thinks is the faster bit of hardware.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028894 · Report as offensive
Profile Freewill Project Donor
Avatar

Send message
Joined: 19 May 99
Posts: 766
Credit: 354,398,348
RAC: 11,693
United States
Message 2028908 - Posted: 24 Jan 2020, 2:30:02 UTC - in response to Message 2028894.  

I'm pretty sure my (idle) 2070 Supers are faster than the i7-5820K. ;)
ID: 2028908 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2028909 - Posted: 24 Jan 2020, 2:30:39 UTC - in response to Message 2028877.  

The actual buffer that you download from only holds about 250 tasks at any time. Constantly replenishes from the RTS buffer that shows on the SSP. If 5 hosts hit it at the same time and get 50 tasks, it empties very fast before refilling. If your request comes in after it was emptied, you get the "no tasks are available" message.


. . True, but I liked the imagery and I am sure there are still some knots to be unravelled.

Stephen

:(
ID: 2028909 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2028916 - Posted: 24 Jan 2020, 3:01:38 UTC - in response to Message 2028908.  

I'm pretty sure my (idle) 2070 Supers are faster than the i7-5820K. ;)

But I never suggested the BOINC client code and the scheduler code was logical. It is way more convoluted and I don't think anyone that has ever looked at it would state it works the same way, every time under all circumstances.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2028916 · Report as offensive
Kevin Olley

Send message
Joined: 3 Aug 99
Posts: 906
Credit: 261,085,289
RAC: 572
United Kingdom
Message 2028925 - Posted: 24 Jan 2020, 5:28:49 UTC - in response to Message 2028585.  


And my Einstein@Home RAC has topped 80,000. That is way to much for a "backup" project :)

Tom


Yes, Well, Wait until you find you are climbing up their top 50 computers list.

Looks like I am on the bottom of the queue when they hand out tasks here:-(


And it blew past 100,000 RAC too.

Tom


ATM you are No 30 on E@H top computers list wit an RAC of over 1,017,000

I have just cracked No 25 :-)

Just got a reasonable bunch of Seti WU's with a fair No of CPU's, Hopefully the cache will refill properly now.
Kevin


ID: 2028925 · Report as offensive
Previous · 1 . . . 44 · 45 · 46 · 47 · 48 · 49 · 50 . . . 94 · Next

Message boards : Number crunching : The Server Issues / Outages Thread - Panic Mode On! (118)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.